Mor Geva

48 papers · 2018–2026 · 8 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🌍 Conference Polyglot (8) 🏃 Academic Marathon (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (4)

🐝 Cross-Pollinator (4) 🌈 Renaissance Researcher (5) 🗺️ Taxonomy Completionist (65) 🏠 Conference Loyalist (20) 🔬 Deep Specialist (24) 🏆 Keyword Champion (3) 🤝 Dynamic Duo (11) 🔥 Unstoppable (8) 🗃️ Keyword Collector (196) 💎 Century Club (46) ❓ The Questioner (8) ⚡ Prolific Year (10)

Conferences

EMNLP (20) ACL (15) EACL (4) COLING (2) ICLR (2) ICML (2) NAACL (2) IJCNLP (1)

Top co-authors

Jonathan Berant (11) Yoav Goldberg (5) Amir Globerson (5) Daniela Gottesman (5) Roee Aharoni (4) Sohee Yang (4) Avi Caciularu (4) Nora Kassner (3) Sebastian Riedel (3) Ankit Gupta (3)

Keywords

language model (9) large language model (7) transformer model (6) question answering (6) representation learning (6) model interpretability (5) transfer learning (5) multi-hop reasoning (3) hidden representation (3) knowledge retrieval (3) mechanistic interpretability (3) dataset bia (3) latent reasoning (3) commonsense reasoning (2) transformer language model (2) attention mechanism (2) model generalization (2) model alignment (2) in-context learning (2) natural language understanding (2)

Papers

Constructing Interpretable Features from Compositional Neuron Groups ACL 2026 Detecting (Un)answerability in Large Language Models with Linear Directions EACL 2026 How Well Can Reasoning Models Identify and Recover from Unhelpful Thoughts? EMNLP 2025 Intrinsic Test of Unlearning Using Parametric Knowledge Traces EMNLP 2025 Precise In-Parameter Concept Erasure in Large Language Models EMNLP 2025 Language Models Encode Numbers Using Digit Representations in Base 10 NAACL 2025 Enhancing Automated Interpretability with Output-Centric Feature Descriptions ACL 2025 Inferring Functionality of Attention Heads from their Parameters ACL 2025 Performance Gap in Entity Knowledge Extraction Across Modalities in Vision Language Models ACL 2025 Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts? ACL 2025 Eliciting Textual Descriptions from Representations of Continuous Prompts ACL 2025 Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas ICML 2025 Preventing Rogue Agents Improves Multi-Agent Collaboration ACL 2025 Towards Interpreting Visual Information Processing in Vision-Language Models ICLR 2025 RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations ACL 2024 A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains ACL 2024 The Hidden Space of Transformer Language Adapters ACL 2024 Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers ACL 2024 Do Large Language Models Latently Perform Multi-Hop Reasoning? ACL 2024 Jump to Conclusions: Short-Cutting Transformers with Linear Transformations COLING 2024 Backward Lens: Projecting Language Model Gradients into the Vocabulary Space EMNLP 2024 From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP EMNLP 2024 Estimating Knowledge in Large Language Models Without Generating a Single Token EMNLP 2024 Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? EMNLP 2024 Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries EMNLP 2024 The Hidden Language of Diffusion Models ICLR 2024 Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models ICML 2024 CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks EMNLP 2023 LM vs LM: Detecting Factual Errors via Cross Examination EMNLP 2023 In-Context Learning Creates Task Vectors EMNLP 2023 A Comprehensive Evaluation of Tool-Assisted Generation Strategies EMNLP 2023 Analyzing Transformers in Embedding Space ACL 2023 Crawling The Internal Knowledge-Base of Language Models EACL 2023 Don’t Blame the Annotator: Bias Already Starts in the Annotation Instructions EACL 2023 Complex Reasoning in Natural Language ACL 2023 Understanding Transformer Memorization Recall Through Idioms EACL 2023 Dissecting Recall of Factual Associations in Auto-Regressive Language Models EMNLP 2023 SCROLLS: Standardized CompaRison Over Long Language Sequences EMNLP 2022 Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space EMNLP 2022 LM-Debugger: An Interactive Tool for Inspection and Intervention in Transformer-Based Language Models EMNLP 2022 Inferring Implicit Relations in Complex Questions with Language Models EMNLP 2022 What’s in Your Head? Emergent Behaviour in Multi-Task Transformer Models EMNLP 2021 Transformer Feed-Forward Layers Are Key-Value Memories EMNLP 2021 Injecting Numerical Reasoning Skills into Language Models ACL 2020 Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets IJCNLP 2019 DiscoFuse: A Large-Scale Dataset for Discourse-Based Sentence Fusion NAACL 2019 Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets EMNLP 2019 Learning to Search in Long Documents Using Document Structure COLING 2018