Mor Geva
48 papers · 2018–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π Conference Polyglot (8) π Academic Marathon (7) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (4)
π
Cross-Pollinator
(4)
π
Renaissance Researcher
(5)
πΊοΈ
Taxonomy Completionist
(65)
π
Conference Loyalist
(20)
π¬
Deep Specialist
(24)
π
Keyword Champion
(3)
π€
Dynamic Duo
(11)
π₯
Unstoppable
(8)
ποΈ
Keyword Collector
(196)
π
Century Club
(46)
β
The Questioner
(8)
β‘
Prolific Year
(10)
Conferences
EMNLP (20)
ACL (15)
EACL (4)
COLING (2)
ICLR (2)
ICML (2)
NAACL (2)
IJCNLP (1)
Top co-authors
Keywords
language model
(9)
large language model
(7)
transformer model
(6)
question answering
(6)
representation learning
(6)
model interpretability
(5)
transfer learning
(5)
multi-hop reasoning
(3)
hidden representation
(3)
knowledge retrieval
(3)
mechanistic interpretability
(3)
dataset bia
(3)
latent reasoning
(3)
commonsense reasoning
(2)
transformer language model
(2)
attention mechanism
(2)
model generalization
(2)
model alignment
(2)
in-context learning
(2)
natural language understanding
(2)
Papers
Constructing Interpretable Features from Compositional Neuron Groups
ACL 2026
Detecting (Un)answerability in Large Language Models with Linear Directions
EACL 2026
How Well Can Reasoning Models Identify and Recover from Unhelpful Thoughts?
EMNLP 2025
Intrinsic Test of Unlearning Using Parametric Knowledge Traces
EMNLP 2025
Precise In-Parameter Concept Erasure in Large Language Models
EMNLP 2025
Language Models Encode Numbers Using Digit Representations in Base 10
NAACL 2025
Enhancing Automated Interpretability with Output-Centric Feature Descriptions
ACL 2025
Inferring Functionality of Attention Heads from their Parameters
ACL 2025
Performance Gap in Entity Knowledge Extraction Across Modalities in Vision Language Models
ACL 2025
Do Large Language Models Perform Latent Multi-Hop Reasoning without Exploiting Shortcuts?
ACL 2025
Eliciting Textual Descriptions from Representations of Continuous Prompts
ACL 2025
Why Is Spatial Reasoning Hard for VLMs? An Attention Mechanism Perspective on Focus Areas
ICML 2025
Preventing Rogue Agents Improves Multi-Agent Collaboration
ACL 2025
Towards Interpreting Visual Information Processing in Vision-Language Models
ICLR 2025
RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations
ACL 2024
A Chain-of-Thought Is as Strong as Its Weakest Link: A Benchmark for Verifiers of Reasoning Chains
ACL 2024
The Hidden Space of Transformer Language Adapters
ACL 2024
Narrowing the Knowledge Evaluation Gap: Open-Domain Question Answering with Multi-Granularity Answers
ACL 2024
Do Large Language Models Latently Perform Multi-Hop Reasoning?
ACL 2024
Jump to Conclusions: Short-Cutting Transformers with Linear Transformations
COLING 2024
Backward Lens: Projecting Language Model Gradients into the Vocabulary Space
EMNLP 2024
From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP
EMNLP 2024
Estimating Knowledge in Large Language Models Without Generating a Single Token
EMNLP 2024
Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words?
EMNLP 2024
Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries
EMNLP 2024
The Hidden Language of Diffusion Models
ICLR 2024
Patchscopes: A Unifying Framework for Inspecting Hidden Representations of Language Models
ICML 2024
CRoW: Benchmarking Commonsense Reasoning in Real-World Tasks
EMNLP 2023
LM vs LM: Detecting Factual Errors via Cross Examination
EMNLP 2023
In-Context Learning Creates Task Vectors
EMNLP 2023
A Comprehensive Evaluation of Tool-Assisted Generation Strategies
EMNLP 2023
Analyzing Transformers in Embedding Space
ACL 2023
Crawling The Internal Knowledge-Base of Language Models
EACL 2023
Donβt Blame the Annotator: Bias Already Starts in the Annotation Instructions
EACL 2023
Complex Reasoning in Natural Language
ACL 2023
Understanding Transformer Memorization Recall Through Idioms
EACL 2023
Dissecting Recall of Factual Associations in Auto-Regressive Language Models
EMNLP 2023
SCROLLS: Standardized CompaRison Over Long Language Sequences
EMNLP 2022
Transformer Feed-Forward Layers Build Predictions by Promoting Concepts in the Vocabulary Space
EMNLP 2022
LM-Debugger: An Interactive Tool for Inspection and Intervention in Transformer-Based Language Models
EMNLP 2022
Inferring Implicit Relations in Complex Questions with Language Models
EMNLP 2022
Whatβs in Your Head? Emergent Behaviour in Multi-Task Transformer Models
EMNLP 2021
Transformer Feed-Forward Layers Are Key-Value Memories
EMNLP 2021
Injecting Numerical Reasoning Skills into Language Models
ACL 2020
Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets
IJCNLP 2019
DiscoFuse: A Large-Scale Dataset for Discourse-Based Sentence Fusion
NAACL 2019
Are We Modeling the Task or the Annotator? An Investigation of Annotator Bias in Natural Language Understanding Datasets
EMNLP 2019
Learning to Search in Long Documents Using Document Structure
COLING 2018