Hannaneh Hajishirzi
155 papers · 2013–2025 · 13 top CS/AI conferences
Achievements
Keyword Pioneer
Hot Topic Early Bird
Interdisciplinary Bridge
Conference Polyglot (13)
Taxonomy Completionist (16)
Conference Loyalist (36)
Dynamic Duo (27)
Triple Crown
Grand Slam
Mega-Team (50)
Topic Pioneer
Deep Specialist (31)
Keyword Champion (2)
The Questioner (6)
Prolific Year (25)
Century Club (155)
Keyword Collector (529)
Trend Setter
Conference Pioneer
Unstoppable (13)
Conferences
EMNLP (48)
ACL (36)
NAACL (22)
ICLR (15)
NIPS (12)
CVPR (6)
ICML (5)
IJCNLP (5)
SEMEVAL (2)
AAAI (1)
ECCV (1)
ICCV (1)
INTERSPEECH (1)
Keywords
language model (27)
question answering (23)
zero-shot learning (12)
multi-task learning (10)
few-shot learning (9)
information retrieval (9)
in-context learning (8)
large language model (8)
relation extraction (7)
information extraction (7)
commonsense reasoning (6)
named entity recognition (6)
reinforcement learning (6)
transfer learning (6)
passage retrieval (5)
text generation (5)
instruction tuning (5)
open-domain question answering (5)
language modeling (5)
efficient computing (5)
Papers
A Systematic Examination of Preference Learning through the Lens of Instruction-Following
NAACL 2025
OLMES: A Standard for Language Model Evaluations
NAACL 2025
SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature
EMNLP 2025
ComPO: Community Preferences for Language Model Personalization
NAACL 2025
Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback
ACL 2025
Steering off Course: Reliability Challenges in Steering Language Models
ACL 2025
OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens
ACL 2025
Organize the Web: Constructing Domains Enhances Pre-Training Data Curation
ICML 2025
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025
s1: Simple test-time scaling
EMNLP 2025
Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index
EMNLP 2025
Answer, Assemble, Ace: Understanding How LMs Answer Multiple Choice Questions
ICLR 2025
OLMoE: Open Mixture-of-Experts Language Models
ICLR 2025
RewardBench: Evaluating Reward Models for Language Modeling
NAACL 2025
The Art of Saying No: Contextual Noncompliance in Language Models
NIPS 2024
Decoding-Time Language Model Alignment with Multiple Objectives
NIPS 2024
Data Engineering for Scaling Language Models to 128K Context
ICML 2024
APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference
ICML 2024
Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback
NIPS 2024
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
ICLR 2024
SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore
ICLR 2024
MatFormer: Nested Transformer for Elastic Inference
NIPS 2024
CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation
EMNLP 2024
Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging
EMNLP 2024
BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models
ICLR 2024
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts
ICLR 2024
BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer
NAACL 2024
ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition
NIPS 2024
Paloma: A Benchmark for Evaluating Language Model Fit
NIPS 2024
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research
ACL 2024
OLMo: Accelerating the Science of Language Models
ACL 2024
Set the Clock: Temporal Alignment of Pretrained Language Models
ACL 2024
What's In My Big Data?
ICLR 2024
FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation
EMNLP 2023
SHARCS: Efficient Transformers Through Routing with Dynamic Width Sub-networks
EMNLP 2023
Editing models with task arithmetic
ICLR 2023
AGRO: Adversarial discovery of error-prone Groups for Robust Optimization
ICLR 2023
Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization
ICLR 2023
DataComp: In search of the next generation of multimodal datasets
NIPS 2023
GenEval: An object-focused framework for evaluating text-to-image alignment
NIPS 2023
Fine-Grained Human Feedback Gives Better Rewards for Language Model Training
NIPS 2023
How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources
NIPS 2023
Elaboration-Generating Commonsense Question Answering at Scale
ACL 2023
Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations
ACL 2023
FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning
ACL 2023
When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories
ACL 2023
CREPE: Open-Domain Question Answering with False Presuppositions
ACL 2023
HINT: Hypernetwork Instruction Tuning for Efficient Zero- and Few-Shot Generalisation
ACL 2023
PuMer: Pruning and Merging Tokens for Efficient Vision Language Models
ACL 2023
Self-Instruct: Aligning Language Models with Self-Generated Instructions
ACL 2023
Nonparametric Masked Language Modeling
ACL 2023
Task-aware Retrieval with Instructions
ACL 2023
Data-Efficient Finetuning Using Cross-Task Nearest Neighbors
ACL 2023
Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling
ICML 2023
Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements
EMNLP 2023
TaskWeb: Selecting Better Source Tasks for Multi-task NLP
EMNLP 2023
Crystal: Introspective Reasoners Reinforced with Self-Feedback
EMNLP 2023
Machine Reading Comprehension using Case-based Reasoning
EMNLP 2023
Reframing Instructional Prompts to GPTk's Language
ACL 2022
Noisy Channel Language Model Prompting for Few-Shot Text Classification
ACL 2022
FaVIQ: FAct Verification from Information-seeking Questions
ACL 2022
Cross-Task Generalization via Natural Language Crowdsourcing Instructions
ACL 2022
ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts
EMNLP 2022
Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering
EMNLP 2022
Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling
EMNLP 2022
CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning
EMNLP 2022
Rethinking the Role of Demonstrations: What Makes In-Context Learning Work?
EMNLP 2022
CORE: A Retrieve-then-Edit Framework for Counterfactual Data Generation
EMNLP 2022
SciFact-Open: Towards open-domain scientific claim verification
EMNLP 2022
Robust Fine-Tuning of Zero-Shot Models
CVPR 2022
Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks
NAACL 2022
Generated Knowledge Prompting for Commonsense Reasoning
ACL 2022
MetaICL: Learning to Learn In Context
NAACL 2022
Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts
NAACL 2022
Patching open-vocabulary models by interpolating weights
NIPS 2022
MultiVerS: Improving scientific claim verification with weak supervision and full-document context
NAACL 2022
Aligning to Social Norms and Values in Interactive Narratives
NAACL 2022
NaturalProver: Grounded Mathematical Proof Generation with Language Models
NIPS 2022
Exploring The Landscape of Distributional Robustness for Question Answering Models
EMNLP 2022
Knowledge Base Question Answering by Case-based Reasoning over Subgraphs
ICML 2022
Retrieval Data Augmentation Informed by Downstream Question Answering Performance
ACL 2022
Extracting a Knowledge Base of Mechanisms from COVID-19 Papers
NAACL 2021
Prompting Contrastive Explanations for Commonsense Reasoning Tasks
IJCNLP 2021
Efficient Passage Retrieval with Hashing for Open-domain Question Answering
IJCNLP 2021
Prompting Contrastive Explanations for Commonsense Reasoning Tasks
ACL 2021
Efficient Passage Retrieval with Hashing for Open-domain Question Answering
ACL 2021
MultiModalQA: complex question answering over text, tables and images
ICLR 2021
A Controllable Model of Grounded Response Generation
AAAI 2021
DeLighT: Deep and Light-weight Transformer
ICLR 2021
DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization
EMNLP 2021
Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text
EMNLP 2021
Joint Passage Ranking for Diverse Multi-Answer Retrieval
EMNLP 2021
GooAQ: Open Question Answering with Diverse Answer Types
EMNLP 2021
Probing Across Time: What Does RoBERTa Know and When?
EMNLP 2021
Beyond Paragraphs: NLP for Long Sequences
NAACL 2021
Probing Contextual Language Models for Common Ground with Visual Representations
NAACL 2021
XOR QA: Cross-lingual Open-Retrieval Question Answering
NAACL 2021
Evaluating Models' Local Decision Boundaries via Contrast Sets
EMNLP 2020
Contextualized Sparse Representations for Real-Time Open-Domain Question Answering
ACL 2020
Logic-Guided Data Augmentation and Regularization for Consistent Question Answering
ACL 2020
SciREX: A Challenge Dataset for Document-Level Information Extraction
ACL 2020
ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured Webpages
ACL 2020
Multi-modal Information Extraction from Text, Semi-structured, and Tabular Data on the Web
ACL 2020
IIRC: A Dataset of Incomplete Information Reading Comprehension Questions
EMNLP 2020
An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction
EMNLP 2020
AmbigQA: Answering Ambiguous Open-domain Questions
EMNLP 2020
Fact or Fiction: Verifying Scientific Claims
EMNLP 2020
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers
EMNLP 2020
Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics
EMNLP 2020
UNIFIEDQA: Crossing Format Boundaries with a Single QA System
EMNLP 2020
MedICaT: A Dataset of Medical Images, Captions, and Textual References
EMNLP 2020
Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering
ICLR 2020
DeFINE: Deep Factorized Input Token Embeddings for Neural Sequence Modeling
ICLR 2020
Multi-hop Reading Comprehension through Question Decomposition and Rescoring
ACL 2019
ESPNetv2: A Light-Weight, Power Efficient, and General Purpose Convolutional Neural Network
CVPR 2019
A general framework for information extraction using dynamic span graphs
NAACL 2019
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms
NAACL 2019
Text Generation from Knowledge Graphs with Graph Transformers
NAACL 2019
A Discrete Hard EM Approach for Weakly Supervised Question Answering
EMNLP 2019
SemEval-2019 Task 10: Math Question Answering
SEMEVAL 2019
On Making Reading Comprehension More Comprehensive
EMNLP 2019
Entity, Relation, and Event Extraction with Contextualized Span Representations
EMNLP 2019
Mixture Content Selection for Diverse Sequence Generation
EMNLP 2019
Compositional Questions Do Not Necessitate Multi-hop Reasoning
ACL 2019
Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index
ACL 2019
A Discrete Hard EM Approach for Weakly Supervised Question Answering
IJCNLP 2019
Mixture Content Selection for Diverse Sequence Generation
IJCNLP 2019
Entity, Relation, and Event Extraction with Contextualized Span Representations
IJCNLP 2019
Neural Speed Reading via Skim-RNN
ICLR 2018
Standardized Tests as benchmarks for Artificial Intelligence
EMNLP 2018
Pyramidal Recurrent Unit for Language Modeling
EMNLP 2018
Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction
EMNLP 2018
Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension
EMNLP 2018
Semi-Supervised Event Extraction with Paraphrase Clusters
NAACL 2018
ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation
ECCV 2018
The UWNLP system at SemEval-2018 Task 7: Neural Relation Extraction Model with Selectively Incorporated Concept Embeddings
SEMEVAL 2018
Scientific Information Extraction with Semi-supervised Neural Tagging
EMNLP 2017
Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension
CVPR 2017
Question Answering through Transfer Learning from Large Fine-grained Supervision Data
ACL 2017
Learning Prototypical Event Structure from Photo Albums
ACL 2016
Disfluency Detection Using a Bidirectional LSTM
INTERSPEECH 2016
A Task-Oriented Approach for Cost-Sensitive Recognition
CVPR 2016
A Theme-Rewriting Approach for Generating Algebra Word Problems
EMNLP 2016
Multiplicative Representations for Unsupervised Semantic Role Induction
ACL 2016
MAWPS: A Math Word Problem Repository
NAACL 2016
Learning Knowledge Graphs for Question Answering through Conversational Dialog
NAACL 2015
Discriminative and Consistent Similarities in Instance-Level Multiple Instance Learning
CVPR 2015
Talking to the crowd: What do people react to in online discussions?
EMNLP 2015
Unediting: Detecting Disfluencies Without Careful Transcripts
NAACL 2015
Aligning Sentences from Standard Wikipedia to Simple Wikipedia
NAACL 2015
Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing
ICCV 2015
Solving Geometry Problems: Combining Text and Diagram Interpretation
EMNLP 2015
Learning to Solve Arithmetic Word Problems with Verb Categorization
EMNLP 2014
Multi-Resolution Language Grounding with Weak Supervision
EMNLP 2014
Joint Coreference Resolution and Named-Entity Linking with Multi-Pass Sieves
EMNLP 2013