conftrace_

Hannaneh Hajishirzi

155 papers · 2013–2025 · 13 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+18 more ↓ 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird πŸ—ΊοΈ Taxonomy Completionist (16) πŸŒ‰ Interdisciplinary Bridge 🌍 Conference Polyglot (13)
πŸŒ‰ Interdisciplinary Bridge 🌍 Conference Polyglot (13) πŸ—ΊοΈ Taxonomy Completionist (16) 🏠 Conference Loyalist (36) 🀝 Dynamic Duo (27) πŸ‘‘ Triple Crown πŸ† Grand Slam πŸ‘₯ Mega-Team (50) 🌱 Topic Pioneer πŸ”¬ Deep Specialist (31) πŸ† Keyword Champion (2) ❓ The Questioner (6) ⚑ Prolific Year (25) πŸ’Ž Century Club (155) πŸ—ƒοΈ Keyword Collector (529) πŸ“ˆ Trend Setter πŸš€ Conference Pioneer πŸ”₯ Unstoppable (13)

Conferences

EMNLP (48) ACL (36) NAACL (22) ICLR (15) NIPS (12) CVPR (6) ICML (5) IJCNLP (5) SEMEVAL (2) AAAI (1) ECCV (1) ICCV (1) INTERSPEECH (1)

Papers

A Systematic Examination of Preference Learning through the Lens of Instruction-Following NAACL 2025 OLMES: A Standard for Language Model Evaluations NAACL 2025 SciRIFF: A Resource to Enhance Language Model Instruction-Following over Scientific Literature EMNLP 2025 ComPO: Community Preferences for Language Model Personalization NAACL 2025 Hybrid Preferences: Learning to Route Instances for Human vs. AI Feedback ACL 2025 Steering off Course: Reliability Challenges in Steering Language Models ACL 2025 OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens ACL 2025 Organize the Web: Constructing Domains Enhances Pre-Training Data Curation ICML 2025 Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models CVPR 2025 s1: Simple test-time scaling EMNLP 2025 Infini-gram mini: Exact n-gram Search at the Internet Scale with FM-Index EMNLP 2025 Answer, Assemble, Ace: Understanding How LMs Answer Multiple Choice Questions ICLR 2025 OLMoE: Open Mixture-of-Experts Language Models ICLR 2025 RewardBench: Evaluating Reward Models for Language Modeling NAACL 2025 The Art of Saying No: Contextual Noncompliance in Language Models NIPS 2024 Decoding-Time Language Model Alignment with Multiple Objectives NIPS 2024 Data Engineering for Scaling Language Models to 128K Context ICML 2024 APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference ICML 2024 Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback NIPS 2024 Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection ICLR 2024 SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore ICLR 2024 MatFormer: Nested Transformer for Elastic Inference NIPS 2024 CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation EMNLP 2024 Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging EMNLP 2024 BTR: Binary Token Representations for Efficient Retrieval Augmented Language Models ICLR 2024 MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts ICLR 2024 BUFFET: Benchmarking Large Language Models for Few-shot Cross-lingual Transfer NAACL 2024 ActionAtlas: A VideoQA Benchmark for Domain-specialized Action Recognition NIPS 2024 Paloma: A Benchmark for Evaluating Language Model Fit NIPS 2024 Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research ACL 2024 OLMo: Accelerating the Science of Language Models ACL 2024 Set the Clock: Temporal Alignment of Pretrained Language Models ACL 2024 What's In My Big Data? ICLR 2024 FActScore: Fine-grained Atomic Evaluation of Factual Precision in Long Form Text Generation EMNLP 2023 SHARCS: Efficient Transformers Through Routing with Dynamic Width Sub-networks EMNLP 2023 Editing models with task arithmetic ICLR 2023 AGRO: Adversarial discovery of error-prone Groups for Robust Optimization ICLR 2023 Is Reinforcement Learning (Not) for Natural Language Processing: Benchmarks, Baselines, and Building Blocks for Natural Language Policy Optimization ICLR 2023 DataComp: In search of the next generation of multimodal datasets NIPS 2023 GenEval: An object-focused framework for evaluating text-to-image alignment NIPS 2023 Fine-Grained Human Feedback Gives Better Rewards for Language Model Training NIPS 2023 How Far Can Camels Go? Exploring the State of Instruction Tuning on Open Resources NIPS 2023 Elaboration-Generating Commonsense Question Answering at Scale ACL 2023 Z-ICL: Zero-Shot In-Context Learning with Pseudo-Demonstrations ACL 2023 FiD-ICL: A Fusion-in-Decoder Approach for Efficient In-Context Learning ACL 2023 When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories ACL 2023 CREPE: Open-Domain Question Answering with False Presuppositions ACL 2023 HINT: Hypernetwork Instruction Tuning for Efficient Zero- and Few-Shot Generalisation ACL 2023 PuMer: Pruning and Merging Tokens for Efficient Vision Language Models ACL 2023 Self-Instruct: Aligning Language Models with Self-Generated Instructions ACL 2023 Nonparametric Masked Language Modeling ACL 2023 Task-aware Retrieval with Instructions ACL 2023 Data-Efficient Finetuning Using Cross-Task Nearest Neighbors ACL 2023 Do Embodied Agents Dream of Pixelated Sheep: Embodied Decision Making using Language Guided World Modelling ICML 2023 Vera: A General-Purpose Plausibility Estimation Model for Commonsense Statements EMNLP 2023 TaskWeb: Selecting Better Source Tasks for Multi-task NLP EMNLP 2023 Crystal: Introspective Reasoners Reinforced with Self-Feedback EMNLP 2023 Machine Reading Comprehension using Case-based Reasoning EMNLP 2023 Reframing Instructional Prompts to GPTk’s Language ACL 2022 Noisy Channel Language Model Prompting for Few-Shot Text Classification ACL 2022 FaVIQ: FAct Verification from Information-seeking Questions ACL 2022 Cross-Task Generalization via Natural Language Crowdsourcing Instructions ACL 2022 ATTEMPT: Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts EMNLP 2022 Rainier: Reinforced Knowledge Introspector for Commonsense Question Answering EMNLP 2022 Correcting Diverse Factual Errors in Abstractive Summarization via Post-Editing and Language Model Infilling EMNLP 2022 CONQRR: Conversational Query Rewriting for Retrieval with Reinforcement Learning EMNLP 2022 Rethinking the Role of Demonstrations: What Makes In-Context Learning Work? EMNLP 2022 CORE: A Retrieve-then-Edit Framework for Counterfactual Data Generation EMNLP 2022 SciFact-Open: Towards open-domain scientific claim verification EMNLP 2022 Robust Fine-Tuning of Zero-Shot Models CVPR 2022 Evidentiality-guided Generation for Knowledge-Intensive NLP Tasks NAACL 2022 Generated Knowledge Prompting for Commonsense Reasoning ACL 2022 MetaICL: Learning to Learn In Context NAACL 2022 Prompt Waywardness: The Curious Case of Discretized Interpretation of Continuous Prompts NAACL 2022 Patching open-vocabulary models by interpolating weights NIPS 2022 MultiVerS: Improving scientific claim verification with weak supervision and full-document context NAACL 2022 Aligning to Social Norms and Values in Interactive Narratives NAACL 2022 NaturalProver: Grounded Mathematical Proof Generation with Language Models NIPS 2022 Exploring The Landscape of Distributional Robustness for Question Answering Models EMNLP 2022 Knowledge Base Question Answering by Case-based Reasoning over Subgraphs ICML 2022 Retrieval Data Augmentation Informed by Downstream Question Answering Performance ACL 2022 Extracting a Knowledge Base of Mechanisms from COVID-19 Papers NAACL 2021 Prompting Contrastive Explanations for Commonsense Reasoning Tasks IJCNLP 2021 Efficient Passage Retrieval with Hashing for Open-domain Question Answering IJCNLP 2021 Prompting Contrastive Explanations for Commonsense Reasoning Tasks ACL 2021 Efficient Passage Retrieval with Hashing for Open-domain Question Answering ACL 2021 MultiModalQA: complex question answering over text, tables and images ICLR 2021 A Controllable Model of Grounded Response Generation AAAI 2021 DeLighT: Deep and Light-weight Transformer ICLR 2021 DIALKI: Knowledge Identification in Conversational Systems through Dialogue-Document Contextualization EMNLP 2021 Iconary: A Pictionary-Based Game for Testing Multimodal Communication with Drawings and Text EMNLP 2021 Joint Passage Ranking for Diverse Multi-Answer Retrieval EMNLP 2021 GooAQ: Open Question Answering with Diverse Answer Types EMNLP 2021 Probing Across Time: What Does RoBERTa Know and When? EMNLP 2021 Beyond Paragraphs: NLP for Long Sequences NAACL 2021 Probing Contextual Language Models for Common Ground with Visual Representations NAACL 2021 XOR QA: Cross-lingual Open-Retrieval Question Answering NAACL 2021 Evaluating Models’ Local Decision Boundaries via Contrast Sets EMNLP 2020 Contextualized Sparse Representations for Real-Time Open-Domain Question Answering ACL 2020 Logic-Guided Data Augmentation and Regularization for Consistent Question Answering ACL 2020 SciREX: A Challenge Dataset for Document-Level Information Extraction ACL 2020 ZeroShotCeres: Zero-Shot Relation Extraction from Semi-Structured Webpages ACL 2020 Multi-modal Information Extraction from Text, Semi-structured, and Tabular Data on the Web ACL 2020 IIRC: A Dataset of Incomplete Information Reading Comprehension Questions EMNLP 2020 An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction EMNLP 2020 AmbigQA: Answering Ambiguous Open-domain Questions EMNLP 2020 Fact or Fiction: Verifying Scientific Claims EMNLP 2020 X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers EMNLP 2020 Dataset Cartography: Mapping and Diagnosing Datasets with Training Dynamics EMNLP 2020 UNIFIEDQA: Crossing Format Boundaries with a Single QA System EMNLP 2020 MedICaT: A Dataset of Medical Images, Captions, and Textual References EMNLP 2020 Learning to Retrieve Reasoning Paths over Wikipedia Graph for Question Answering ICLR 2020 DeFINE: Deep Factorized Input Token Embeddings for Neural Sequence Modeling ICLR 2020 Multi-hop Reading Comprehension through Question Decomposition and Rescoring ACL 2019 ESPNetv2: A Light-Weight, Power Efficient, and General Purpose Convolutional Neural Network CVPR 2019 A general framework for information extraction using dynamic span graphs NAACL 2019 MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms NAACL 2019 Text Generation from Knowledge Graphs with Graph Transformers NAACL 2019 A Discrete Hard EM Approach for Weakly Supervised Question Answering EMNLP 2019 SemEval-2019 Task 10: Math Question Answering SEMEVAL 2019 On Making Reading Comprehension More Comprehensive EMNLP 2019 Entity, Relation, and Event Extraction with Contextualized Span Representations EMNLP 2019 Mixture Content Selection for Diverse Sequence Generation EMNLP 2019 Compositional Questions Do Not Necessitate Multi-hop Reasoning ACL 2019 Real-Time Open-Domain Question Answering with Dense-Sparse Phrase Index ACL 2019 A Discrete Hard EM Approach for Weakly Supervised Question Answering IJCNLP 2019 Mixture Content Selection for Diverse Sequence Generation IJCNLP 2019 Entity, Relation, and Event Extraction with Contextualized Span Representations IJCNLP 2019 Neural Speed Reading via Skim-RNN ICLR 2018 Standardized Tests as benchmarks for Artificial Intelligence EMNLP 2018 Pyramidal Recurrent Unit for Language Modeling EMNLP 2018 Multi-Task Identification of Entities, Relations, and Coreference for Scientific Knowledge Graph Construction EMNLP 2018 Phrase-Indexed Question Answering: A New Challenge for Scalable Document Comprehension EMNLP 2018 Semi-Supervised Event Extraction with Paraphrase Clusters NAACL 2018 ESPNet: Efficient Spatial Pyramid of Dilated Convolutions for Semantic Segmentation ECCV 2018 The UWNLP system at SemEval-2018 Task 7: Neural Relation Extraction Model with Selectively Incorporated Concept Embeddings SEMEVAL 2018 Scientific Information Extraction with Semi-supervised Neural Tagging EMNLP 2017 Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension CVPR 2017 Question Answering through Transfer Learning from Large Fine-grained Supervision Data ACL 2017 Learning Prototypical Event Structure from Photo Albums ACL 2016 Disfluency Detection Using a Bidirectional LSTM INTERSPEECH 2016 A Task-Oriented Approach for Cost-Sensitive Recognition CVPR 2016 A Theme-Rewriting Approach for Generating Algebra Word Problems EMNLP 2016 Multiplicative Representations for Unsupervised Semantic Role Induction ACL 2016 MAWPS: A Math Word Problem Repository NAACL 2016 Learning Knowledge Graphs for Question Answering through Conversational Dialog NAACL 2015 Discriminative and Consistent Similarities in Instance-Level Multiple Instance Learning CVPR 2015 Talking to the crowd: What do people react to in online discussions? EMNLP 2015 Unediting: Detecting Disfluencies Without Careful Transcripts NAACL 2015 Aligning Sentences from Standard Wikipedia to Simple Wikipedia NAACL 2015 Segment-Phrase Table for Semantic Segmentation, Visual Entailment and Paraphrasing ICCV 2015 Solving Geometry Problems: Combining Text and Diagram Interpretation EMNLP 2015 Learning to Solve Arithmetic Word Problems with Verb Categorization EMNLP 2014 Multi-Resolution Language Grounding with Weak Supervision EMNLP 2014 Joint Coreference Resolution and Named-Entity Linking with Multi-Pass Sieves EMNLP 2013