Shayne Longpre
32 papers · 2019–2025 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Conference Polyglot (10) π Academic Marathon (6) π Interdisciplinary Bridge π§ Keyword Pioneer π Cross-Pollinator (9)
π
Cross-Pollinator
(9)
π
Renaissance Researcher
(6)
πΊοΈ
Taxonomy Completionist
(42)
π
Triple Crown
π
Grand Slam
π₯
Mega-Team
(54)
π
Century Club
(32)
ποΈ
Keyword Collector
(101)
β
The Questioner
(3)
β‘
Prolific Year
(5)
π₯
Unstoppable
(7)
Conferences
ICML (6)
NAACL (6)
EMNLP (5)
ACL (4)
ICLR (4)
NIPS (3)
AAAI (1)
COLING (1)
IJCNLP (1)
JMLR (1)
Top co-authors
Keywords
question answering
(6)
large language model
(6)
language model
(4)
open-domain question answering
(2)
question rewriting
(2)
open-domain nlp
(2)
retrieval system
(2)
reading comprehension
(2)
multilingual language model
(2)
instruction finetuning
(2)
data augmentation
(2)
benchmark evaluation
(2)
popularity bia
(2)
entity disambiguation
(2)
text classification
(2)
language model evaluation
(2)
multilingual evaluation
(2)
responsible ai
(2)
conversational ai
(1)
natural language processing
(1)
Papers
Position: In-House Evaluation Is Not Enough. Towards Robust Third-Party Evaluation and Flaw Disclosure for General-Purpose AI
ICML 2025
To Err Is AI: A Case Study Informing LLM Flaw Reporting Practices
AAAI 2025
Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation
ACL 2025
Bridging the Data Provenance Gap Across Text, Speech, and Video
ICLR 2025
The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models
NAACL 2025
OctoPack: Instruction Tuning Code Large Language Models
ICLR 2024
A Systematic Review of NeurIPS Dataset Management Practices
NIPS 2024
Consent in Crisis: The Rapid Decline of the AI Data Commons
NIPS 2024
Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model
ACL 2024
Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models
EMNLP 2024
Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models
ICLR 2024
Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models
ICLR 2024
Position: On the Societal Impact of Open Foundation Models
ICML 2024
Position: A Safe Harbor for AI Evaluation and Red Teaming
ICML 2024
Position: Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them?
ICML 2024
Position: AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research
ICML 2024
Scaling Instruction-Finetuned Language Models
JMLR 2024
A Pretrainerβs Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity
NAACL 2024
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning
ICML 2023
Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks
COLING 2022
Pivot Through English: Reliably Answering Multilingual Questions without Document Retrieval
NAACL 2022
MIA 2022 Shared Task: Evaluating Cross-lingual Open-Retrieval Question Answering for 16 Diverse Languages
NAACL 2022
You reap what you sow: On the Challenges of Bias Evaluation Under Multilingual Settings
ACL 2022
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
NIPS 2022
Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP
IJCNLP 2021
Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP
ACL 2021
Entity-Based Knowledge Conflicts in Question Answering
EMNLP 2021
Open-Domain Question Answering Goes Conversational via Question Rewriting
NAACL 2021
On the Transferability of Minimal Prediction Preserving Inputs in Question Answering
NAACL 2021
How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers?
EMNLP 2020
A Wrong Answer or a Wrong Question? An Intricate Relationship between Question Reformulation and Answer Selection in Conversational Question Answering
EMNLP 2020
An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering
EMNLP 2019