conftrace_

Shayne Longpre

32 papers · 2019–2025 · 10 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+11 more ↓ 🌍 Conference Polyglot (10) πŸƒ Academic Marathon (6) πŸŒ‰ Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (9)
🐝 Cross-Pollinator (9) 🌈 Renaissance Researcher (6) πŸ—ΊοΈ Taxonomy Completionist (42) πŸ‘‘ Triple Crown πŸ† Grand Slam πŸ‘₯ Mega-Team (54) πŸ’Ž Century Club (32) πŸ—ƒοΈ Keyword Collector (101) ❓ The Questioner (3) ⚑ Prolific Year (5) πŸ”₯ Unstoppable (7)

Conferences

ICML (6) NAACL (6) EMNLP (5) ACL (4) ICLR (4) NIPS (3) AAAI (1) COLING (1) IJCNLP (1) JMLR (1)

Papers

Position: In-House Evaluation Is Not Enough. Towards Robust Third-Party Evaluation and Flaw Disclosure for General-Purpose AI ICML 2025 To Err Is AI: A Case Study Informing LLM Flaw Reporting Practices AAAI 2025 Global MMLU: Understanding and Addressing Cultural and Linguistic Biases in Multilingual Evaluation ACL 2025 Bridging the Data Provenance Gap Across Text, Speech, and Video ICLR 2025 The BiGGen Bench: A Principled Benchmark for Fine-grained Evaluation of Language Models with Language Models NAACL 2025 OctoPack: Instruction Tuning Code Large Language Models ICLR 2024 A Systematic Review of NeurIPS Dataset Management Practices NIPS 2024 Consent in Crisis: The Rapid Decline of the AI Data Commons NIPS 2024 Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model ACL 2024 Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models EMNLP 2024 Prometheus: Inducing Fine-Grained Evaluation Capability in Language Models ICLR 2024 Mixture-of-Experts Meets Instruction Tuning: A Winning Combination for Large Language Models ICLR 2024 Position: On the Societal Impact of Open Foundation Models ICML 2024 Position: A Safe Harbor for AI Evaluation and Red Teaming ICML 2024 Position: Data Authenticity, Consent, & Provenance for AI are all broken: what will it take to fix them? ICML 2024 Position: AI-Powered Autonomous Weapons Risk Geopolitical Instability and Threaten AI Research ICML 2024 Scaling Instruction-Finetuned Language Models JMLR 2024 A Pretrainer’s Guide to Training Data: Measuring the Effects of Data Age, Domain Coverage, Quality, & Toxicity NAACL 2024 The Flan Collection: Designing Data and Methods for Effective Instruction Tuning ICML 2023 Combining Compressions for Multiplicative Size Scaling on Natural Language Tasks COLING 2022 Pivot Through English: Reliably Answering Multilingual Questions without Document Retrieval NAACL 2022 MIA 2022 Shared Task: Evaluating Cross-lingual Open-Retrieval Question Answering for 16 Diverse Languages NAACL 2022 You reap what you sow: On the Challenges of Bias Evaluation Under Multilingual Settings ACL 2022 The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset NIPS 2022 Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP IJCNLP 2021 Evaluating Entity Disambiguation and the Role of Popularity in Retrieval-Based NLP ACL 2021 Entity-Based Knowledge Conflicts in Question Answering EMNLP 2021 Open-Domain Question Answering Goes Conversational via Question Rewriting NAACL 2021 On the Transferability of Minimal Prediction Preserving Inputs in Question Answering NAACL 2021 How Effective is Task-Agnostic Data Augmentation for Pretrained Transformers? EMNLP 2020 A Wrong Answer or a Wrong Question? An Intricate Relationship between Question Reformulation and Answer Selection in Conversational Question Answering EMNLP 2020 An Exploration of Data Augmentation and Sampling Techniques for Domain-Agnostic Question Answering EMNLP 2019