conftrace_

Suchin Gururangan

24 papers · 2018–2025 · 7 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓
+11 more ↓ ๐ŸŒ Conference Polyglot (7) ๐Ÿƒ Academic Marathon (7) ๐Ÿงญ Keyword Pioneer ๐ŸŒ‰ Interdisciplinary Bridge ๐Ÿ Cross-Pollinator (13)
๐Ÿ Cross-Pollinator (13) ๐ŸŒˆ Renaissance Researcher (5) ๐Ÿ—บ๏ธ Taxonomy Completionist (58) ๐Ÿ‘ฅ Mega-Team (60) ๐Ÿ‘‘ Triple Crown ๐Ÿค Dynamic Duo (14) ๐Ÿ—ƒ๏ธ Keyword Collector (120) ๐Ÿ’Ž Century Club (24) ๐Ÿ”ฅ Unstoppable (5) โ“ The Questioner โšก Prolific Year (6)

Conferences

EMNLP (8) ACL (5) NAACL (5) ICLR (2) IJCNLP (2) ICML (1) NIPS (1)

Papers

BTS: Harmonizing Specialized Experts into a Generalist LLM EMNLP 2025 Self-Generated Critiques Boost Reward Modeling for Language Models NAACL 2025 Language models scale reliably with over-training and on downstream tasks ICLR 2025 Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models EMNLP 2024 Time is Encoded in the Weights of Finetuned Language Models ACL 2024 AboutMe: Using Self-Descriptions in Webpages to Document the Effects of English Pretraining Data Filters ACL 2024 DataComp-LM: In search of the next generation of training sets for language models NIPS 2024 SILO Language Models: Isolating Legal Risk In a Nonparametric Datastore ICLR 2024 LESS: Selecting Influential Data for Targeted Instruction Tuning ICML 2024 Time Waits for No One! Analysis and Challenges of Temporal Misalignment NAACL 2022 M2D2: A Massively Multi-Domain Language Modeling Dataset EMNLP 2022 Nearest Neighbor Zero-Shot Inference EMNLP 2022 DEMix Layers: Disentangling Domains for Modular Language Modeling NAACL 2022 Whose Language Counts as High Quality? Measuring Language Ideologies in Text Data Selection EMNLP 2022 Expected Validation Performance and Estimation of a Random Variableโ€™s Maximum EMNLP 2021 All Thatโ€™s โ€˜Humanโ€™ Is Not Gold: Evaluating Human Evaluation of Generated Text ACL 2021 All Thatโ€™s โ€˜Humanโ€™ Is Not Gold: Evaluating Human Evaluation of Generated Text IJCNLP 2021 Detoxifying Language Models Risks Marginalizing Minority Voices NAACL 2021 RealToxicityPrompts: Evaluating Neural Toxic Degeneration in Language Models EMNLP 2020 Donโ€™t Stop Pretraining: Adapt Language Models to Domains and Tasks ACL 2020 Show Your Work: Improved Reporting of Experimental Results EMNLP 2019 Variational Pretraining for Semi-supervised Text Classification ACL 2019 Show Your Work: Improved Reporting of Experimental Results IJCNLP 2019 Annotation Artifacts in Natural Language Inference Data NAACL 2018