Tristan Thrush

20 papers · 2020–2025 · 8 conferences · across top CS/AI conferences

Achievements

+9 more ↓

🏃 Academic Marathon (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (8) 🐝 Cross-Pollinator (12)

🌍 Conference Polyglot (8) 🏃 Academic Marathon (5) 🌈 Renaissance Researcher (7) 🤝 Dynamic Duo (13) 👥 Mega-Team (54) 🔥 Unstoppable (6) 💎 Century Club (20) 🗃️ Keyword Collector (99) ⚡ Prolific Year (6)

Conferences

EMNLP (5) ACL (4) NIPS (4) NAACL (3) CVPR (1) ICLR (1) ICML (1) IJCNLP (1)

Top co-authors

Douwe Kiela (13) Max Bartolo (6) Adina Williams (5) Robin Jia (4) Bertie Vidgen (4) Christopher Potts (4) Zeerak Waseem (3) Amanpreet Singh (3) Sebastian Riedel (3) Pontus Stenetorp (3)

Keywords

model evaluation (4) adversarial training (4) hate detection (3) hate speech detection (3) data augmentation (3) model robustness (3) few-shot learning (2) human evaluation (2) multimodal learning (2) text classification (2) natural language processing (2) adversarial learning (2) language model (2) question answering (2) benchmark evaluation (2) vision language model (2) contrastive learning (2) dataset benchmark (2) dataset creation (1) visual question answering (1)

Papers

Improving Pretraining Data Using Perplexity Correlations ICLR 2025 MixMin: Finding Data Mixtures via Convex Minimization ICML 2025 ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation ACL 2024 Nearest Neighbor Normalization Improves Multimodal Retrieval EMNLP 2024 I am a Strange Dataset: Metalinguistic Tests for Language Models ACL 2024 DataPerf: Benchmarks for Data-Centric AI Development NIPS 2023 The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset NIPS 2022 Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks ACL 2022 Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-Based Hate NAACL 2022 Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality CVPR 2022 Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants NAACL 2022 Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements EMNLP 2022 Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation EMNLP 2021 Human-Adversarial Visual Question Answering NIPS 2021 Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection ACL 2021 Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking NIPS 2021 Findings of the WMT 2021 Shared Task on Large-Scale Multilingual Machine Translation EMNLP 2021 Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection IJCNLP 2021 Dynabench: Rethinking Benchmarking in NLP NAACL 2021 Investigating Novel Verb Learning in BERT: Selectional Preference Classes and Alternation-Based Syntactic Generalization EMNLP 2020