Tristan Thrush
20 papers · 2020–2025 · 8 conferences · across top CS/AI conferences
Achievements
🏃 Academic Marathon (5)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🌍 Conference Polyglot (8)
🐝 Cross-Pollinator (12)
🌈 Renaissance Researcher (7)
🤝 Dynamic Duo (13)
👥 Mega-Team (54)
🔥 Unstoppable (6)
💎 Century Club (20)
🗃️ Keyword Collector (99)
⚡ Prolific Year (6)
Conferences
EMNLP (5)
ACL (4)
NIPS (4)
NAACL (3)
CVPR (1)
ICLR (1)
ICML (1)
IJCNLP (1)
Keywords
model evaluation (4)
adversarial training (4)
hate detection (3)
hate speech detection (3)
data augmentation (3)
model robustness (3)
few-shot learning (2)
human evaluation (2)
multimodal learning (2)
text classification (2)
natural language processing (2)
adversarial learning (2)
language model (2)
question answering (2)
benchmark evaluation (2)
vision language model (2)
contrastive learning (2)
dataset benchmark (2)
dataset creation (1)
visual question answering (1)
Papers
Improving Pretraining Data Using Perplexity Correlations
ICLR 2025
MixMin: Finding Data Mixtures via Convex Minimization
ICML 2025
ColorSwap: A Color and Word Order Dataset for Multimodal Evaluation
ACL 2024
Nearest Neighbor Normalization Improves Multimodal Retrieval
EMNLP 2024
I am a Strange Dataset: Metalinguistic Tests for Language Models
ACL 2024
DataPerf: Benchmarks for Data-Centric AI Development
NIPS 2023
The BigScience ROOTS Corpus: A 1.6TB Composite Multilingual Dataset
NIPS 2022
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks
ACL 2022
Hatemoji: A Test Suite and Adversarially-Generated Dataset for Benchmarking and Detecting Emoji-Based Hate
NAACL 2022
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality
CVPR 2022
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants
NAACL 2022
Evaluate & Evaluation on the Hub: Better Best Practices for Data and Model Measurements
EMNLP 2022
Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation
EMNLP 2021
Human-Adversarial Visual Question Answering
NIPS 2021
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection
ACL 2021
Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking
NIPS 2021
Findings of the WMT 2021 Shared Task on Large-Scale Multilingual Machine Translation
EMNLP 2021
Learning from the Worst: Dynamically Generated Datasets to Improve Online Hate Detection
IJCNLP 2021
Dynabench: Rethinking Benchmarking in NLP
NAACL 2021
Investigating Novel Verb Learning in BERT: Selectional Preference Classes and Alternation-Based Syntactic Generalization
EMNLP 2020