Max Bartolo

17 papers · 2018–2025 · 6 top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (7) 🌍 Conference Polyglot (6) 🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (39)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 👥 Mega-Team (47) 🔥 Unstoppable (6) 🗃️ Keyword Collector (85) 💎 Century Club (17)

Conferences

EMNLP (6) ACL (3) NAACL (3) ICLR (2) NIPS (2) CVPR (1)

Top co-authors

Sebastian Riedel (6) Pontus Stenetorp (6) Tristan Thrush (6) Douwe Kiela (6) Adina Williams (5) Maximilian Mozes (3) Robin Jia (3) Peter Mattson (2) William Gaviria Rojas (2) Rafael Mosquera (2)

Keywords

model robustness (4) large language model (4) adversarial training (2) dataset collection (2) model evaluation (2) question answering (2) human feedback (2) model alignment (2) language model (2) text classification (2) few-shot learning (2) data augmentation (2) in-context learning (1) benchmark suite (1) mathematical reasoning (1) adversarial learning (1) dataset creation (1) visual reasoning (1) reward modeling (1) sentiment classification (1)

Papers

Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models ICLR 2025
Improving Reward Models with Synthetic Critiques NAACL 2025
No Need for Explanations: LLMs can implicitly learn from mistakes in-context EMNLP 2025
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models EMNLP 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning ACL 2024
The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models NIPS 2024
Human Feedback is not Gold Standard ICLR 2024
DataPerf: Benchmarks for Data-Centric AI Development NIPS 2023
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity ACL 2022
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality CVPR 2022
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks ACL 2022
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants NAACL 2022
Dynabench: Rethinking Benchmarking in NLP NAACL 2021
Contrasting Human- and Machine-Generated Word-Level Adversarial Examples for Text Classification EMNLP 2021
Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation EMNLP 2021
Undersensitivity in Neural Reading Comprehension EMNLP 2020
Interpretation of Natural Language Rules in Conversational Machine Reading EMNLP 2018