Max Bartolo
17 papers · 2018–2025 · 6 top CS/AI conferences
Achievements
Interdisciplinary Bridge
Academic Marathon (7)
Conference Polyglot (6)
Renaissance Researcher (6)
Taxonomy Completionist (39)
Keyword Pioneer
Hot Topic Early Bird
Mega-Team (47)
Unstoppable (6)
Keyword Collector (85)
Century Club (17)
Conferences
EMNLP (6)
ACL (3)
NAACL (3)
ICLR (2)
NIPS (2)
CVPR (1)
Top co-authors
Keywords
model robustness (4)
large language model (4)
adversarial training (2)
dataset collection (2)
model evaluation (2)
question answering (2)
human feedback (2)
model alignment (2)
language model (2)
text classification (2)
few-shot learning (2)
data augmentation (2)
in-context learning (1)
benchmark suite (1)
mathematical reasoning (1)
adversarial learning (1)
dataset creation (1)
visual reasoning (1)
reward modeling (1)
sentiment classification (1)
Papers
Procedural Knowledge in Pretraining Drives Reasoning in Large Language Models
ICLR 2025
Improving Reward Models with Synthetic Critiques
NAACL 2025
No Need for Explanations: LLMs can implicitly learn from mistakes in-context
EMNLP 2025
Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models
EMNLP 2024
Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning
ACL 2024
The PRISM Alignment Dataset: What Participatory, Representative and Individualised Human Feedback Reveals About the Subjective and Multicultural Alignment of Large Language Models
NIPS 2024
Human Feedback is not Gold Standard
ICLR 2024
DataPerf: Benchmarks for Data-Centric AI Development
NIPS 2023
Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order Sensitivity
ACL 2022
Winoground: Probing Vision and Language Models for Visio-Linguistic Compositionality
CVPR 2022
Dynatask: A Framework for Creating Dynamic AI Benchmark Tasks
ACL 2022
Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants
NAACL 2022
Dynabench: Rethinking Benchmarking in NLP
NAACL 2021
Contrasting Human- and Machine-Generated Word-Level Adversarial Examples for Text Classification
EMNLP 2021
Improving Question Answering Model Robustness with Synthetic Adversarial Data Generation
EMNLP 2021
Undersensitivity in Neural Reading Comprehension
EMNLP 2020
Interpretation of Natural Language Rules in Conversational Machine Reading
EMNLP 2018