Erik Jones
11 papers · 2020–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (4) π Academic Marathon (5) π Renaissance Researcher (5) πΊοΈ Taxonomy Completionist (12)
π
Academic Marathon
(5)
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
π
Century Club
(11)
π₯
Unstoppable
(6)
β
The Questioner
Conferences
ICLR (4)
ICML (4)
NIPS (2)
ACL (1)
Top co-authors
Keywords
large language model
(2)
natural language processing
(1)
multimodal learning
(1)
toxicity detection
(1)
code generation
(1)
bert model
(1)
model safety
(1)
discrete optimization
(1)
language model
(1)
evaluation benchmark
(1)
failure detection
(1)
cognitive bia
(1)
clip model
(1)
adversarial testing
(1)
error analysis
(1)
model auditing
(1)
multimodal system
(1)
system evaluation
(1)
reliability assessment
(1)
systematic failure
(1)
Papers
Uncovering Gaps in How Humans and LLMs Interpret Subjective Language
ICLR 2025
How Do Large Language Monkeys Get Their Power (Laws)?
ICML 2025
Adversaries Can Misuse Combinations of Safe Models
ICML 2025
Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models
ICLR 2024
Teaching Language Models to Hallucinate Less with Synthetic Tasks
ICLR 2024
Feedback Loops With Language Models Drive In-Context Reward Hacking
ICML 2024
Automatically Auditing Large Language Models via Discrete Optimization
ICML 2023
Mass-Producing Failures of Multimodal Systems with Language Models
NIPS 2023
Capturing Failures of Large Language Models via Human Cognitive Biases
NIPS 2022
Selective Classification Can Magnify Disparities Across Groups
ICLR 2021
Robust Encodings: A Framework for Combating Adversarial Typos
ACL 2020