conftrace_

Erik Jones

11 papers · 2020–2025 · 4 conferences · across top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (4) 🏃 Academic Marathon (5) 🌈 Renaissance Researcher (5) 🗺️ Taxonomy Completionist (12) 💎 Century Club (11) 🔥 Unstoppable (6) ❓ The Questioner

Conferences

ICLR (4) ICML (4) NIPS (2) ACL (1)

Top co-authors

Jacob Steinhardt (6) Aditi Raghunathan (2) Percy Liang (2) Varun Chandrasekaran (2) Hamid Palangi (2) Anca Dragan (2) Ece Kamar (2) Meena Jagadeesan (1) Robert Kirk (1) Sara Price (1)

Keywords

large language model (2) natural language processing (1) multimodal learning (1) toxicity detection (1) code generation (1) bert model (1) model safety (1) discrete optimization (1) language model (1) evaluation benchmark (1) failure detection (1) cognitive bias (1) clip model (1) adversarial testing (1) error analysis (1) model auditing (1) multimodal system (1) system evaluation (1) reliability assessment (1) systematic failure (1)

Papers

Uncovering Gaps in How Humans and LLMs Interpret Subjective Language ICLR 2025
How Do Large Language Monkeys Get Their Power (Laws)? ICML 2025
Adversaries Can Misuse Combinations of Safe Models ICML 2025
Attention Satisfies: A Constraint-Satisfaction Lens on Factual Errors of Language Models ICLR 2024
Teaching Language Models to Hallucinate Less with Synthetic Tasks ICLR 2024
Feedback Loops With Language Models Drive In-Context Reward Hacking ICML 2024
Automatically Auditing Large Language Models via Discrete Optimization ICML 2023
Mass-Producing Failures of Multimodal Systems with Language Models NIPS 2023
Capturing Failures of Large Language Models via Human Cognitive Biases NIPS 2022
Selective Classification Can Magnify Disparities Across Groups ICLR 2021
Robust Encodings: A Framework for Combating Adversarial Typos ACL 2020