conftrace_

Edoardo Debenedetti

6 papers · 2024–2025 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+2 more ↓

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌍 Conference Polyglot (3) 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (12)

👥 Mega-Team (21) 🏆 Keyword Champion (2)

Conferences

NIPS (3) ICLR (2) ICML (1)

Top co-authors

Florian Tramer (6) Javier Rando (3) Nicholas Carlini (2) Mario Fritz (1) Vikash Sehwag (1) Francesco Croce (1) Michael Aerni (1) George J. Pappas (1) Eric Wong (1) Lea Schönherr (1)

Keywords

large language model (3) adversarial learning (2) adversarial attack (2) prompt injection (2) security evaluation (2) safety benchmark (1) robustness evaluation (1) ai agent (1) tool execution (1) agent system (1) llm agent (1) defense mechanism (1) llm safety (1) security vulnerability (1) llm robustness (1) benchmark evaluation (1) model defense (1) adversarial robustness (1) ai safety (1) jailbreak attack (1)

Papers

Measuring Non-Adversarial Reproduction of Training Data in Large Language Models ICLR 2025 Adversarial Search Engine Optimization for Large Language Models ICLR 2025 AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example Defenses ICML 2025 Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition NIPS 2024 JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models NIPS 2024 AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents NIPS 2024