Edoardo Debenedetti
6 papers · 2024–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
๐งญ Keyword Pioneer ๐ฃ Hot Topic Early Bird ๐ Conference Polyglot (3) ๐ Interdisciplinary Bridge ๐ Cross-Pollinator (12)
๐ฅ
Mega-Team
(21)
๐
Keyword Champion
(2)
Conferences
NIPS (3)
ICLR (2)
ICML (1)
Top co-authors
Keywords
large language model
(3)
adversarial learning
(2)
adversarial attack
(2)
prompt injection
(2)
security evaluation
(2)
safety benchmark
(1)
robustness evaluation
(1)
ai agent
(1)
tool execution
(1)
agent system
(1)
llm agent
(1)
defense mechanism
(1)
llm safety
(1)
security vulnerability
(1)
llm robustness
(1)
benchmark evaluation
(1)
model defense
(1)
adversarial robustness
(1)
ai safety
(1)
jailbreak attack
(1)
Papers
Measuring Non-Adversarial Reproduction of Training Data in Large Language Models
ICLR 2025
Adversarial Search Engine Optimization for Large Language Models
ICLR 2025
AutoAdvExBench: Benchmarking Autonomous Exploitation of Adversarial Example Defenses
ICML 2025
Dataset and Lessons Learned from the 2024 SaTML LLM Capture-the-Flag Competition
NIPS 2024
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models
NIPS 2024
AgentDojo: A Dynamic Environment to Evaluate Prompt Injection Attacks and Defenses for LLM Agents
NIPS 2024