Papers

2,781 papers found
2026 AAAI
2026 AAAI
2026 AAAI
Dynamic Deep Prompt Optimization for Defending Against Jailbreak Attacks on LLMs
Doniyorkhon Obidov, Honggang Yu, Xiaolong Guo et al.
2026 AAAI
DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs
Oluwanifemi Bamgbose, Masoud Hashemi, Sathwik Tejaswi Madhusudhan et al.
2026 AAAI
2026 AAAI
Silenced Biases: The Dark Side LLMs Learned to Refuse
Rom Himelstein, Amit LeVi, Brit Youngmann et al.
2026 AAAI
2026 AAAI
STAR-1: Safer Alignment of Reasoning LLMs with 1K Data
Zijun Wang, Haoqin Tu, Yuhan Wang et al.
2026 AAAI