conftrace_

Papers

187,652 papers found · 36,278 more still awaiting a processed abstract Show those too
2026 ACL
Safeguarding Language Models via Self-Destruct Trapdoor
Shahar Katz, Bar Alon, Ariel Shaulov et al.
2026 EACL
2026 ACL
SafeLens: Segment-Level Hate Speech Detection in Online Videos
Zhuoran Wang, Dylan Raharja, Yujia Hu et al.
2026 AAAI
SafeMIL: Learning Offline Safe Imitation Policy from Non-Preferred Trajectories
Returaj Burnwal, Nirav Pravinbhai Bhatt, Balaraman Ravindran
2026 AAAI
SafeMT: Multi-turn Safety for Multimodal Language Models
Han Zhu, Juntao Dai, Jiaming Ji et al.
2026 ACL
2026 AAAI
Safe RAG by RAG: Untying the Bell That RAG Rang with the RAG Hand
Xun Liang, Mengwei Wang, Yuefeng Ma et al.
2026 AAAI
SafeSearch: Do Not Trade Safety for Utility in LLM Search Agents
Qiusi Zhan, Angeline Budiman-Chan, Abdelrahman Zayed et al.
2026 EACL
2026 AAAI
2026 ACL
Safety of Large Language Models Beyond English: A Systematic Literature Review of Risks, Biases, and Safeguards
Aleksandra Krasnodębska, Katarzyna Dziewulska, Karolina Seweryn et al.
2026 EACL