conftrace_

Gelei Deng

4 papers · 2024–2025 · 3 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

🌍 Conference Polyglot (3) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (11) 🧭 Keyword Pioneer 🐝 Cross-Pollinator (15)

Conferences

EMNLP (2) ACL (1) ICLR (1)

Top co-authors

Yi Liu (2) Yuekang Li (2) XIANGLIN YANG (1) Tianwei Zhang (1) Jiahao Zhang (1) Ziqi Ding (1) Yiyang Zhou (1) Han Qiu (1) Cheng Wang (1) Zhaorun Chen (1)

Keywords

large language model (3) jailbreak attack (2) model robustness (1) prompt engineering (1) multimodal learning (1) ai safety (1) supervised fine-tuning (1) adversarial prompt (1) red teaming (1) safety filter (1) jailbreak defense (1) prompt injection (1) safety training (1) harmful content generation (1) large audio-language model (1) multi-agent system (1) audio understanding (1) modality bia (1) modality conflict (1) adversarial learning (1)

Papers

When Audio and Text Disagree: Revealing Text Bias in Large Audio-Language Models EMNLP 2025 TombRaider: Entering the Vault of History to Jailbreak Large Language Models EMNLP 2025 Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment ICLR 2025 A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models ACL 2024