Gelei Deng
4 papers · 2024–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓
π
Conference Polyglot
(3)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(11)
π§
Keyword Pioneer
π
Cross-Pollinator
(15)
Conferences
EMNLP (2)
ACL (1)
ICLR (1)
Top co-authors
Keywords
large language model
(3)
jailbreak attack
(2)
model robustness
(1)
prompt engineering
(1)
multimodal learning
(1)
ai safety
(1)
supervised fine-tuning
(1)
adversarial prompt
(1)
red teaming
(1)
safety filter
(1)
jailbreak defense
(1)
prompt injection
(1)
safety training
(1)
harmful content generation
(1)
large audio-language model
(1)
multi-agent system
(1)
audio understanding
(1)
modality bia
(1)
modality conflict
(1)
adversarial learning
(1)
Papers
When Audio and Text Disagree: Revealing Text Bias in Large Audio-Language Models
EMNLP 2025
TombRaider: Entering the Vault of History to Jailbreak Large Language Models
EMNLP 2025
Fine-Grained Verifiers: Preference Modeling as Next-token Prediction in Vision-Language Alignment
ICLR 2025
A Comprehensive Study of Jailbreak Attack versus Defense for Large Language Models
ACL 2024