Yixu Wang
12 papers · 2022–2026 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🌈 Renaissance Researcher (6) 🧭 Keyword Pioneer 🐝 Cross-Pollinator (12) 🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge
🧭
Keyword Pioneer
🌍
Conference Polyglot
(8)
🤝
Dynamic Duo
(10)
⚡
Prolific Year
(5)
❓
The Questioner
💎
Century Club
(10)
Conferences
AAAI (2)
ACL (2)
ICCV (2)
NAACL (2)
ECCV (1)
EMNLP (1)
ICML (1)
NIPS (1)
Top co-authors
Research topics
Keywords
large language model
(4)
jailbreak attack
(3)
model extraction
(2)
safety evaluation
(2)
adversarial learning
(2)
model safety
(1)
privacy preservation
(1)
semi-supervised learning
(1)
backdoor attack
(1)
machine learning
(1)
adversarial attack
(1)
diffusion model
(1)
synthetic datum
(1)
parameter-efficient fine-tuning
(1)
bias detection
(1)
bi-level optimization
(1)
vision-language model
(1)
safety alignment
(1)
multimodal large language model
(1)
latent space
(1)
Papers
The Other Mind: How Language Models Exhibit Human Temporal Cognition
AAAI 2026
Probing the Safety Robustness of LLMs in Latent Space
ACL 2026
A Mousetrap: Fooling Large Reasoning Models for Jailbreak with Chain of Iterative Chaos
ACL 2025
StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data
ICCV 2025
IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves
ICCV 2025
HoneypotNet: Backdoor Attacks Against Model Extraction
AAAI 2025
Reflection-Bench: Evaluating Epistemic Agency in Large Language Models
ICML 2025
Flames: Benchmarking Value Alignment of LLMs in Chinese
NAACL 2024
Fake Alignment: Are LLMs Really Aligned Well?
NAACL 2024
ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models
EMNLP 2024
MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models
NIPS 2024
Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack
ECCV 2022