conftrace_

Yixu Wang

12 papers · 2022–2026 · 8 top CS/AI conferences

Achievements

🌈 Renaissance Researcher (6) 🧭 Keyword Pioneer 🐝 Cross-Pollinator (12) 🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🤝 Dynamic Duo (10) ⚡ Prolific Year (5) ❓ The Questioner 💎 Century Club (10)

Conferences

AAAI (2) ACL (2) ICCV (2) NAACL (2) ECCV (1) EMNLP (1) ICML (1) NIPS (1)

Top co-authors

Yan Teng (11) Yingchun Wang (11) Kexin Huang (5) Tianle Gu (4) Xingjun Ma (4) Haiquan Zhao (3) Lingyu Li (3) Yang Yao (3) Yu-Gang Jiang (2) Jie Li (2)

Research topics

Keywords

large language model (4) jailbreak attack (3) model extraction (2) safety evaluation (2) adversarial learning (2) model safety (1) privacy preservation (1) semi-supervised learning (1) backdoor attack (1) machine learning (1) adversarial attack (1) diffusion model (1) synthetic data (1) parameter-efficient fine-tuning (1) bias detection (1) bi-level optimization (1) vision-language model (1) safety alignment (1) multimodal large language model (1) latent space (1)

Papers

The Other Mind: How Language Models Exhibit Human Temporal Cognition (AAAI 2026)
Probing the Safety Robustness of LLMs in Latent Space (ACL 2026)
A Mousetrap: Fooling Large Reasoning Models for Jailbreak with Chain of Iterative Chaos (ACL 2025)
StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data (ICCV 2025)
IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves (ICCV 2025)
HoneypotNet: Backdoor Attacks Against Model Extraction (AAAI 2025)
Reflection-Bench: Evaluating Epistemic Agency in Large Language Models (ICML 2025)
Flames: Benchmarking Value Alignment of LLMs in Chinese (NAACL 2024)
Fake Alignment: Are LLMs Really Aligned Well? (NAACL 2024)
ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models (EMNLP 2024)
MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models (NIPS 2024)
Black-Box Dissector: Towards Erasing-Based Hard-Label Model Stealing Attack (ECCV 2022)