Yingchun Wang

14 papers · 2024–2026 · 7 conferences · across top CS/AI conferences

Achievements

+7 more ↓

🌈 Renaissance Researcher (6) 🐝 Cross-Pollinator (12) 🧭 Keyword Pioneer 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge

🗺️ Taxonomy Completionist (26) 🧭 Keyword Pioneer 🤝 Dynamic Duo (10) ❓ The Questioner ⚡ Prolific Year (7) 🗃️ Keyword Collector (59) 💎 Century Club (11)

Conferences

ACL (4) AAAI (2) EMNLP (2) ICCV (2) NAACL (2) ICML (1) NIPS (1)

Top co-authors

Yan Teng (12) Yixu Wang (11) Kexin Huang (6) Tianle Gu (5) Xingjun Ma (5) Yang Yao (3) Yujiu Yang (3) Haiquan Zhao (3) Lingyu Li (3) Yuanqi Yao (2)

Research topics

Privacy (1)

Keywords

large language model (5) jailbreak attack (3) chain of thought (2) adversarial learning (2) safety evaluation (2) model extraction (2) chain-of-thought reasoning (1) model safety (1) constrained reinforcement learning (1) privacy preservation (1) confidence calibration (1) knowledge unlearning (1) ai safety (1) safety alignment (1) reward modeling (1) diffusion model (1) adversarial attack (1) bias detection (1) synthetic datum (1) backdoor attack (1)

Papers

Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with Constraints ACL 2026 Probing the Safety Robustness of LLMs in Latent Space ACL 2026 The Other Mind: How Language Models Exhibit Human Temporal Cognition AAAI 2026 From Evasion to Concealment: Stealthy Knowledge Unlearning for LLMs ACL 2025 Beyond Correctness: Confidence-Aware Reward Modeling for Enhancing Large Language Model Reasoning EMNLP 2025 StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data ICCV 2025 Reflection-Bench: Evaluating Epistemic Agency in Large Language Models ICML 2025 IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves ICCV 2025 HoneypotNet: Backdoor Attacks Against Model Extraction AAAI 2025 A Mousetrap: Fooling Large Reasoning Models for Jailbreak with Chain of Iterative Chaos ACL 2025 Fake Alignment: Are LLMs Really Aligned Well? NAACL 2024 Flames: Benchmarking Value Alignment of LLMs in Chinese NAACL 2024 MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models NIPS 2024 ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models EMNLP 2024