Yingchun Wang
14 papers · 2024–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
π Renaissance Researcher (6) π Cross-Pollinator (12) π§ Keyword Pioneer π Conference Polyglot (7) π Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(26)
π§
Keyword Pioneer
π€
Dynamic Duo
(10)
β
The Questioner
β‘
Prolific Year
(7)
ποΈ
Keyword Collector
(59)
π
Century Club
(11)
Conferences
ACL (4)
AAAI (2)
EMNLP (2)
ICCV (2)
NAACL (2)
ICML (1)
NIPS (1)
Top co-authors
Research topics
Keywords
large language model
(5)
jailbreak attack
(3)
chain of thought
(2)
adversarial learning
(2)
safety evaluation
(2)
model extraction
(2)
chain-of-thought reasoning
(1)
model safety
(1)
constrained reinforcement learning
(1)
privacy preservation
(1)
confidence calibration
(1)
knowledge unlearning
(1)
ai safety
(1)
safety alignment
(1)
reward modeling
(1)
diffusion model
(1)
adversarial attack
(1)
bias detection
(1)
synthetic datum
(1)
backdoor attack
(1)
Papers
Deliberative Searcher: Improving LLM Reliability via Reinforcement Learning with Constraints
ACL 2026
Probing the Safety Robustness of LLMs in Latent Space
ACL 2026
The Other Mind: How Language Models Exhibit Human Temporal Cognition
AAAI 2026
From Evasion to Concealment: Stealthy Knowledge Unlearning for LLMs
ACL 2025
Beyond Correctness: Confidence-Aware Reward Modeling for Enhancing Large Language Model Reasoning
EMNLP 2025
StolenLoRA: Exploring LoRA Extraction Attacks via Synthetic Data
ICCV 2025
Reflection-Bench: Evaluating Epistemic Agency in Large Language Models
ICML 2025
IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves
ICCV 2025
HoneypotNet: Backdoor Attacks Against Model Extraction
AAAI 2025
A Mousetrap: Fooling Large Reasoning Models for Jailbreak with Chain of Iterative Chaos
ACL 2025
Fake Alignment: Are LLMs Really Aligned Well?
NAACL 2024
Flames: Benchmarking Value Alignment of LLMs in Chinese
NAACL 2024
MLLMGuard: A Multi-dimensional Safety Evaluation Suite for Multimodal Large Language Models
NIPS 2024
ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models
EMNLP 2024