Youliang Yuan

18 papers · 2024–2026 · 6 conferences · across top CS/AI conferences

Achievements

🐝 Cross-Pollinator (13) 🌍 Conference Polyglot (6) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (6)

🗺️ Taxonomy Completionist (31) 👥 Mega-Team (35) 🤝 Dynamic Duo (14) ❓ The Questioner ⚡ Prolific Year (10) 💎 Century Club (16) 🗃️ Keyword Collector (66)

Conferences

ACL (7) EMNLP (5) ICLR (3) COLING (1) ICML (1) NAACL (1)

Top co-authors

Wenxuan Wang (15) Jen-tse Huang (13) Pinjia He (10) Wenxiang Jiao (9) Zhaopeng Tu (8) Michael Lyu (5) Xiaoyuan Liu (3) Shuai Wang (2) Eric John Li (2) Man Ho Lam (2)

Keywords

large language model (8) benchmark evaluation (4) multimodal large language model (3) model safety (2) visual question answering (2) agent system (2) prompt engineering (2) commonsense knowledge (1) ai safety (1) logical reasoning (1) implicit bias (1) automated reasoning (1) harmful content (1) safety alignment (1) responsible ai (1) reward hacking (1) adversarial defense (1) instruction following (1) adversarial attack (1) model alignment (1)

Papers

Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards · ACL 2026
SHAPE: Unifying Safety, Helpfulness and Pedagogy for Educational LLMs · ACL 2026
Can’t See the Forest for the Trees: Benchmarking Multimodal Safety Awareness for Multimodal LLMs · ACL 2025
Insight Over Sight: Exploring the Vision-Knowledge Conflicts in Multimodal LLMs · ACL 2025
Chain-of-Jailbreak Attack for Image Generation Models via Step by Step Editing · ACL 2025
VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models · EMNLP 2025
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training · ACL 2025
ToolSafety: A Comprehensive Dataset for Enhancing Safety in LLM-Based Agent Tool Invocations · EMNLP 2025
Learning to Ask: When LLM Agents Meet Unclear Instruction · EMNLP 2025
Competing Large Language Models in Multi-Agent Gaming Environments · ICLR 2025
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents · ICML 2025
Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability · NAACL 2025
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher · ICLR 2024
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs · EMNLP 2024
Does ChatGPT Know That It Does Not Know? Evaluating the Black-Box Calibration of ChatGPT · COLING 2024
All Languages Matter: On the Multilingual Safety of LLMs · ACL 2024
LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models · EMNLP 2024
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs · ICLR 2024