Youliang Yuan
18 papers · 2024–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Cross-Pollinator (13) π Conference Polyglot (6) π Interdisciplinary Bridge π§ Keyword Pioneer π Renaissance Researcher (6)
π
Renaissance Researcher
(6)
πΊοΈ
Taxonomy Completionist
(31)
π₯
Mega-Team
(35)
π€
Dynamic Duo
(14)
β
The Questioner
β‘
Prolific Year
(10)
π
Century Club
(16)
ποΈ
Keyword Collector
(66)
Conferences
ACL (7)
EMNLP (5)
ICLR (3)
COLING (1)
ICML (1)
NAACL (1)
Top co-authors
Keywords
large language model
(8)
benchmark evaluation
(4)
multimodal large language model
(3)
model safety
(2)
visual question answering
(2)
agent system
(2)
prompt engineering
(2)
commonsense knowledge
(1)
ai safety
(1)
logical reasoning
(1)
implicit bia
(1)
automated reasoning
(1)
harmful content
(1)
safety alignment
(1)
responsible ai
(1)
reward hacking
(1)
adversarial defense
(1)
instruction following
(1)
adversarial attack
(1)
model alignment
(1)
Papers
Curing Miracle Steps in LLM Mathematical Reasoning with Rubric Rewards
ACL 2026
SHAPE: Unifying Safety, Helpfulness and Pedagogy for Educational LLMs
ACL 2026
Canβt See the Forest for the Trees: Benchmarking Multimodal Safety Awareness for Multimodal LLMs
ACL 2025
Insight Over Sight: Exploring the Vision-Knowledge Conflicts in Multimodal LLMs
ACL 2025
Chain-of-Jailbreak Attack for Image Generation Models via Step by Step Editing
ACL 2025
VisBias: Measuring Explicit and Implicit Social Biases in Vision Language Models
EMNLP 2025
Refuse Whenever You Feel Unsafe: Improving Safety in LLMs via Decoupled Refusal Training
ACL 2025
ToolSafety: A Comprehensive Dataset for Enhancing Safety in LLM-Based Agent Tool Invocations
EMNLP 2025
Learning to Ask: When LLM Agents Meet Unclear Instruction
EMNLP 2025
Competing Large Language Models in Multi-Agent Gaming Environments
ICLR 2025
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents
ICML 2025
Libra-Leaderboard: Towards Responsible AI through a Balanced Leaderboard of Safety and Capability
NAACL 2025
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
ICLR 2024
Difficult Task Yes but Simple Task No: Unveiling the Laziness in Multimodal LLMs
EMNLP 2024
Does ChatGPT Know That It Does Not Know? Evaluating the Black-Box Calibration of ChatGPT
COLING 2024
All Languages Matter: On the Multilingual Safety of LLMs
ACL 2024
LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models
EMNLP 2024
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs
ICLR 2024