Yunze Xiao
13 papers · 2022–2026 · 5 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (4)
Cross-Pollinator (9)
Renaissance Researcher (6)
Taxonomy Completionist (30)
Mega-Team (32)
Unstoppable (5)
Century Club (11)
Keyword Collector (54)
Conferences
EMNLP (5)
ACL (3)
AAAI (2)
EACL (2)
COLING (1)
Keywords
large language model (7)
multi-agent system (2)
benchmark evaluation (2)
zero-shot learning (1)
few-shot learning (1)
natural language processing (1)
adversarial robustness (1)
natural language inference (1)
multilingual NLP (1)
cross-lingual transfer (1)
offensive language detection (1)
content moderation (1)
bias detection (1)
confidence calibration (1)
low-resource language (1)
constraint satisfaction (1)
adversarial attack (1)
diffusion model (1)
role-playing agent (1)
text classification (1)
Papers
TartanMaroon: Multi-Agent Academic Advising with Iterative Negotiation and Transparent Collaboration
ACL 2026
JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models' Detection of Human risky health behavior Content in Jirai Community
EACL 2026
The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents
ACL 2026
AniTales: End-to-End Multimodal Story Generation Through Natural Language Prompting (Student Abstract)
AAAI 2026
Hire Your Anthropologist! Rethinking Culture Benchmarks Through an Anthropological Lens
EACL 2026
Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design
EMNLP 2025
MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation
EMNLP 2025
Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics
EMNLP 2025
ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations
EMNLP 2024
Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs
COLING 2024
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews
ACL 2024
Nexus at ArAIEval Shared Task: Fine-Tuning Arabic Language Models for Propaganda and Disinformation Detection
EMNLP 2023
Detailed Facial Geometry Recovery from Multi-View Images by Learning an Implicit Function
AAAI 2022