conftrace_

Yunze Xiao

13 papers · 2022–2026 · 5 conferences · across top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🐝 Cross-Pollinator (9) 🌈 Renaissance Researcher (6)

🗺️ Taxonomy Completionist (30) 👥 Mega-Team (32) 🔥 Unstoppable (5) 💎 Century Club (11) 🗃️ Keyword Collector (54)

Conferences

EMNLP (5) ACL (3) AAAI (2) EACL (2) COLING (1)

Top co-authors

Mona T. Diab (4) Jiarui Liu (2) Qingcheng Zeng (2) Weihao Xuan (2) Heli Qi (2) Junjue Wang (2) Irene Li (2) Peidi Dong (1) Rui Yang (1) Xun Cao (1)

Keywords

large language model (7) multi-agent system (2) benchmark evaluation (2) zero-shot learning (1) few-shot learning (1) natural language processing (1) adversarial robustness (1) natural language inference (1) multilingual nlp (1) cross-lingual transfer (1) offensive language detection (1) content moderation (1) bias detection (1) confidence calibration (1) low-resource language (1) constraint satisfaction (1) adversarial attack (1) diffusion model (1) role-playing agent (1) text classification (1)

Papers

TartanMaroon: Multi-Agent Academic Advising with Iterative Negotiation and Transparent Collaboration · ACL 2026
JiraiBench: A Bilingual Benchmark for Evaluating Large Language Models’ Detection of Human risky health behavior Content in Jirai Community · EACL 2026
The Confidence Dichotomy: Analyzing and Mitigating Miscalibration in Tool-Use Agents · ACL 2026
AniTales: End-to-End Multimodal Story Generation Through Natural Language Prompting (Student Abstract) · AAAI 2026
Hire Your Anthropologist! Rethinking Culture Benchmarks Through an Anthropological Lens · EACL 2026
Humanizing Machines: Rethinking LLM Anthropomorphism Through a Multi-Level Framework of Design · EMNLP 2025
MMLU-ProX: A Multilingual Benchmark for Advanced Large Language Model Evaluation · EMNLP 2025
Synthetic Socratic Debates: Examining Persona Effects on Moral Decision and Persuasion Dynamics · EMNLP 2025
ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations · EMNLP 2024
Verbing Weirds Language (Models): Evaluation of English Zero-Derivation in Five LLMs · COLING 2024
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews · ACL 2024
Nexus at ArAIEval Shared Task: Fine-Tuning Arabic Language Models for Propaganda and Disinformation Detection · EMNLP 2023
Detailed Facial Geometry Recovery from Multi-View Images by Learning an Implicit Function · AAAI 2022