Xianwei Zhuang

20 papers · 2024–2026 · 9 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🐝 Cross-Pollinator (8) 🌍 Conference Polyglot (9) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌈 Renaissance Researcher (8)

🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (48) 🏆 Keyword Champion (3) 🤝 Dynamic Duo (16) ❓ The Questioner (2) ⚡ Prolific Year (15) 🗃️ Keyword Collector (91) 💎 Century Club (19)

Conferences

ACL (6) EMNLP (4) AAAI (3) ECCV (2) CVPR (1) ICLR (1) IJCAI (1) INTERSPEECH (1) NAACL (1)

Top co-authors

Yuexian Zou (17) Zhihong Zhu (14) Xuxin Cheng (11) Yuxin Xie (7) Liming Liang (6) Zhanpeng Chen (6) Hongxiang Li (5) Zhiqi Huang (4) Zhichang Wang (4) Bang Yang (2)

Keywords

contrastive learning (5) spoken language understanding (5) multimodal learning (4) large vision-language model (3) slot filling (3) visual hallucination (3) intent classification (2) attention mechanism (2) automatic speech recognition (2) optimal transport (2) vision-language model (2) audio-text retrieval (2) causal inference (2) task-oriented dialogue (2) intent detection (2) multilingual retrieval (1) multilingual nlp (1) domain adaptation (1) preference learning (1) multi-task learning (1)

Papers

Not All Tokens and Heads Are Equally Important: Dual-Level Attention Intervention for Hallucination Mitigation AAAI 2026 UniCoTT: A Unified Framework for Structural Chain-of-Thought Distillation ICLR 2025 ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors ACL 2025 Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models ACL 2025 VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification CVPR 2025 MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts ACL 2024 KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval ECCV 2024 Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport AAAI 2024 Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System EMNLP 2024 What are the Generator Preferences for End-to-end Task-Oriented Dialog System? EMNLP 2024 Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection EMNLP 2024 Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory EMNLP 2024 TFCD: Towards Multi-modal Sarcasm Detection via Training-Free Counterfactual Debiasing IJCAI 2024 GPA: Global and Prototype Alignment for Audio-Text Retrieval INTERSPEECH 2024 MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration NAACL 2024 Uncertainty-aware sign language video retrieval with probability distribution modeling ECCV 2024 Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling AAAI 2024 PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling ACL 2024 Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment ACL 2024 Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding ACL 2024