Xianwei Zhuang
20 papers · 2024–2026 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Cross-Pollinator (8) π Conference Polyglot (9) π Interdisciplinary Bridge π§ Keyword Pioneer π Renaissance Researcher (8)
π
Renaissance Researcher
(8)
πΊοΈ
Taxonomy Completionist
(48)
π
Keyword Champion
(3)
π€
Dynamic Duo
(16)
β
The Questioner
(2)
β‘
Prolific Year
(15)
ποΈ
Keyword Collector
(91)
π
Century Club
(19)
Conferences
ACL (6)
EMNLP (4)
AAAI (3)
ECCV (2)
CVPR (1)
ICLR (1)
IJCAI (1)
INTERSPEECH (1)
NAACL (1)
Top co-authors
Keywords
contrastive learning
(5)
spoken language understanding
(5)
multimodal learning
(4)
large vision-language model
(3)
slot filling
(3)
visual hallucination
(3)
intent classification
(2)
attention mechanism
(2)
automatic speech recognition
(2)
optimal transport
(2)
vision-language model
(2)
audio-text retrieval
(2)
causal inference
(2)
task-oriented dialogue
(2)
intent detection
(2)
multilingual retrieval
(1)
multilingual nlp
(1)
domain adaptation
(1)
preference learning
(1)
multi-task learning
(1)
Papers
Not All Tokens and Heads Are Equally Important: Dual-Level Attention Intervention for Hallucination Mitigation
AAAI 2026
UniCoTT: A Unified Framework for Structural Chain-of-Thought Distillation
ICLR 2025
ATRI: Mitigating Multilingual Audio Text Retrieval Inconsistencies by Reducing Data Distribution Errors
ACL 2025
Can We Trust AI Doctors? A Survey of Medical Hallucination in Large Language and Large Vision-Language Models
ACL 2025
VASparse: Towards Efficient Visual Hallucination Mitigation via Visual-Aware Token Sparsification
CVPR 2025
MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts
ACL 2024
KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval
ECCV 2024
Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport
AAAI 2024
Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System
EMNLP 2024
What are the Generator Preferences for End-to-end Task-Oriented Dialog System?
EMNLP 2024
Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection
EMNLP 2024
Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory
EMNLP 2024
TFCD: Towards Multi-modal Sarcasm Detection via Training-Free Counterfactual Debiasing
IJCAI 2024
GPA: Global and Prototype Alignment for Audio-Text Retrieval
INTERSPEECH 2024
MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration
NAACL 2024
Uncertainty-aware sign language video retrieval with probability distribution modeling
ECCV 2024
Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling
AAAI 2024
PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling
ACL 2024
Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment
ACL 2024
Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding
ACL 2024