conftrace_

Xuxin Cheng

55 papers · 2022–2026 · 16 conferences · across top CS/AI conferences

Achievements


🐝 Cross-Pollinator (4) 🌍 Conference Polyglot (15) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (8) 🗺️ Taxonomy Completionist (85) 🏆 Keyword Champion (16) 🔬 Deep Specialist (10) 🤝 Dynamic Duo (32) ⚡ Prolific Year (7) 🚀 Conference Pioneer 🗃️ Keyword Collector (161) 💎 Century Club (53) 🔥 Unstoppable (5) ❓ The Questioner

Conferences

ACL (9) EMNLP (8) AAAI (6) CORL (6) INTERSPEECH (6) COLING (4) ICLR (4) ECCV (2) ICCV (2) RSS (2) CVPR (1) EACL (1) IJCAI (1) MICCAI (1) NAACL (1) NIPS (1)

Top co-authors

Yuexian Zou (32) Zhihong Zhu (31) Hongxiang Li (17) Zhiqi Huang (14) Yaowei Li (13) Xianwei Zhuang (11) Xiaolong Wang (7) Dongsheng Chen (6) Shiqi Yang (5) Ziyu Yao (5)

Research topics

Understanding (1)

Keywords

spoken language understanding (16) contrastive learning (10) slot filling (8) intent detection (6) large language model (6) task-oriented dialogue (5) automatic speech recognition (5) intent classification (4) multi-task learning (4) zero-shot learning (4) reinforcement learning (4) multimodal learning (4) optimal transport (3) cross-lingual transfer (3) pre-trained language model (3) whole-body control (3) transfer learning (2) sim-to-real transfer (2) video understanding (2) data augmentation (2)

Papers

MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models (EACL 2026)
SILO-BENCH: A Scalable Environment for Evaluating Distributed Coordination in Multi-Agent LLM Systems (ACL 2026)
EXCGEC: A Benchmark for Edit-Wise Explainable Chinese Grammatical Error Correction (AAAI 2025)
CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model (CVPR 2025)
AMO: Adaptive Motion Optimization for Hyper-Dexterous Humanoid Whole-Body Control (RSS 2025)
UniCoTT: A Unified Framework for Structural Chain-of-Thought Distillation (ICLR 2025)
Humanoid Policy Human Policy (CORL 2025)
DisPose: Disentangling Pose Guidance for Controllable Human Image Animation (ICLR 2025)
ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training (CORL 2025)
PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling (ACL 2024)
Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation (ACL 2024)
Soul-Mix: Enhancing Multimodal Machine Translation with Manifold Mixup (ACL 2024)
Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment (ACL 2024)
Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding (ACL 2024)
MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts (ACL 2024)
Alignment before Awareness: Towards Visual Question Localized-Answering in Robotic Surgery via Optimal Transport and Answer Semantics (COLING 2024)
Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic (COLING 2024)
Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling (COLING 2024)
Zero-Shot Spoken Language Understanding via Large Language Models: A Preliminary Study (COLING 2024)
KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval (ECCV 2024)
Uncertainty-aware sign language video retrieval with probability distribution modeling (ECCV 2024)
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval (INTERSPEECH 2024)
Visual Whole-Body Control for Legged Loco-Manipulation (CORL 2024)
Open-TeleVision: Teleoperation with Immersive Active Visual Feedback (CORL 2024)
ACE: A Cross-platform and visual-Exoskeletons System for Low-Cost Dexterous Teleoperation (CORL 2024)
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning (AAAI 2024)
Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport (AAAI 2024)
Exploiting Auxiliary Caption for Video Grounding (AAAI 2024)
Aligner²: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment (AAAI 2024)
Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling (AAAI 2024)
What are the Generator Preferences for End-to-end Task-Oriented Dialog System? (EMNLP 2024)
RAG-HAT: A Hallucination-Aware Tuning Pipeline for LLM in Retrieval-Augmented Generation (EMNLP 2024)
Learning to Match Representations is Better for End-to-End Task-Oriented Dialog System (EMNLP 2024)
PolyVoice: Language Models for Speech to Speech Translation (ICLR 2024)
Retrieval is Accurate Generation (ICLR 2024)
Generating More Audios for End-to-End Spoken Language Understanding (IJCAI 2024)
Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation (INTERSPEECH 2024)
Multivariate Cooperative Game for Image-Report Pairs: Hierarchical Semantic Alignment for Medical Report Generation (MICCAI 2024)
MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration (NAACL 2024)
Expressive Whole-Body Control for Humanoid Robots (RSS 2024)
Towards Unified Spoken Language Understanding Decoding via Label-aware Compact Linguistics Representations (ACL 2023)
Enhancing Code-Switching for Cross-lingual SLU: A Unified View of Semantic and Grammatical Coherence (EMNLP 2023)
ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding (ACL 2023)
FC-MTLF: A Fine- and Coarse-grained Multi-Task Learning Framework for Cross-Lingual Spoken Language Understanding (INTERSPEECH 2023)
C²A-SLU: Cross and Contrastive Attention for Improving ASR Robustness in Spoken Language Understanding (INTERSPEECH 2023)
Accelerating Multiple Intent Detection and Slot Filling via Targeted Knowledge Distillation (EMNLP 2023)
MRRL: Modifying the Reference via Reinforcement Learning for Non-Autoregressive Joint Multiple Intent Detection and Slot Filling (EMNLP 2023)
Syntax Matters: Towards Spoken Language Understanding via Syntax-Aware Attention (EMNLP 2023)
GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering (INTERSPEECH 2023)
Mix before Align: Towards Zero-shot Cross-lingual Sentiment Analysis via Soft-Mix and Multi-View Learning (INTERSPEECH 2023)
MCLF: A Multi-grained Contrastive Learning Framework for ASR-robust Spoken Language Understanding (EMNLP 2023)
Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation (ICCV 2023)
G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory (ICCV 2023)
Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning (NIPS 2023)
Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion (CORL 2022)