Xuxin Cheng
55 papers · 2022–2026 · 16 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (4)
Conference Polyglot (15)
Keyword Pioneer
Interdisciplinary Bridge
Renaissance Researcher (8)
Taxonomy Completionist (85)
Keyword Champion (16)
Deep Specialist (10)
Dynamic Duo (32)
Prolific Year (7)
Conference Pioneer
Keyword Collector (161)
Century Club (53)
Unstoppable (5)
The Questioner
Conferences
ACL (9)
EMNLP (8)
AAAI (6)
CORL (6)
INTERSPEECH (6)
COLING (4)
ICLR (4)
ECCV (2)
ICCV (2)
RSS (2)
CVPR (1)
EACL (1)
IJCAI (1)
MICCAI (1)
NAACL (1)
NIPS (1)
Research topics
Keywords
spoken language understanding (16)
contrastive learning (10)
slot filling (8)
intent detection (6)
large language model (6)
task-oriented dialogue (5)
automatic speech recognition (5)
intent classification (4)
multi-task learning (4)
zero-shot learning (4)
reinforcement learning (4)
multimodal learning (4)
optimal transport (3)
cross-lingual transfer (3)
pre-trained language model (3)
whole-body control (3)
transfer learning (2)
sim-to-real transfer (2)
video understanding (2)
data augmentation (2)
Papers
MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models
EACL 2026
SILO-BENCH: A Scalable Environment for Evaluating Distributed Coordination in Multi-Agent LLM Systems
ACL 2026
EXCGEC: A Benchmark for Edit-Wise Explainable Chinese Grammatical Error Correction
AAAI 2025
CountLLM: Towards Generalizable Repetitive Action Counting via Large Language Model
CVPR 2025
AMO: Adaptive Motion Optimization for Hyper-Dexterous Humanoid Whole-Body Control
RSS 2025
UniCoTT: A Unified Framework for Structural Chain-of-Thought Distillation
ICLR 2025
Humanoid Policy ~ Human Policy
CORL 2025
DisPose: Disentangling Pose Guidance for Controllable Human Image Animation
ICLR 2025
ManiFlow: A General Robot Manipulation Policy via Consistency Flow Training
CORL 2025
PCAD: Towards ASR-Robust Spoken Language Understanding via Prototype Calibration and Asymmetric Decoupling
ACL 2024
Enhancing Dialogue State Tracking Models through LLM-backed User-Agents Simulation
ACL 2024
Soul-Mix: Enhancing Multimodal Machine Translation with Manifold Mixup
ACL 2024
Code-Switching Can be Better Aligners: Advancing Cross-Lingual SLU through Representation-Level and Prediction-Level Alignment
ACL 2024
Cyclical Contrastive Learning Based on Geodesic for Zero-shot Cross-lingual Spoken Language Understanding
ACL 2024
MoE-SLU: Towards ASR-Robust Spoken Language Understanding via Mixture-of-Experts
ACL 2024
Alignment before Awareness: Towards Visual Question Localized-Answering in Robotic Surgery via Optimal Transport and Answer Semantics
COLING 2024
Knowledge-enhanced Prompt Tuning for Dialogue-based Relation Extraction with Trigger and Label Semantic
COLING 2024
Towards Multi-modal Sarcasm Detection via Disentangled Multi-grained Multi-modal Distilling
COLING 2024
Zero-Shot Spoken Language Understanding via Large Language Models: A Preliminary Study
COLING 2024
KDProR: A Knowledge-Decoupling Probabilistic Framework for Video-Text Retrieval
ECCV 2024
Uncertainty-aware sign language video retrieval with probability distribution modeling
ECCV 2024
DiffATR: Diffusion-based Generative Modeling for Audio-Text Retrieval
INTERSPEECH 2024
Visual Whole-Body Control for Legged Loco-Manipulation
CORL 2024
Open-TeleVision: Teleoperation with Immersive Active Visual Feedback
CORL 2024
ACE: A Cross-platform and visual-Exoskeletons System for Low-Cost Dexterous Teleoperation
CORL 2024
Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning
AAAI 2024
Towards Multi-Intent Spoken Language Understanding via Hierarchical Attention and Optimal Transport
AAAI 2024
Exploiting Auxiliary Caption for Video Grounding
AAAI 2024
Aligner²: Enhancing Joint Multiple Intent Detection and Slot Filling via Adjustive and Forced Cross-Task Alignment
AAAI 2024
Towards Explainable Joint Models via Information Theory for Multiple Intent Detection and Slot Filling
AAAI 2024
What are the Generator Preferences for End-to-end Task-Oriented Dialog System?
EMNLP 2024
RAG-HAT: A Hallucination-Aware Tuning Pipeline for LLM in Retrieval-Augmented Generation
EMNLP 2024
Learning to Match Representations is Better for End-to-End Task-Oriented Dialog System
EMNLP 2024
PolyVoice: Language Models for Speech to Speech Translation
ICLR 2024
Retrieval is Accurate Generation
ICLR 2024
Generating More Audios for End-to-End Spoken Language Understanding
IJCAI 2024
Audio-text Retrieval with Transformer-based Hierarchical Alignment and Disentangled Cross-modal Representation
INTERSPEECH 2024
Multivariate Cooperative Game for Image-Report Pairs: Hierarchical Semantic Alignment for Medical Report Generation
MICCAI 2024
MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration
NAACL 2024
Expressive Whole-Body Control for Humanoid Robots
RSS 2024
Towards Unified Spoken Language Understanding Decoding via Label-aware Compact Linguistics Representations
ACL 2023
Enhancing Code-Switching for Cross-lingual SLU: A Unified View of Semantic and Grammatical Coherence
EMNLP 2023
ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding
ACL 2023
FC-MTLF: A Fine- and Coarse-grained Multi-Task Learning Framework for Cross-Lingual Spoken Language Understanding
INTERSPEECH 2023
C²A-SLU: Cross and Contrastive Attention for Improving ASR Robustness in Spoken Language Understanding
INTERSPEECH 2023
Accelerating Multiple Intent Detection and Slot Filling via Targeted Knowledge Distillation
EMNLP 2023
MRRL: Modifying the Reference via Reinforcement Learning for Non-Autoregressive Joint Multiple Intent Detection and Slot Filling
EMNLP 2023
Syntax Matters: Towards Spoken Language Understanding via Syntax-Aware Attention
EMNLP 2023
GhostT5: Generate More Features with Cheap Operations to Improve Textless Spoken Question Answering
INTERSPEECH 2023
Mix before Align: Towards Zero-shot Cross-lingual Sentiment Analysis via Soft-Mix and Multi-View Learning
INTERSPEECH 2023
MCLF: A Multi-grained Contrastive Learning Framework for ASR-robust Spoken Language Understanding
EMNLP 2023
Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
ICCV 2023
G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory
ICCV 2023
Discover and Align Taxonomic Context Priors for Open-world Semi-Supervised Learning
NIPS 2023
Deep Whole-Body Control: Learning a Unified Policy for Manipulation and Locomotion
CORL 2022