Xiaorui Wang

18 papers · 2018–2026 · 7 conferences · across top CS/AI conferences

Achievements

+7 more ↓

🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🧭 Keyword Pioneer 🌍 Conference Polyglot (6)

🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🧬 Topic Evolution 👥 Mega-Team (20) 🗃️ Keyword Collector (82) 💎 Century Club (15) ⚡ Prolific Year (6)

Conferences

INTERSPEECH (8) ACL (3) AAAI (2) IJCAI (2) EMNLP (1) ICCV (1) ICLR (1)

Top co-authors

Zhendong Mao (5) Benfeng Xu (5) Feng Deng (4) Chiwei Zhu (4) Yang Wang (3) Chenxing Li (3) Peng Yao (2) Shancheng Fang (2) Jianchao Tan (2) Yongdong Zhang (2)

Keywords

audio source separation (3) multi-task learning (2) transformer architecture (2) multimodal learning (2) signal-to-distortion ratio (2) image generation (1) few-shot learning (1) contrastive learning (1) speech recognition (1) spoken language understanding (1) data augmentation (1) cross-modal learning (1) channel attention (1) neural architecture search (1) web corpus (1) knowledge distillation (1) intent classification (1) language model alignment (1) visual grounding (1) feature extraction (1)

Papers

MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools AAAI 2026 Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction AAAI 2026 FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents ACL 2026 MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning IJCAI 2025 IterMeme: Expert-Guided Multimodal LLM for Interactive Meme Creation with Layout-Aware Generation IJCAI 2025 From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding ACL 2025 IGD: Instructional Graphic Design with Multimodal Layer Generation ICCV 2025 SAGEPhos: Sage Bio-Coupled and Augmented Fusion for Phosphorylation Site Detection ICLR 2025 Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach EMNLP 2025 Image-driven Audio-visual Universal Source Separation INTERSPEECH 2023 Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition INTERSPEECH 2022 ChipSong: A Controllable Lyric Generation System for Chinese Popular Song ACL 2022 Improving Spoken Language Understanding with Cross-Modal Contrastive Learning INTERSPEECH 2022 iCNN-Transformer: An improved CNN-Transformer with Channel-spatial Attention and Keyword Prediction for Automated Audio Captioning INTERSPEECH 2022 Conformer Space Neural Architecture Search for Multi-Task Audio Separation INTERSPEECH 2022 WA-Transformer: Window Attention-based Transformer with Two-stage Strategy for Multi-task Audio Source Separation INTERSPEECH 2022 Dynamic Multi-Scale Convolution for Dialect Identification INTERSPEECH 2021 Gated Recurrent Unit Based Acoustic Modeling with Future Context INTERSPEECH 2018