Xiaorui Wang
18 papers · 2018–2026 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (10) π§ Keyword Pioneer π Conference Polyglot (6)
π§
Keyword Pioneer
π
Renaissance Researcher
(5)
π§¬
Topic Evolution
π₯
Mega-Team
(20)
ποΈ
Keyword Collector
(82)
π
Century Club
(15)
β‘
Prolific Year
(6)
Conferences
INTERSPEECH (8)
ACL (3)
AAAI (2)
IJCAI (2)
EMNLP (1)
ICCV (1)
ICLR (1)
Top co-authors
Keywords
audio source separation
(3)
multi-task learning
(2)
transformer architecture
(2)
multimodal learning
(2)
signal-to-distortion ratio
(2)
image generation
(1)
few-shot learning
(1)
contrastive learning
(1)
speech recognition
(1)
spoken language understanding
(1)
data augmentation
(1)
cross-modal learning
(1)
channel attention
(1)
neural architecture search
(1)
web corpus
(1)
knowledge distillation
(1)
intent classification
(1)
language model alignment
(1)
visual grounding
(1)
feature extraction
(1)
Papers
MCP-AgentBench: Evaluating Real-World Language Agent Performance with MCP-Mediated Tools
AAAI 2026
Thinker: Training LLMs in Hierarchical Thinking for Deep Search via Multi-Turn Interaction
AAAI 2026
FS-Researcher: Test-Time Scaling for Long-Horizon Research Tasks with File-System-Based Agents
ACL 2026
MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning
IJCAI 2025
IterMeme: Expert-Guided Multimodal LLM for Interactive Meme Creation with Layout-Aware Generation
IJCAI 2025
From Real to Synthetic: Synthesizing Millions of Diversified and Complicated User Instructions with Attributed Grounding
ACL 2025
IGD: Instructional Graphic Design with Multimodal Layer Generation
ICCV 2025
SAGEPhos: Sage Bio-Coupled and Augmented Fusion for Phosphorylation Site Detection
ICLR 2025
Automated Creativity Evaluation for Large Language Models: A Reference-Based Approach
EMNLP 2025
Image-driven Audio-visual Universal Source Separation
INTERSPEECH 2023
Parameter-Efficient Conformers via Sharing Sparsely-Gated Experts for End-to-End Speech Recognition
INTERSPEECH 2022
ChipSong: A Controllable Lyric Generation System for Chinese Popular Song
ACL 2022
Improving Spoken Language Understanding with Cross-Modal Contrastive Learning
INTERSPEECH 2022
iCNN-Transformer: An improved CNN-Transformer with Channel-spatial Attention and Keyword Prediction for Automated Audio Captioning
INTERSPEECH 2022
Conformer Space Neural Architecture Search for Multi-Task Audio Separation
INTERSPEECH 2022
WA-Transformer: Window Attention-based Transformer with Two-stage Strategy for Multi-task Audio Source Separation
INTERSPEECH 2022
Dynamic Multi-Scale Convolution for Dialect Identification
INTERSPEECH 2021
Gated Recurrent Unit Based Acoustic Modeling with Future Context
INTERSPEECH 2018