Xiong Wang
12 papers · 2021–2026 · 5 conferences · across top CS/AI conferences
Achievements
Renaissance Researcher (6)
Interdisciplinary Bridge
Taxonomy Completionist (10)
Keyword Pioneer
Conference Polyglot (5)
Hot Topic Early Bird
Cross-Pollinator (11)
Century Club (11)
Keyword Collector (57)
Unstoppable (5)
Conferences
INTERSPEECH (7)
ACL (2)
ICML (1)
IJCAI (1)
JMLR (1)
Keywords
connectionist temporal classification (2)
speech large language model (2)
end-to-end model (2)
keyword spotting (2)
automatic speech recognition (2)
end-to-end speech recognition (2)
multimodal learning (1)
speech processing (1)
bayesian inference (1)
speech enhancement (1)
kernel learning (1)
reproducing kernel hilbert space (1)
discriminative training (1)
speech recognition (1)
regret bound (1)
instruction following (1)
multi-armed bandit (1)
inverse problem (1)
memory efficiency (1)
noise robustness (1)
Papers
LLM-ForcedAligner: A Non-Autoregressive and Accurate LLM-Based Forced Aligner for Multilingual and Long-Form Speech
ACL 2026
Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM
ICML 2025
InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training
ACL 2025
A Transcription Prompt-based Efficient Audio Large Language Model for Robust Speech Recognition
INTERSPEECH 2024
A Data-Adaptive RKHS Prior for Bayesian Learning of Kernels in Operators
JMLR 2024
DCCRN-KWS: An Audio Bias Based Model for Noise Robust Small-Footprint Keyword Spotting
INTERSPEECH 2023
Two Stage Contextual Word Filtering for Context Bias in Unified Streaming and Non-streaming Transducer
INTERSPEECH 2023
CaTT-KWS: A Multi-stage Customized Keyword Spotting Framework based on Cascaded Transducer-Transformer
INTERSPEECH 2022
Minimizing Sequential Confusion Error in Speech Command Recognition
INTERSPEECH 2022
Mean Field Equilibrium in Multi-Armed Bandit Game with Continuous Reward
IJCAI 2021
Efficient Conformer with Prob-Sparse Attention Mechanism for End-to-End Speech Recognition
INTERSPEECH 2021
WeNet: Production Oriented Streaming and Non-Streaming End-to-End Speech Recognition Toolkit
INTERSPEECH 2021