Shi-Xiong Zhang
21 papers · 2019–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+9 more ↓ Show less ↑
π Conference Polyglot (4) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12) π§ Keyword Pioneer π Academic Marathon (6)
π
Cross-Pollinator
(9)
π
Renaissance Researcher
(6)
π
Interdisciplinary Bridge
π
Keyword Champion
(3)
π₯
Mega-Team
(51)
π€
Dynamic Duo
(14)
β‘
Prolific Year
(5)
ποΈ
Keyword Collector
(82)
π
Century Club
(19)
Conferences
INTERSPEECH (16)
AAAI (1)
ACL (1)
EACL (1)
ICLR (1)
NAACL (1)
Top co-authors
Keywords
speech separation
(8)
spatial feature
(3)
automatic speech recognition
(3)
large language model
(3)
speech recognition
(3)
recurrent neural network
(2)
contrastive learning
(2)
speaker diarization
(2)
minimum variance distortionless response
(2)
multi-channel speech
(2)
multi-channel processing
(2)
directional feature
(2)
multi-channel audio
(2)
speech enhancement
(2)
evaluation methodology
(1)
visual question answering
(1)
multimodal learning
(1)
source separation
(1)
multi-task learning
(1)
cross-domain learning
(1)
Papers
Routing with Generated Data: Annotation-Free LLM Skill Estimation and Expert Selection
ACL 2026
Lessons from the Field: An Adaptable Lifecycle Approach to Applied Dialogue Summarization
EACL 2026
WorldCuisines: A Massive-Scale Benchmark for Multilingual and Multicultural Visual Question Answering on Global Cuisines
NAACL 2025
RainbowPO: A Unified Framework for Combining Improvements in Preference Optimization
ICLR 2025
SECap: Speech Emotion Captioning with Large Language Model
AAAI 2024
LibriheavyMix: A 20,000-Hour Dataset for Single-Channel Reverberant Multi-Talker Speech Separation, ASR and Speaker Diarization
INTERSPEECH 2024
Comparing Discrete and Continuous Space LLMs for Speech Recognition
INTERSPEECH 2024
RIR-SF: Room Impulse Response Based Spatial Feature for Target Speech Recognition in Multi-Channel Multi-Speaker Scenarios
INTERSPEECH 2024
Multi-Channel Multi-Speaker ASR Using Target Speakerβs Solo Segment
INTERSPEECH 2024
Joint Neural AEC and Beamforming with Double-Talk Detection
INTERSPEECH 2022
MetricNet: Towards Improved Modeling For Non-Intrusive Speech Quality Assessment
INTERSPEECH 2021
Generalized Spatio-Temporal RNN Beamformer for Target Speech Separation
INTERSPEECH 2021
Multi-Channel Speaker Verification for Single and Multi-Talker Speech
INTERSPEECH 2021
MIMO Self-Attentive RNN Beamformer for Multi-Speaker Speech Separation
INTERSPEECH 2021
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation
INTERSPEECH 2021
Neural Spatio-Temporal Beamformer for Target Speech Separation
INTERSPEECH 2020
Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition
INTERSPEECH 2020
Audio-Visual Multi-Channel Recognition of Overlapped Speech
INTERSPEECH 2020
A Comprehensive Study of Speech Separation: Spectrogram vs Waveform Separation
INTERSPEECH 2019
Neural Spatial Filter: Target Speaker Speech Separation Assisted with Directional Information
INTERSPEECH 2019
Improved Speaker-Dependent Separation for CHiME-5 Challenge
INTERSPEECH 2019