Xunying Liu
55 papers · 2016–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (22) π§ Keyword Pioneer π Renaissance Researcher (5) π Interdisciplinary Bridge π£ Hot Topic Early Bird
π
Conference Polyglot
(5)
π
Cross-Pollinator
(10)
πΊοΈ
Taxonomy Completionist
(22)
π
Conference Loyalist
(51)
π€
Dynamic Duo
(40)
π§¬
Topic Evolution
π
Keyword Champion
(3)
π₯
Mega-Team
(20)
π¬
Deep Specialist
(19)
π
Trend Setter
π
Conference Pioneer
π₯
Unstoppable
(10)
β‘
Prolific Year
(9)
π
Century Club
(55)
ποΈ
Keyword Collector
(71)
Conferences
INTERSPEECH (51)
ACL (1)
AISTATS (1)
CVPR (1)
EMNLP (1)
Top co-authors
Keywords
automatic speech recognition
(14)
speaker adaptation
(13)
speech recognition
(13)
deep neural network
(5)
voice conversion
(5)
domain adaptation
(5)
word error rate
(5)
neural architecture search
(4)
disordered speech recognition
(4)
disordered speech
(4)
speech separation
(4)
dysarthric speech
(4)
dysarthric speech recognition
(4)
bayesian inference
(4)
bayesian learning
(3)
cross-domain adaptation
(3)
speaker identity
(3)
acoustic model
(3)
data augmentation
(3)
recurrent neural network
(3)
Papers
GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement
ACL 2025
Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System
INTERSPEECH 2024
WavLLM: Towards Robust and Adaptive Speech Large Language Model
EMNLP 2024
One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model
INTERSPEECH 2024
Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition
INTERSPEECH 2024
Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition
INTERSPEECH 2024
Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask
INTERSPEECH 2024
Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders
INTERSPEECH 2023
Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition
INTERSPEECH 2023
Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems
INTERSPEECH 2023
Lossless 4-bit Quantization of Architecture Compressed Conformer ASR Systems on the 300-hr Switchboard Corpus
INTERSPEECH 2023
Use of Speech Impairment Severity for Dysarthric Speech Recognition
INTERSPEECH 2023
Exploiting Cross-Domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition
INTERSPEECH 2023
Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems
INTERSPEECH 2023
On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition
INTERSPEECH 2023
Towards Green ASR: Lossless 4-bit Quantization of a Hybrid TDNN System on the 300-hr Swithboard Corpus
INTERSPEECH 2022
Conformer Based Elderly Speech Recognition System for Alzheimerβs Disease Detection
INTERSPEECH 2022
Exploring linguistic feature and model combination for speech recognition based automatic AD detection
INTERSPEECH 2022
Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems
INTERSPEECH 2022
Confidence Score Based Conformer Speaker Adaptation for Speech Recognition
INTERSPEECH 2022
Context-aware Multimodal Fusion for Emotion Recognition
INTERSPEECH 2022
Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition
INTERSPEECH 2021
Understanding the wiring evolution in differentiable neural architecture search
AISTATS 2021
VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-Shot Voice Conversion
INTERSPEECH 2021
Unsupervised Domain Adaptation for Dysarthric Speech Detection via Domain Adversarial Training and Mutual Information Minimization
INTERSPEECH 2021
VAENAR-TTS: Variational Auto-Encoder Based Non-AutoRegressive Text-to-Speech Synthesis
INTERSPEECH 2021
Channel-Wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks
INTERSPEECH 2021
Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition
INTERSPEECH 2021
Adversarial Data Augmentation for Disordered Speech Recognition
INTERSPEECH 2021
Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion
INTERSPEECH 2021
Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition
INTERSPEECH 2021
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification
INTERSPEECH 2020
Speaker-Aware Linear Discriminant Analysis in Speaker Verification
INTERSPEECH 2020
Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition
INTERSPEECH 2020
Audio-Visual Multi-Channel Recognition of Overlapped Speech
INTERSPEECH 2020
DSNAS: Direct Neural Architecture Search Without Parameter Retraining
CVPR 2020
Investigation of Data Augmentation Techniques for Disordered Speech Recognition
INTERSPEECH 2020
Transferring Source Style in Non-Parallel Voice Conversion
INTERSPEECH 2020
Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition
INTERSPEECH 2019
The CUHK Dysarthric Speech Recognition Systems for English and Cantonese
INTERSPEECH 2019
Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models
INTERSPEECH 2019
Unsupervised Methods for Audio Classification from Lecture Discussion Recordings
INTERSPEECH 2019
LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition
INTERSPEECH 2019
Extract, Adapt and Recognize: An End-to-End Neural Network for Corrupted Monaural Speech Recognition
INTERSPEECH 2019
Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features
INTERSPEECH 2019
Jointly Trained Conversion Model and WaveNet Vocoder for Non-Parallel Voice Conversion Using Mel-Spectrograms and Phonetic Posteriorgrams
INTERSPEECH 2019
On the Use of Pitch Features for Disordered Speech Recognition
INTERSPEECH 2019
Gaussian Process Neural Networks for Speech Recognition
INTERSPEECH 2018
Semi-supervised Cross-domain Visual Feature Learning for Audio-Visual Broadcast Speech Transcription
INTERSPEECH 2018
Unsupervised Discovery of Non-native Phonetic Patterns in L2 English Speech for Mispronunciation Detection and Diagnosis
INTERSPEECH 2018
Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance
INTERSPEECH 2018
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis
INTERSPEECH 2018
Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus
INTERSPEECH 2018
RNN-LDA Clustering for Feature Based DNN Adaptation
INTERSPEECH 2017
Deep Neural Network Based Acoustic-to-Articulatory Inversion Using Phone Sequence Information
INTERSPEECH 2016