conftrace_

Xunying Liu

55 papers · 2016–2025 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+15 more ↓

🗺️ Taxonomy Completionist (22) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🌍 Conference Polyglot (5) 🐝 Cross-Pollinator (10) 🗺️ Taxonomy Completionist (22) 🏠 Conference Loyalist (51) 🤝 Dynamic Duo (40) 🧬 Topic Evolution 🏆 Keyword Champion (3) 👥 Mega-Team (20) 🔬 Deep Specialist (19) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (10) ⚡ Prolific Year (9) 💎 Century Club (55) 🗃️ Keyword Collector (71)

Conferences

INTERSPEECH (51) ACL (1) AISTATS (1) CVPR (1) EMNLP (1)

Top co-authors

Helen Meng (40) Xurong Xie (21) Shoukang Hu (21) Mengzhe Geng (17) Xixin Wu (16) Jianwei Yu (14) Tianzi Wang (14) Jiajun Deng (13) Zengrui Jin (12) Shansong Liu (11)

Keywords

automatic speech recognition (14) speaker adaptation (13) speech recognition (13) deep neural network (5) voice conversion (5) domain adaptation (5) word error rate (5) neural architecture search (4) disordered speech recognition (4) disordered speech (4) speech separation (4) dysarthric speech (4) dysarthric speech recognition (4) bayesian inference (4) bayesian learning (3) cross-domain adaptation (3) speaker identity (3) acoustic model (3) data augmentation (3) recurrent neural network (3)

Papers

GigaSpeech 2: An Evolving, Large-Scale and Multi-domain ASR Corpus for Low-Resource Languages with Automated Crawling, Transcription and Refinement ACL 2025 Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System INTERSPEECH 2024 WavLLM: Towards Robust and Adaptive Speech Large Language Model EMNLP 2024 One-pass Multiple Conformer and Foundation Speech Systems Compression and Quantization Using An All-in-one Neural Model INTERSPEECH 2024 Perceiver-Prompt: Flexible Speaker Adaptation in Whisper for Chinese Disordered Speech Recognition INTERSPEECH 2024 Joint Speaker Features Learning for Audio-visual Multichannel Speech Separation and Recognition INTERSPEECH 2024 Towards Effective and Efficient Non-autoregressive Decoding Using Block-based Attention Mask INTERSPEECH 2024 Integrated and Enhanced Pipeline System to Support Spoken Language Analytics for Screening Neurocognitive Disorders INTERSPEECH 2023 Hyper-parameter Adaptation of Conformer ASR Systems for Elderly and Dysarthric Speech Recognition INTERSPEECH 2023 Factorised Speaker-environment Adaptive Training of Conformer Speech Recognition Systems INTERSPEECH 2023 Lossless 4-bit Quantization of Architecture Compressed Conformer ASR Systems on the 300-hr Switchboard Corpus INTERSPEECH 2023 Use of Speech Impairment Severity for Dysarthric Speech Recognition INTERSPEECH 2023 Exploiting Cross-Domain And Cross-Lingual Ultrasound Tongue Imaging Features For Elderly And Dysarthric Speech Recognition INTERSPEECH 2023 Towards Effective and Compact Contextual Representation for Conformer Transducer Speech Recognition Systems INTERSPEECH 2023 On-the-Fly Feature Based Rapid Speaker Adaptation for Dysarthric and Elderly Speech Recognition INTERSPEECH 2023 Towards Green ASR: Lossless 4-bit Quantization of a Hybrid TDNN System on the 300-hr Swithboard Corpus INTERSPEECH 2022 Conformer Based Elderly Speech Recognition System for Alzheimer’s Disease Detection INTERSPEECH 2022 Exploring linguistic feature and model combination for speech recognition based automatic AD detection INTERSPEECH 2022 Two-pass Decoding and Cross-adaptation Based System Combination of End-to-end Conformer and Hybrid TDNN ASR Systems INTERSPEECH 2022 Confidence Score Based Conformer Speaker Adaptation for Speech Recognition INTERSPEECH 2022 Context-aware Multimodal Fusion for Emotion Recognition INTERSPEECH 2022 Variational Auto-Encoder Based Variability Encoding for Dysarthric Speech Recognition INTERSPEECH 2021 Understanding the wiring evolution in differentiable neural architecture search AISTATS 2021 VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-Shot Voice Conversion INTERSPEECH 2021 Unsupervised Domain Adaptation for Dysarthric Speech Detection via Domain Adversarial Training and Mutual Information Minimization INTERSPEECH 2021 VAENAR-TTS: Variational Auto-Encoder Based Non-AutoRegressive Text-to-Speech Synthesis INTERSPEECH 2021 Channel-Wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks INTERSPEECH 2021 Spectro-Temporal Deep Features for Disordered Speech Assessment and Recognition INTERSPEECH 2021 Adversarial Data Augmentation for Disordered Speech Recognition INTERSPEECH 2021 Learning Explicit Prosody Models and Deep Speaker Embeddings for Atypical Voice Conversion INTERSPEECH 2021 Bayesian Parametric and Architectural Domain Adaptation of LF-MMI Trained TDNNs for Elderly and Dysarthric Speech Recognition INTERSPEECH 2021 Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification INTERSPEECH 2020 Speaker-Aware Linear Discriminant Analysis in Speaker Verification INTERSPEECH 2020 Exploiting Cross-Domain Visual Feature Generation for Disordered Speech Recognition INTERSPEECH 2020 Audio-Visual Multi-Channel Recognition of Overlapped Speech INTERSPEECH 2020 DSNAS: Direct Neural Architecture Search Without Parameter Retraining CVPR 2020 Investigation of Data Augmentation Techniques for Disordered Speech Recognition INTERSPEECH 2020 Transferring Source Style in Non-Parallel Voice Conversion INTERSPEECH 2020 Exploiting Visual Features Using Bayesian Gated Neural Networks for Disordered Speech Recognition INTERSPEECH 2019 The CUHK Dysarthric Speech Recognition Systems for English and Cantonese INTERSPEECH 2019 Comparative Study of Parametric and Representation Uncertainty Modeling for Recurrent Neural Network Language Models INTERSPEECH 2019 Unsupervised Methods for Audio Classification from Lecture Discussion Recordings INTERSPEECH 2019 LF-MMI Training of Bayesian and Gaussian Process Time Delay Neural Networks for Speech Recognition INTERSPEECH 2019 Extract, Adapt and Recognize: An End-to-End Neural Network for Corrupted Monaural Speech Recognition INTERSPEECH 2019 Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features INTERSPEECH 2019 Jointly Trained Conversion Model and WaveNet Vocoder for Non-Parallel Voice Conversion Using Mel-Spectrograms and Phonetic Posteriorgrams INTERSPEECH 2019 On the Use of Pitch Features for Disordered Speech Recognition INTERSPEECH 2019 Gaussian Process Neural Networks for Speech Recognition INTERSPEECH 2018 Semi-supervised Cross-domain Visual Feature Learning for Audio-Visual Broadcast Speech Transcription INTERSPEECH 2018 Unsupervised Discovery of Non-native Phonetic Patterns in L2 English Speech for Mispronunciation Detection and Diagnosis INTERSPEECH 2018 Voice Conversion Across Arbitrary Speakers Based on a Single Target-Speaker Utterance INTERSPEECH 2018 Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis INTERSPEECH 2018 Development of the CUHK Dysarthric Speech Recognition System for the UA Speech Corpus INTERSPEECH 2018 RNN-LDA Clustering for Feature Based DNN Adaptation INTERSPEECH 2017 Deep Neural Network Based Acoustic-to-Articulatory Inversion Using Phone Sequence Information INTERSPEECH 2016