Yao Qian

21 papers · 2016–2025 · 7 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🗺️ Taxonomy Completionist (12) 🌍 Conference Polyglot (7)

🗺️ Taxonomy Completionist (12) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🧬 Topic Evolution 🏆 Keyword Champion 👥 Mega-Team (20) 🗃️ Keyword Collector (115) 📈 Trend Setter 💎 Century Club (21) 🔥 Unstoppable (10) 🚀 Conference Pioneer

Conferences

INTERSPEECH (10) NIPS (3) ACL (2) EMNLP (2) NAACL (2) AAAI (1) ICML (1)

Top co-authors

Michael Zeng (9) Keelan Evanini (7) Shujie LIU (6) Xuedong Huang (5) Long Zhou (5) Xinhao Wang (5) Yanmin Qian (4) Jinyu Li (4) Takuya Yoshioka (4) Yichong Xu (3)

Keywords

speech recognition (6) multimodal learning (4) automatic speech recognition (4) multi-task learning (4) non-native speech (3) transfer learning (3) self-supervised learning (2) language proficiency (2) language model (2) bidirectional lstm (2) spoken language (2) feature fusion (2) speaker recognition (2) speech synthesis (2) speech-to-speech translation (2) spoken language assessment (2) text generation (1) speech processing (1) cross-lingual transfer (1) attention mechanism (1)

Papers

Audio-Aware Large Language Models as Judges for Speaking Styles EMNLP 2025 i-Code Studio: A Configurable and Composable Framework for Integrative AI EMNLP 2024 CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations NIPS 2024 TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation NIPS 2024 i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data NAACL 2024 ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation NIPS 2023 i-Code: An Integrative and Composable Multimodal Learning Framework AAAI 2023 Adapting Multi-Lingual ASR Models for Handling Multiple Talkers INTERSPEECH 2023 Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data INTERSPEECH 2022 SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing ACL 2022 UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data ICML 2021 Discriminative Transfer Learning for Optimizing ASR and Semantic Labeling in Task-Oriented Spoken Dialog INTERSPEECH 2020 Application of an Automatic Plagiarism Detection System in a Large-scale Assessment of English Speaking Proficiency ACL 2019 Automated Estimation of Oral Reading Fluency During Summer Camp e-Book Reading with MyTurnToRead INTERSPEECH 2019 Automatic Detection of Off-Topic Spoken Responses Using Very Deep Convolutional Neural Networks INTERSPEECH 2019 Improvements to an Automated Content Scoring System for Spoken CALL Responses: the ETS Submission to the Second Spoken CALL Shared Task INTERSPEECH 2018 Bidirectional LSTM-RNN for Improving Automated Assessment of Non-Native Children’s Speech INTERSPEECH 2017 Improving Sub-Phone Modeling for Better Native Language Identification with Non-Native English Speech INTERSPEECH 2017 Noise and Metadata Sensitive Bottleneck Features for Improving Speaker Recognition with Non-Native Speech Input INTERSPEECH 2016 Learning Distributed Word Representations For Bidirectional LSTM Recurrent Neural Network NAACL 2016 Self-Adaptive DNN for Improving Spoken Language Proficiency Assessment INTERSPEECH 2016