Yao Qian
21 papers · 2016–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) πΊοΈ Taxonomy Completionist (12) π Conference Polyglot (7)
πΊοΈ
Taxonomy Completionist
(12)
π§
Keyword Pioneer
π
Interdisciplinary Bridge
π§¬
Topic Evolution
π
Keyword Champion
π₯
Mega-Team
(20)
ποΈ
Keyword Collector
(115)
π
Trend Setter
π
Century Club
(21)
π₯
Unstoppable
(10)
π
Conference Pioneer
Conferences
INTERSPEECH (10)
NIPS (3)
ACL (2)
EMNLP (2)
NAACL (2)
AAAI (1)
ICML (1)
Top co-authors
Keywords
speech recognition
(6)
multimodal learning
(4)
automatic speech recognition
(4)
multi-task learning
(4)
non-native speech
(3)
transfer learning
(3)
self-supervised learning
(2)
language proficiency
(2)
language model
(2)
bidirectional lstm
(2)
spoken language
(2)
feature fusion
(2)
speaker recognition
(2)
speech synthesis
(2)
speech-to-speech translation
(2)
spoken language assessment
(2)
text generation
(1)
speech processing
(1)
cross-lingual transfer
(1)
attention mechanism
(1)
Papers
Audio-Aware Large Language Models as Judges for Speaking Styles
EMNLP 2025
i-Code Studio: A Configurable and Composable Framework for Integrative AI
EMNLP 2024
CoVoMix: Advancing Zero-Shot Speech Generation for Human-like Multi-talker Conversations
NIPS 2024
TransVIP: Speech to Speech Translation System with Voice and Isochrony Preservation
NIPS 2024
i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data
NAACL 2024
ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation
NIPS 2023
i-Code: An Integrative and Composable Multimodal Learning Framework
AAAI 2023
Adapting Multi-Lingual ASR Models for Handling Multiple Talkers
INTERSPEECH 2023
Pre-Training Transformer Decoder for End-to-End ASR Model with Unpaired Speech Data
INTERSPEECH 2022
SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language Processing
ACL 2022
UniSpeech: Unified Speech Representation Learning with Labeled and Unlabeled Data
ICML 2021
Discriminative Transfer Learning for Optimizing ASR and Semantic Labeling in Task-Oriented Spoken Dialog
INTERSPEECH 2020
Application of an Automatic Plagiarism Detection System in a Large-scale Assessment of English Speaking Proficiency
ACL 2019
Automated Estimation of Oral Reading Fluency During Summer Camp e-Book Reading with MyTurnToRead
INTERSPEECH 2019
Automatic Detection of Off-Topic Spoken Responses Using Very Deep Convolutional Neural Networks
INTERSPEECH 2019
Improvements to an Automated Content Scoring System for Spoken CALL Responses: the ETS Submission to the Second Spoken CALL Shared Task
INTERSPEECH 2018
Bidirectional LSTM-RNN for Improving Automated Assessment of Non-Native Childrenβs Speech
INTERSPEECH 2017
Improving Sub-Phone Modeling for Better Native Language Identification with Non-Native English Speech
INTERSPEECH 2017
Noise and Metadata Sensitive Bottleneck Features for Improving Speaker Recognition with Non-Native Speech Input
INTERSPEECH 2016
Learning Distributed Word Representations For Bidirectional LSTM Recurrent Neural Network
NAACL 2016
Self-Adaptive DNN for Improving Spoken Language Proficiency Assessment
INTERSPEECH 2016