Yonghong Yan

36 papers · 2015–2022 · 4 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (11) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (4)

🗺️ Taxonomy Completionist (11) 🧭 Keyword Pioneer 🏃 Academic Marathon (7) 🏠 Conference Loyalist (31) 🤝 Dynamic Duo (18) 🔬 Deep Specialist (10) 🧬 Topic Evolution 🚀 Conference Pioneer ⚡ Prolific Year (7) 🗃️ Keyword Collector (146) 💎 Century Club (36) 🔥 Unstoppable (5)

Conferences

INTERSPEECH (31) SEMEVAL (3) CONLL (1) EMNLP (1)

Top co-authors

Pengyuan Zhang (18) Gaofeng Cheng (11) Ta Li (4) Ziteng Wang (3) Li Wang (3) Lingxuan Ye (3) Han Zhu (3) Junfeng Li (3) Zhihua Huang (3) Xuemin Zhao (3)

Keywords

automatic speech recognition (9) attention mechanism (4) deep neural network (4) end-to-end speech recognition (4) speech recognition (4) bidirectional lstm (4) convolutional neural network (4) long short-term memory (3) acoustic model (3) model compression (3) speech enhancement (3) distant speech recognition (3) word error rate (2) multi-task learning (2) non-negative matrix factorization (2) bidirectional long short-term memory (2) speech separation (2) adversarial learning (2) connectionist temporal classification (2) speaker diarization (2)

Papers

Knowledge Distillation For CTC-based Speech Recognition Via Consistent Acoustic Representation Learning INTERSPEECH 2022 Robust Cough Feature Extraction and Classification Method for COVID-19 Cough Detection Based on Vocalization Characteristics INTERSPEECH 2022 Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies INTERSPEECH 2022 Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR INTERSPEECH 2022 Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods INTERSPEECH 2022 Decoupled Federated Learning for ASR with Non-IID Data INTERSPEECH 2022 NAS-SCAE: Searching Compact Attention-based Encoders For End-to-end Automatic Speech Recognition INTERSPEECH 2022 Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization INTERSPEECH 2022 Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset INTERSPEECH 2022 Residual Echo and Noise Cancellation with Feature Attention Module and Multi-Domain Loss Function INTERSPEECH 2021 Incorporating Cross-Speaker Style Transfer for Multi-Language Text-to-Speech INTERSPEECH 2021 LinearSpeech: Parallel Text-to-Speech with Linear Complexity INTERSPEECH 2021 Decomposing Complex Questions Makes Multi-Hop QA Easier and More Interpretable EMNLP 2021 Target Speaker Recovery and Recognition Network with Average x-Vector and Global Training INTERSPEECH 2019 A New Time-Frequency Attention Mechanism for TDNN and CNN-LSTM-TDNN, with Application to Language Identification INTERSPEECH 2019 Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning INTERSPEECH 2019 Multi-Accent Adaptation Based on Gate Mechanism INTERSPEECH 2019 Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition INTERSPEECH 2019 Character-Aware Sub-Word Level Language Modeling for Uyghur and Turkish ASR INTERSPEECH 2019 HCCL at SemEval-2018 Task 8: An End-to-End System for Sequence Labeling from Cybersecurity Reports SEMEVAL 2018 Cross-Lingual Multi-Task Neural Architecture for Spoken Language Understanding INTERSPEECH 2018 Multi-talker Speech Separation Based on Permutation Invariant Training and Beamforming INTERSPEECH 2018 Output-Gate Projected Gated Recurrent Unit for Speech Recognition INTERSPEECH 2018 Investigation on the Combination of Batch Normalization and Dropout in BLSTM-based Acoustic Modeling for ASR INTERSPEECH 2018 Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling INTERSPEECH 2018 Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition INTERSPEECH 2018 Joint Training of Multi-Channel-Condition Dereverberation and Acoustic Modeling of Microphone Array Speech for Robust Distant Speech Recognition INTERSPEECH 2017 HCCL at SemEval-2017 Task 2: Combining Multilingual Word Embeddings and Transliteration Model for Semantic Similarity SEMEVAL 2017 Time Delay Histogram Based Speech Source Separation Using a Planar Array INTERSPEECH 2017 An Exploration of Dropout with LSTMs INTERSPEECH 2017 Ideal Ratio Mask Estimation Using Deep Neural Networks for Monaural Speech Segregation in Noisy Reverberant Conditions INTERSPEECH 2017 Attention-Based LSTM with Multi-Task Learning for Distant Speech Recognition INTERSPEECH 2017 A DNN-HMM Approach to Non-Negative Matrix Factorization Based Speech Enhancement INTERSPEECH 2016 Adaptive Group Sparsity for Non-Negative Matrix Factorization with Application to Unsupervised Source Separation INTERSPEECH 2016 IOA: Improving SVM Based Sentiment Classification Through Post Processing SEMEVAL 2015 A Shallow Discourse Parsing System Based On Maximum Entropy Model CONLL 2015