Yonghong Yan
36 papers · 2015–2022 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (11) π Renaissance Researcher (5) π Interdisciplinary Bridge π Conference Polyglot (4)
πΊοΈ
Taxonomy Completionist
(11)
π§
Keyword Pioneer
π
Academic Marathon
(7)
π
Conference Loyalist
(31)
π€
Dynamic Duo
(18)
π¬
Deep Specialist
(10)
π§¬
Topic Evolution
π
Conference Pioneer
β‘
Prolific Year
(7)
ποΈ
Keyword Collector
(146)
π
Century Club
(36)
π₯
Unstoppable
(5)
Conferences
INTERSPEECH (31)
SEMEVAL (3)
CONLL (1)
EMNLP (1)
Top co-authors
Keywords
automatic speech recognition
(9)
attention mechanism
(4)
deep neural network
(4)
end-to-end speech recognition
(4)
speech recognition
(4)
bidirectional lstm
(4)
convolutional neural network
(4)
long short-term memory
(3)
acoustic model
(3)
model compression
(3)
speech enhancement
(3)
distant speech recognition
(3)
word error rate
(2)
multi-task learning
(2)
non-negative matrix factorization
(2)
bidirectional long short-term memory
(2)
speech separation
(2)
adversarial learning
(2)
connectionist temporal classification
(2)
speaker diarization
(2)
Papers
Knowledge Distillation For CTC-based Speech Recognition Via Consistent Acoustic Representation Learning
INTERSPEECH 2022
Robust Cough Feature Extraction and Classification Method for COVID-19 Cough Detection Based on Vocalization Characteristics
INTERSPEECH 2022
Improving Streaming End-to-End ASR on Transformer-based Causal Models with Encoder States Revision Strategies
INTERSPEECH 2022
Wav2vec-S: Semi-Supervised Pre-Training for Low-Resource ASR
INTERSPEECH 2022
Improving Recognition of Out-of-vocabulary Words in E2E Code-switching ASR by Fusing Speech Generation Methods
INTERSPEECH 2022
Decoupled Federated Learning for ASR with Non-IID Data
INTERSPEECH 2022
NAS-SCAE: Searching Compact Attention-based Encoders For End-to-end Automatic Speech Recognition
INTERSPEECH 2022
Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization
INTERSPEECH 2022
Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset
INTERSPEECH 2022
Residual Echo and Noise Cancellation with Feature Attention Module and Multi-Domain Loss Function
INTERSPEECH 2021
Incorporating Cross-Speaker Style Transfer for Multi-Language Text-to-Speech
INTERSPEECH 2021
LinearSpeech: Parallel Text-to-Speech with Linear Complexity
INTERSPEECH 2021
Decomposing Complex Questions Makes Multi-Hop QA Easier and More Interpretable
EMNLP 2021
Target Speaker Recovery and Recognition Network with Average x-Vector and Global Training
INTERSPEECH 2019
A New Time-Frequency Attention Mechanism for TDNN and CNN-LSTM-TDNN, with Application to Language Identification
INTERSPEECH 2019
Speaker-Invariant Feature-Mapping for Distant Speech Recognition via Adversarial Teacher-Student Learning
INTERSPEECH 2019
Multi-Accent Adaptation Based on Gate Mechanism
INTERSPEECH 2019
Online Hybrid CTC/Attention Architecture for End-to-End Speech Recognition
INTERSPEECH 2019
Character-Aware Sub-Word Level Language Modeling for Uyghur and Turkish ASR
INTERSPEECH 2019
HCCL at SemEval-2018 Task 8: An End-to-End System for Sequence Labeling from Cybersecurity Reports
SEMEVAL 2018
Cross-Lingual Multi-Task Neural Architecture for Spoken Language Understanding
INTERSPEECH 2018
Multi-talker Speech Separation Based on Permutation Invariant Training and Beamforming
INTERSPEECH 2018
Output-Gate Projected Gated Recurrent Unit for Speech Recognition
INTERSPEECH 2018
Investigation on the Combination of Batch Normalization and Dropout in BLSTM-based Acoustic Modeling for ASR
INTERSPEECH 2018
Deep Convolutional Neural Network with Scalogram for Audio Scene Modeling
INTERSPEECH 2018
Improving Language Modeling with an Adversarial Critic for Automatic Speech Recognition
INTERSPEECH 2018
Joint Training of Multi-Channel-Condition Dereverberation and Acoustic Modeling of Microphone Array Speech for Robust Distant Speech Recognition
INTERSPEECH 2017
HCCL at SemEval-2017 Task 2: Combining Multilingual Word Embeddings and Transliteration Model for Semantic Similarity
SEMEVAL 2017
Time Delay Histogram Based Speech Source Separation Using a Planar Array
INTERSPEECH 2017
An Exploration of Dropout with LSTMs
INTERSPEECH 2017
Ideal Ratio Mask Estimation Using Deep Neural Networks for Monaural Speech Segregation in Noisy Reverberant Conditions
INTERSPEECH 2017
Attention-Based LSTM with Multi-Task Learning for Distant Speech Recognition
INTERSPEECH 2017
A DNN-HMM Approach to Non-Negative Matrix Factorization Based Speech Enhancement
INTERSPEECH 2016
Adaptive Group Sparsity for Non-Negative Matrix Factorization with Application to Unsupervised Source Separation
INTERSPEECH 2016
IOA: Improving SVM Based Sentiment Classification Through Post Processing
SEMEVAL 2015
A Shallow Discourse Parsing System Based On Maximum Entropy Model
CONLL 2015