Chin-Hui Lee
38 papers · 2015–2024 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (4) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (16) π Academic Marathon (9)
π
Cross-Pollinator
(9)
π§
Keyword Pioneer
π
Academic Marathon
(9)
π
Conference Loyalist
(35)
π¬
Deep Specialist
(10)
π
Keyword Champion
(3)
π€
Dynamic Duo
(27)
π
Conference Pioneer
ποΈ
Keyword Collector
(151)
β‘
Prolific Year
(5)
π
Century Club
(38)
π₯
Unstoppable
(10)
Conferences
INTERSPEECH (35)
ACL (1)
CVPR (1)
IJCNLP (1)
Top co-authors
Research topics
Keywords
speech enhancement
(14)
deep neural network
(7)
long short-term memory
(6)
speaker diarization
(4)
acoustic scene classification
(4)
transfer learning
(3)
knowledge distillation
(3)
multimodal learning
(3)
maximum likelihood
(3)
acoustic model
(3)
speech attribute
(3)
mispronunciation detection
(2)
audio-visual speech recognition
(2)
voice activity detection
(2)
distant speech recognition
(2)
convolutional neural network
(2)
acoustic modeling
(2)
progressive learning
(2)
domain adaptation
(2)
data augmentation
(2)
Papers
Enhancing Voice Wake-Up for Dysarthria: Mandarin Dysarthria Speech Corpus Release and Customized System Design
INTERSPEECH 2024
Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition
INTERSPEECH 2024
A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition
CVPR 2024
AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in the SUPERB Benchmark
INTERSPEECH 2023
A Multiple-Teacher Pruning Based Self-Distillation (MT-PSD) Approach to Model Compression for Audio-Visual Wake Word Spotting
INTERSPEECH 2023
Unsupervised Adaptation with Quality-Aware Masking to Improve Target-Speaker Voice Activity Detection for Speaker Diarization
INTERSPEECH 2023
A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models
INTERSPEECH 2023
Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement
INTERSPEECH 2023
Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis
INTERSPEECH 2022
Deep Segment Model for Acoustic Scene Classification
INTERSPEECH 2022
End-to-End Audio-Visual Neural Speaker Diarization
INTERSPEECH 2022
Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis
INTERSPEECH 2022
PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification
INTERSPEECH 2021
Scenario-Dependent Speaker Diarization for DIHARD-III Challenge
INTERSPEECH 2021
Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries
INTERSPEECH 2021
Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments
INTERSPEECH 2021
A Maximum Likelihood Approach to SNR-Progressive Learning Using Generalized Gaussian Distribution for LSTM-Based Speech Enhancement
INTERSPEECH 2021
An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances
INTERSPEECH 2020
Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement
INTERSPEECH 2020
A Space-and-Speaker-Aware Iterative Mask Estimation Approach to Multi-Channel Speech Recognition in the CHiME-6 Challenge
INTERSPEECH 2020
Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification
INTERSPEECH 2020
Using Speech Enhancement Preprocessing for Speech Emotion Recognition in Realistic Noisy Conditions
INTERSPEECH 2020
A Noise-Aware Memory-Attention Network Architecture for Regression-Based Speech Enhancement
INTERSPEECH 2020
A Hybrid Approach to Acoustic Scene Classification Based on Universal Acoustic Models
INTERSPEECH 2019
A Cross-Entropy-Guided (CEG) Measure for Speech Enhancement Front-End Assessing Performances of Back-End Automatic Speech Recognition
INTERSPEECH 2019
KL-Divergence Regularized Deep Neural Network Adaptation for Low-Resource Speaker-Dependent Speech Enhancement
INTERSPEECH 2019
Acoustic Model Ensembling Using Effective Data Augmentation for CHiME-5 Challenge
INTERSPEECH 2019
Speaker Diarization with Enhancing Speech for the First DIHARD Challenge
INTERSPEECH 2018
Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement
INTERSPEECH 2018
A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation
INTERSPEECH 2017
Joint Training of Multi-Channel-Condition Dereverberation and Acoustic Modeling of Microphone Array Speech for Robust Distant Speech Recognition
INTERSPEECH 2017
Improving Mispronunciation Detection for Non-Native Learners with Multisource Information and LSTM-Based Deep Models
INTERSPEECH 2017
On Design of Robust Deep Models for CHiME-4 Multi-Channel Speech Recognition with Multiple Configurations of Array Microphones
INTERSPEECH 2017
SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement
INTERSPEECH 2016
An Iterative Phase Recovery Framework with Phase Mask for Spectral Mapping with an Application to Speech Enhancement
INTERSPEECH 2016
Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees
INTERSPEECH 2016
Tweet Normalization with Syllables
IJCNLP 2015
Tweet Normalization with Syllables
ACL 2015