Chin-Hui Lee

38 papers · 2015–2024 · 4 conferences · across top CS/AI conferences

Achievements

+12 more ↓

🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (16) 🏃 Academic Marathon (9)

🐝 Cross-Pollinator (9) 🧭 Keyword Pioneer 🏃 Academic Marathon (9) 🏠 Conference Loyalist (35) 🔬 Deep Specialist (10) 🏆 Keyword Champion (3) 🤝 Dynamic Duo (27) 🚀 Conference Pioneer 🗃️ Keyword Collector (151) ⚡ Prolific Year (5) 💎 Century Club (38) 🔥 Unstoppable (10)

Conferences

INTERSPEECH (35) ACL (1) CVPR (1) IJCNLP (1)

Top co-authors

Jun Du (27) Sabato Marco Siniscalchi (10) Hang Chen (6) Li Chai (6) Lei Sun (4) Yannan Wang (4) Hengshun Zhou (4) Yan-Hui Tu (3) Hu Hu (3) Kehuang Li (3)

Research topics

Differential Privacy (1)

Keywords

speech enhancement (14) deep neural network (7) long short-term memory (6) speaker diarization (4) acoustic scene classification (4) transfer learning (3) knowledge distillation (3) multimodal learning (3) maximum likelihood (3) acoustic model (3) speech attribute (3) mispronunciation detection (2) audio-visual speech recognition (2) voice activity detection (2) distant speech recognition (2) convolutional neural network (2) acoustic modeling (2) progressive learning (2) domain adaptation (2) data augmentation (2)

Papers

Enhancing Voice Wake-Up for Dysarthria: Mandarin Dysarthria Speech Corpus Release and Customized System Design INTERSPEECH 2024 Language-Universal Speech Attributes Modeling for Zero-Shot Multilingual Spoken Keyword Recognition INTERSPEECH 2024 A Study of Dropout-Induced Modality Bias on Robustness to Missing Video Frames for Audio-Visual Speech Recognition CVPR 2024 AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in the SUPERB Benchmark INTERSPEECH 2023 A Multiple-Teacher Pruning Based Self-Distillation (MT-PSD) Approach to Model Compression for Audio-Visual Wake Word Spotting INTERSPEECH 2023 Unsupervised Adaptation with Quality-Aware Masking to Improve Target-Speaker Voice Activity Detection for Speaker Diarization INTERSPEECH 2023 A Multi-dimensional Deep Structured State Space Approach to Speech Enhancement Using Small-footprint Models INTERSPEECH 2023 Variance-Preserving-Based Interpolation Diffusion Models for Speech Enhancement INTERSPEECH 2023 Audio-Visual Speech Recognition in MISP2021 Challenge: Dataset Release and Deep Analysis INTERSPEECH 2022 Deep Segment Model for Acoustic Scene Classification INTERSPEECH 2022 End-to-End Audio-Visual Neural Speaker Diarization INTERSPEECH 2022 Audio-Visual Wake Word Spotting in MISP2021 Challenge: Dataset Release and Deep Analysis INTERSPEECH 2022 PATE-AAE: Incorporating Adversarial Autoencoder into Private Aggregation of Teacher Ensembles for Spoken Command Classification INTERSPEECH 2021 Scenario-Dependent Speaker Diarization for DIHARD-III Challenge INTERSPEECH 2021 Automatic Lip-Reading with Hierarchical Pyramidal Convolution and Self-Attention for Image Sequences with No Word Boundaries INTERSPEECH 2021 Audio-Visual Information Fusion Using Cross-Modal Teacher-Student Learning for Voice Activity Detection in Realistic Environments INTERSPEECH 2021 A Maximum Likelihood Approach to SNR-Progressive Learning Using Generalized Gaussian Distribution for LSTM-Based Speech Enhancement INTERSPEECH 2021 An Acoustic Segment Model Based Segment Unit Selection Approach to Acoustic Scene Classification with Partial Utterances INTERSPEECH 2020 Exploring Deep Hybrid Tensor-to-Vector Network Architectures for Regression Based Speech Enhancement INTERSPEECH 2020 A Space-and-Speaker-Aware Iterative Mask Estimation Approach to Multi-Channel Speech Recognition in the CHiME-6 Challenge INTERSPEECH 2020 Relational Teacher Student Learning with Neural Label Embedding for Device Adaptation in Acoustic Scene Classification INTERSPEECH 2020 Using Speech Enhancement Preprocessing for Speech Emotion Recognition in Realistic Noisy Conditions INTERSPEECH 2020 A Noise-Aware Memory-Attention Network Architecture for Regression-Based Speech Enhancement INTERSPEECH 2020 A Hybrid Approach to Acoustic Scene Classification Based on Universal Acoustic Models INTERSPEECH 2019 A Cross-Entropy-Guided (CEG) Measure for Speech Enhancement Front-End Assessing Performances of Back-End Automatic Speech Recognition INTERSPEECH 2019 KL-Divergence Regularized Deep Neural Network Adaptation for Low-Resource Speaker-Dependent Speech Enhancement INTERSPEECH 2019 Acoustic Model Ensembling Using Effective Data Augmentation for CHiME-5 Challenge INTERSPEECH 2019 Speaker Diarization with Enhancing Speech for the First DIHARD Challenge INTERSPEECH 2018 Error Modeling via Asymmetric Laplace Distribution for Deep Neural Network Based Single-Channel Speech Enhancement INTERSPEECH 2018 A Maximum Likelihood Approach to Deep Neural Network Based Nonlinear Spectral Mapping for Single-Channel Speech Separation INTERSPEECH 2017 Joint Training of Multi-Channel-Condition Dereverberation and Acoustic Modeling of Microphone Array Speech for Robust Distant Speech Recognition INTERSPEECH 2017 Improving Mispronunciation Detection for Non-Native Learners with Multisource Information and LSTM-Based Deep Models INTERSPEECH 2017 On Design of Robust Deep Models for CHiME-4 Multi-Channel Speech Recognition with Multiple Configurations of Array Microphones INTERSPEECH 2017 SNR-Based Progressive Learning of Deep Neural Network for Speech Enhancement INTERSPEECH 2016 An Iterative Phase Recovery Framework with Phase Mask for Spectral Mapping with an Application to Speech Enhancement INTERSPEECH 2016 Detecting Mispronunciations of L2 Learners and Providing Corrective Feedback Using Knowledge-Guided and Data-Driven Decision Trees INTERSPEECH 2016 Tweet Normalization with Syllables IJCNLP 2015 Tweet Normalization with Syllables ACL 2015