Tan Lee

45 papers · 2016–2025 · 2 conferences · across top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🌈 Renaissance Researcher (7) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (26) 🌍 Conference Polyglot (2)

🏃 Academic Marathon (9) 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (44) 🔬 Deep Specialist (10) 🏆 Keyword Champion (3) 💎 Century Club (45) 🚀 Conference Pioneer 📈 Trend Setter 🗃️ Keyword Collector (79) ⚡ Prolific Year (9) 🔥 Unstoppable (10)

Conferences

INTERSPEECH (44) ACL (1)

Top co-authors

Siyuan Feng (6) Jingyu Li (5) Guangyan Zhang (5) Wei Liu (5) P.C. Ching (5) Dehua Tao (5) Zhiyuan Peng (5) Si-Ioi Ng (5) Ying Qin (5) Harold Chui (5)

Keywords

speaker verification (8) deep neural network (6) speaker embedding (5) automatic speech recognition (5) speech sound disorder (4) recurrent neural network (3) representation learning (3) variational autoencoder (3) therapist empathy (3) child speech (3) text-to-speech synthesis (3) speech synthesis (3) siamese network (3) disentangled representation (2) convolutional neural network (2) feature representation (2) multiple instance learning (2) domain adaptation (2) transfer learning (2) model compression (2)

Papers

PodAgent: A Comprehensive Framework for Podcast Generation · ACL 2025
LUPET: Incorporating Hierarchical Information Path into Multilingual ASR · INTERSPEECH 2024
A Parameter-efficient Language Extension Framework for Multilingual ASR · INTERSPEECH 2024
Learning Representation of Therapist Empathy in Counseling Conversation Using Siamese Hierarchical Attention Network · INTERSPEECH 2024
A Study on Prosodic Entrainment in Relation to Therapist Empathy in Counseling Conversation · INTERSPEECH 2023
CoMFLP: Correlation Measure Based Fast Search on ASR Layer Pruning · INTERSPEECH 2023
Model Compression for DNN-based Speaker Verification Using Weight Quantization · INTERSPEECH 2023
Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models · INTERSPEECH 2023
A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children · INTERSPEECH 2023
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading · INTERSPEECH 2023
EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification · INTERSPEECH 2022
Transport-Oriented Feature Aggregation for Speaker Embedding Learning · INTERSPEECH 2022
Unifying Cosine and PLDA Back-ends for Speaker Verification · INTERSPEECH 2022
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech · INTERSPEECH 2022
Durational Patterning at Discourse Boundaries in Relation to Therapist Empathy in Psychotherapy · INTERSPEECH 2022
Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations · INTERSPEECH 2022
Hierarchical Attention Network for Evaluating Therapist Empathy in Counseling Session · INTERSPEECH 2022
Characterizing Therapist's Speaking Style in Relation to Empathy in Psychotherapy · INTERSPEECH 2022
Environment Aware Text-to-Speech Synthesis · INTERSPEECH 2022
Detection of Consonant Errors in Disordered Speech Based on Consonant-Vowel Segment Embedding · INTERSPEECH 2021
Fine-Grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement · INTERSPEECH 2021
Pairing Weak with Strong: Twin Models for Defending Against Adversarial Attack on Speaker Verification · INTERSPEECH 2021
Applying the Information Bottleneck Principle to Prosodic Representation Learning · INTERSPEECH 2021
Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder · INTERSPEECH 2020
CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment · INTERSPEECH 2020
Emotion Profile Refinery for Speech Emotion Classification · INTERSPEECH 2020
Text-Independent Speaker Verification with Dual Attention Network · INTERSPEECH 2020
EigenEmo: Spectral Utterance Representation Using Dynamic Mode Decomposition for Speech Emotion Classification · INTERSPEECH 2020
Advancing Multiple Instance Learning with Attention Modeling for Categorical Speech Emotion Recognition · INTERSPEECH 2020
Learning Syllable-Level Discrete Prosodic Representation for Expressive Speech Generation · INTERSPEECH 2020
Child Speech Disorder Detection with Siamese Recurrent Network Using Speech Attribute Features · INTERSPEECH 2019
Automatic Assessment of Language Impairment Based on Raw ASR Output · INTERSPEECH 2019
Deep Learning of Segment-Level Feature Representation with Multiple Instance Learning for Utterance-Level Speech Emotion Recognition · INTERSPEECH 2019
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling · INTERSPEECH 2019
Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features · INTERSPEECH 2019
Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation · INTERSPEECH 2019
Automatic Speech Assessment for People with Aphasia Using TDNN-BLSTM with Multi-Task Learning · INTERSPEECH 2018
Exploiting Speaker and Phonetic Diversity of Mismatched Language Resources for Unsupervised Subword Modeling · INTERSPEECH 2018
Improving Cross-Lingual Knowledge Transferability Using Multilingual TDNN-BLSTM with Language-Dependent Pre-Final Layer · INTERSPEECH 2018
Cross-cultural (A)symmetries in Audio-visual Attitude Perception · INTERSPEECH 2018
Acoustic Assessment of Disordered Voice with Continuous Speech Based on Utterance-Level ASR Posterior Features · INTERSPEECH 2017
RNN-LDA Clustering for Feature Based DNN Adaptation · INTERSPEECH 2017
On the Linguistic Relevance of Speech Units Learned by Unsupervised Acoustic Modeling · INTERSPEECH 2017
Hybrid Accelerated Optimization for Speech Recognition · INTERSPEECH 2016
Predicting Severity of Voice Disorder from DNN-HMM Acoustic Posteriors · INTERSPEECH 2016