Tan Lee
45 papers · 2016–2025 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+12 more ↓ Show less ↑
π§ Keyword Pioneer π Renaissance Researcher (7) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (26) π Conference Polyglot (2)
π
Academic Marathon
(9)
π£
Hot Topic Early Bird
π§
Keyword Pioneer
π
Conference Loyalist
(44)
π¬
Deep Specialist
(10)
π
Keyword Champion
(3)
π
Century Club
(45)
π
Conference Pioneer
π
Trend Setter
ποΈ
Keyword Collector
(79)
β‘
Prolific Year
(9)
π₯
Unstoppable
(10)
Conferences
INTERSPEECH (44)
ACL (1)
Top co-authors
Keywords
speaker verification
(8)
deep neural network
(6)
speaker embedding
(5)
automatic speech recognition
(5)
speech sound disorder
(4)
recurrent neural network
(3)
representation learning
(3)
variational autoencoder
(3)
therapist empathy
(3)
child speech
(3)
text-to-speech synthesis
(3)
speech synthesis
(3)
siamese network
(3)
disentangled representation
(2)
convolutional neural network
(2)
feature representation
(2)
multiple instance learning
(2)
domain adaptation
(2)
transfer learning
(2)
model compression
(2)
Papers
PodAgent: A Comprehensive Framework for Podcast Generation
ACL 2025
LUPET: Incorporating Hierarchical Information Path into Multilingual ASR
INTERSPEECH 2024
A Parameter-efficient Language Extension Framework for Multilingual ASR
INTERSPEECH 2024
Learning Representation of Therapist Empathy in Counseling Conversation Using Siamese Hierarchical Attention Network
INTERSPEECH 2024
A Study on Prosodic Entrainment in Relation to Therapist Empathy in Counseling Conversation
INTERSPEECH 2023
CoMFLP: Correlation Measure Based Fast Search on ASR Layer Pruning
INTERSPEECH 2023
Model Compression for DNN-based Speaker Verification Using Weight Quantization
INTERSPEECH 2023
Creating Personalized Synthetic Voices from Post-Glossectomy Speech with Guided Diffusion Models
INTERSPEECH 2023
A Study on Using Duration and Formant Features in Automatic Detection of Speech Sound Disorder in Children
INTERSPEECH 2023
ContextSpeech: Expressive and Efficient Text-to-Speech for Paragraph Reading
INTERSPEECH 2023
EDITnet: A Lightweight Network for Unsupervised Domain Adaptation in Speaker Verification
INTERSPEECH 2022
Transport-Oriented Feature Aggregation for Speaker Embedding Learning
INTERSPEECH 2022
Unifying Cosine and PLDA Back-ends for Speaker Verification
INTERSPEECH 2022
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
INTERSPEECH 2022
Durational Patterning at Discourse Boundaries in Relation to Therapist Empathy in Psychotherapy
INTERSPEECH 2022
Automatic Detection of Speech Sound Disorder in Child Speech Using Posterior-based Speaker Representations
INTERSPEECH 2022
Hierarchical Attention Network for Evaluating Therapist Empathy in Counseling Session
INTERSPEECH 2022
Characterizing Therapist's Speaking Style in Relation to Empathy in Psychotherapy
INTERSPEECH 2022
Environment Aware Text-to-Speech Synthesis
INTERSPEECH 2022
Detection of Consonant Errors in Disordered Speech Based on Consonant-Vowel Segment Embedding
INTERSPEECH 2021
Fine-Grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement
INTERSPEECH 2021
Pairing Weak with Strong: Twin Models for Defending Against Adversarial Attack on Speaker Verification
INTERSPEECH 2021
Applying the Information Bottleneck Principle to Prosodic Representation Learning
INTERSPEECH 2021
Automatic Detection of Phonological Errors in Child Speech Using Siamese Recurrent Autoencoder
INTERSPEECH 2020
CUCHILD: A Large-Scale Cantonese Corpus of Child Speech for Phonology and Articulation Assessment
INTERSPEECH 2020
Emotion Profile Refinery for Speech Emotion Classification
INTERSPEECH 2020
Text-Independent Speaker Verification with Dual Attention Network
INTERSPEECH 2020
EigenEmo: Spectral Utterance Representation Using Dynamic Mode Decomposition for Speech Emotion Classification
INTERSPEECH 2020
Advancing Multiple Instance Learning with Attention Modeling for Categorical Speech Emotion Recognition
INTERSPEECH 2020
Learning Syllable-Level Discrete Prosodic Representation for Expressive Speech Generation
INTERSPEECH 2020
Child Speech Disorder Detection with Siamese Recurrent Network Using Speech Attribute Features
INTERSPEECH 2019
Automatic Assessment of Language Impairment Based on Raw ASR Output
INTERSPEECH 2019
Deep Learning of Segment-Level Feature Representation with Multiple Instance Learning for Utterance-Level Speech Emotion Recognition
INTERSPEECH 2019
Combining Adversarial Training and Disentangled Speech Representation for Robust Zero-Resource Subword Modeling
INTERSPEECH 2019
Fast DNN Acoustic Model Speaker Adaptation by Learning Hidden Unit Contribution Features
INTERSPEECH 2019
Improving Unsupervised Subword Modeling via Disentangled Speech Representation Learning and Transformation
INTERSPEECH 2019
Automatic Speech Assessment for People with Aphasia Using TDNN-BLSTM with Multi-Task Learning
INTERSPEECH 2018
Exploiting Speaker and Phonetic Diversity of Mismatched Language Resources for Unsupervised Subword Modeling
INTERSPEECH 2018
Improving Cross-Lingual Knowledge Transferability Using Multilingual TDNN-BLSTM with Language-Dependent Pre-Final Layer
INTERSPEECH 2018
Cross-cultural (A)symmetries in Audio-visual Attitude Perception
INTERSPEECH 2018
Acoustic Assessment of Disordered Voice with Continuous Speech Based on Utterance-Level ASR Posterior Features
INTERSPEECH 2017
RNN-LDA Clustering for Feature Based DNN Adaptation
INTERSPEECH 2017
On the Linguistic Relevance of Speech Units Learned by Unsupervised Acoustic Modeling
INTERSPEECH 2017
Hybrid Accelerated Optimization for Speech Recognition
INTERSPEECH 2016
Predicting Severity of Voice Disorder from DNN-HMM Acoustic Posteriors
INTERSPEECH 2016