Daxin Tan
5 papers · 2021–2025 · 2 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (14) π§ Keyword Pioneer π Conference Polyglot (2) π Interdisciplinary Bridge π Cross-Pollinator (14)
π
Renaissance Researcher
(5)
π₯
Mega-Team
(30)
Conferences
INTERSPEECH (4)
CVPR (1)
Top co-authors
Keywords
text-to-speech synthesis
(3)
representation learning
(2)
transfer learning
(1)
information bottleneck
(1)
style transfer
(1)
speech synthesis
(1)
speech processing
(1)
multimodal learning
(1)
emotion recognition
(1)
contextual representation
(1)
speaker embedding
(1)
variational autoencoder
(1)
foundation model
(1)
vision-language model
(1)
factor disentanglement
(1)
spoken dialogue
(1)
speech generation
(1)
acoustic environment
(1)
phoneme representation
(1)
bert pre-training
(1)
Papers
EMOVA: Empowering Language Models to See, Hear and Speak with Vivid Emotions
CVPR 2025
Mixed-Phoneme BERT: Improving BERT with Mixed Phoneme and Sup-Phoneme Representations for Text to Speech
INTERSPEECH 2022
Environment Aware Text-to-Speech Synthesis
INTERSPEECH 2022
Applying the Information Bottleneck Principle to Prosodic Representation Learning
INTERSPEECH 2021
Fine-Grained Style Modeling, Transfer and Prediction in Text-to-Speech Synthesis via Phone-Level Content-Style Disentanglement
INTERSPEECH 2021