Shiyin Kang
16 papers · 2016–2024 · 3 conferences · across top CS/AI conferences
Achievements
🏃 Academic Marathon (8)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🌍 Conference Polyglot (3)
🐝 Cross-Pollinator (8)
🏆 Keyword Champion (2)
🤝 Dynamic Duo (12)
👥 Mega-Team (32)
🧬 Topic Evolution
🗃️ Keyword Collector (88)
🚀 Conference Pioneer
💎 Century Club (16)
🔥 Unstoppable (7)
Conferences
INTERSPEECH (14)
ACL (1)
NeurIPS (1)
Keywords
voice conversion (3)
text-to-speech synthesis (3)
style modeling (2)
singing voice synthesis (2)
song generation (2)
phonetic posteriorgram (2)
speech synthesis (2)
attention mechanism (1)
self-supervised learning (1)
semi-supervised learning (1)
text representation (1)
BERT model (1)
contextual information (1)
automatic speech recognition (1)
speech recognition (1)
disentangled representation (1)
transfer learning (1)
autoregressive model (1)
acoustic model (1)
speaker embedding (1)
Papers
SongCreator: Lyrics-based Universal Song Generation
NeurIPS 2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM
ACL 2024
An End-to-End Approach for Chord-Conditioned Song Generation
INTERSPEECH 2024
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis
INTERSPEECH 2023
Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis
INTERSPEECH 2022
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information
INTERSPEECH 2022
Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic Information
INTERSPEECH 2022
VAENAR-TTS: Variational Auto-Encoder Based Non-AutoRegressive Text-to-Speech Synthesis
INTERSPEECH 2021
Adversarially Learning Disentangled Speech Representations for Robust Multi-Factor Voice Conversion
INTERSPEECH 2021
DurIAN: Duration Informed Attention Network for Speech Synthesis
INTERSPEECH 2020
Transferring Source Style in Non-Parallel Voice Conversion
INTERSPEECH 2020
Multimedia Simultaneous Translation System for Minority Language Communication with Mandarin
INTERSPEECH 2019
One-Shot Voice Conversion with Global Speaker Embeddings
INTERSPEECH 2019
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT
INTERSPEECH 2019
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis
INTERSPEECH 2018
Personalized, Cross-Lingual TTS Using Phonetic Posteriorgrams
INTERSPEECH 2016