Shiyin Kang

16 papers · 2016–2024 · 3 top CS/AI conferences

Achievements

🏃 Academic Marathon (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (3) 🐝 Cross-Pollinator (8)

🏆 Keyword Champion (2) 🤝 Dynamic Duo (12) 👥 Mega-Team (32) 🧬 Topic Evolution 🗃️ Keyword Collector (88) 🚀 Conference Pioneer 💎 Century Club (16) 🔥 Unstoppable (7)

Conferences

INTERSPEECH (14) ACL (1) NIPS (1)

Top co-authors

Helen Meng (12) Zhiyong Wu (11) Shun Lei (5) Deyi Tuo (4) Dan Su (4) Xixin Wu (4) Dong Yu (4) Yixuan Zhou (3) Xunying Liu (3) Hangyu Liu (2)

Keywords

voice conversion (3) text-to-speech synthesis (3) style modeling (2) singing voice synthesis (2) song generation (2) phonetic posteriorgram (2) speech synthesis (2) attention mechanism (1) self-supervised learning (1) semi-supervised learning (1) text representation (1) bert model (1) contextual information (1) automatic speech recognition (1) speech recognition (1) disentangled representation (1) transfer learning (1) autoregressive model (1) acoustic model (1) speaker embedding (1)

Papers

SongCreator: Lyrics-based Universal Song Generation · NIPS 2024
ChatMusician: Understanding and Generating Music Intrinsically with LLM · ACL 2024
An End-to-End Approach for Chord-Conditioned Song Generation · INTERSPEECH 2024
Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis · INTERSPEECH 2023
Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis · INTERSPEECH 2022
Improving Mandarin Prosodic Structure Prediction with Multi-level Contextual Information · INTERSPEECH 2022
Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic Information · INTERSPEECH 2022
VAENAR-TTS: Variational Auto-Encoder Based Non-AutoRegressive Text-to-Speech Synthesis · INTERSPEECH 2021
Adversarially Learning Disentangled Speech Representations for Robust Multi-Factor Voice Conversion · INTERSPEECH 2021
DurIAN: Duration Informed Attention Network for Speech Synthesis · INTERSPEECH 2020
Transferring Source Style in Non-Parallel Voice Conversion · INTERSPEECH 2020
Multimedia Simultaneous Translation System for Minority Language Communication with Mandarin · INTERSPEECH 2019
One-Shot Voice Conversion with Global Speaker Embeddings · INTERSPEECH 2019
Disambiguation of Chinese Polyphones in an End-to-End Framework with Semantic Features Extracted by Pre-Trained BERT · INTERSPEECH 2019
Rapid Style Adaptation Using Residual Error Embedding for Expressive Speech Synthesis · INTERSPEECH 2018
Personalized, Cross-Lingual TTS Using Phonetic Posteriorgrams · INTERSPEECH 2016