Jinglin Liu

25 papers · 2020–2024 · 7 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🗺️ Taxonomy Completionist (46) 🌈 Renaissance Researcher (8) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (7) 🧭 Keyword Pioneer

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🏆 Grand Slam 🤝 Dynamic Duo (25) ⚡ Prolific Year (9) 💎 Century Club (25) 🔥 Unstoppable (5) 🗃️ Keyword Collector (106)

Conferences

ACL (9) NIPS (5) AAAI (4) ICLR (4) ICML (1) IJCAI (1) INTERSPEECH (1)

Top co-authors

Zhou Zhao (25) Yi Ren (22) Rongjie Huang (13) Zhenhui Ye (10) Jinzheng He (10) Ziyue Jiang (7) Xiang Yin (6) Chen Zhang (5) Lichao Zhang (4) Chenye Cui (4)

Keywords

speech synthesis (8) diffusion model (3) singing voice synthesis (3) voice conversion (3) prosody modeling (3) neural machine translation (2) neural network (2) knowledge distillation (2) audio generation (2) speech generation (2) generative adversarial network (2) attention mechanism (2) facial animation (2) variational autoencoder (2) generative model (2) multimodal learning (1) video generation (1) speech recognition (1) curriculum learning (1) zero-shot learning (1)

Papers

Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis ICLR 2024 MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes NIPS 2024 AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head AAAI 2024 Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis ICLR 2024 DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect ACL 2023 Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models ICML 2023 GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis ICLR 2023 AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation ACL 2023 CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training ACL 2023 RMSSinger: Realistic-Music-Score based Singing Voice Synthesis ACL 2023 FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis ACL 2023 AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment ACL 2023 TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation ICLR 2023 Flow-Based Unconstrained Lip to Speech Generation AAAI 2022 Parallel and High-Fidelity Text-to-Lip Generation AAAI 2022 DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism AAAI 2022 GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech NIPS 2022 Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech NIPS 2022 M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus NIPS 2022 Learning the Beauty in Songs: Neural Singing Voice Beautifier ACL 2022 EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model INTERSPEECH 2021 PortaSpeech: Portable and High-Quality Generative Text-to-Speech NIPS 2021 A Study of Non-autoregressive Model for Sequence Generation ACL 2020 Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation IJCAI 2020 SimulSpeech: End-to-End Simultaneous Speech to Text Translation ACL 2020