conftrace_

Jinzheng He

16 papers · 2022–2025 · 5 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+6 more ↓

🐝 Cross-Pollinator (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (5) 🌈 Renaissance Researcher (7)

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🤝 Dynamic Duo (16) ⚡ Prolific Year (7) 🗃️ Keyword Collector (64) 💎 Century Club (16)

Conferences

ACL (5) ICLR (4) NIPS (3) AAAI (2) EMNLP (2)

Top co-authors

Zhou Zhao (16) Rongjie Huang (11) Jinglin Liu (10) Yi Ren (9) Ziyue Jiang (7) Zhenhui Ye (7) Xiang Yin (5) Huadai Liu (5) Lichao Zhang (4) Ruiqi Li (4)

Keywords

speech synthesis (5) singing voice synthesis (5) style transfer (3) generative model (2) voice conversion (2) zero-shot learning (2) music corpus (2) self-supervised learning (1) multimodal learning (1) video generation (1) diffusion model (1) facial animation (1) cross-modal learning (1) automatic speech recognition (1) speech recognition (1) vector quantization (1) prosody prediction (1) speech analysis (1) talking face generation (1) domain generalization (1)

Papers

WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models ACL 2025 GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks NIPS 2024 MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes NIPS 2024 StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis AAAI 2024 Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis ICLR 2024 Wav2SQL: Direct Generalizable Speech-To-SQL Parsing ACL 2024 TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control EMNLP 2024 Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis ICLR 2024 RMSSinger: Realistic-Music-Score based Singing Voice Synthesis ACL 2023 CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training ACL 2023 GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis ICLR 2023 ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer EMNLP 2023 TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation ICLR 2023 AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation ACL 2023 Flow-Based Unconstrained Lip to Speech Generation AAAI 2022 M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus NIPS 2022