Jinzheng He
16 papers · 2022–2025 · 5 conferences · across top CS/AI conferences
Achievements
🐝 Cross-Pollinator (7)
🌉 Interdisciplinary Bridge
🧭 Keyword Pioneer
🌍 Conference Polyglot (5)
🌈 Renaissance Researcher (7)
🤝 Dynamic Duo (16)
⚡ Prolific Year (7)
🗃️ Keyword Collector (64)
💎 Century Club (16)
Conferences
ACL (5)
ICLR (4)
NIPS (3)
AAAI (2)
EMNLP (2)
Keywords
speech synthesis (5)
singing voice synthesis (5)
style transfer (3)
generative model (2)
voice conversion (2)
zero-shot learning (2)
music corpus (2)
self-supervised learning (1)
multimodal learning (1)
video generation (1)
diffusion model (1)
facial animation (1)
cross-modal learning (1)
automatic speech recognition (1)
speech recognition (1)
vector quantization (1)
prosody prediction (1)
speech analysis (1)
talking face generation (1)
domain generalization (1)
Papers
WavRAG: Audio-Integrated Retrieval Augmented Generation for Spoken Dialogue Models
ACL 2025
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
NIPS 2024
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
NIPS 2024
StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis
AAAI 2024
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
ICLR 2024
Wav2SQL: Direct Generalizable Speech-To-SQL Parsing
ACL 2024
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control
EMNLP 2024
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis
ICLR 2024
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
ACL 2023
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training
ACL 2023
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis
ICLR 2023
ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer
EMNLP 2023
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
ICLR 2023
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
ACL 2023
Flow-Based Unconstrained Lip to Speech Generation
AAAI 2022
M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus
NIPS 2022