Jinglin Liu
25 papers · 2020–2024 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (46) π Renaissance Researcher (8) π Interdisciplinary Bridge π Conference Polyglot (7) π§ Keyword Pioneer
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π
Grand Slam
π€
Dynamic Duo
(25)
β‘
Prolific Year
(9)
π
Century Club
(25)
π₯
Unstoppable
(5)
ποΈ
Keyword Collector
(106)
Conferences
ACL (9)
NIPS (5)
AAAI (4)
ICLR (4)
ICML (1)
IJCAI (1)
INTERSPEECH (1)
Top co-authors
Keywords
speech synthesis
(8)
diffusion model
(3)
singing voice synthesis
(3)
voice conversion
(3)
prosody modeling
(3)
neural machine translation
(2)
neural network
(2)
knowledge distillation
(2)
audio generation
(2)
speech generation
(2)
generative adversarial network
(2)
attention mechanism
(2)
facial animation
(2)
variational autoencoder
(2)
generative model
(2)
multimodal learning
(1)
video generation
(1)
speech recognition
(1)
curriculum learning
(1)
zero-shot learning
(1)
Papers
Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis
ICLR 2024
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes
NIPS 2024
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
AAAI 2024
Real3D-Portrait: One-shot Realistic 3D Talking Portrait Synthesis
ICLR 2024
DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect
ACL 2023
Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models
ICML 2023
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face Synthesis
ICLR 2023
AV-TranSpeech: Audio-Visual Robust Speech-to-Speech Translation
ACL 2023
CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training
ACL 2023
RMSSinger: Realistic-Music-Score based Singing Voice Synthesis
ACL 2023
FastDiff 2: Revisiting and Incorporating GANs and Diffusion Models in High-Fidelity Speech Synthesis
ACL 2023
AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment
ACL 2023
TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation
ICLR 2023
Flow-Based Unconstrained Lip to Speech Generation
AAAI 2022
Parallel and High-Fidelity Text-to-Lip Generation
AAAI 2022
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism
AAAI 2022
GenerSpeech: Towards Style Transfer for Generalizable Out-Of-Domain Text-to-Speech
NIPS 2022
Dict-TTS: Learning to Pronounce with Prior Dictionary Knowledge for Text-to-Speech
NIPS 2022
M4Singer: A Multi-Style, Multi-Singer and Musical Score Provided Mandarin Singing Corpus
NIPS 2022
Learning the Beauty in Songs: Neural Singing Voice Beautifier
ACL 2022
EMOVIE: A Mandarin Emotion Speech Dataset with a Simple Emotional Text-to-Speech Model
INTERSPEECH 2021
PortaSpeech: Portable and High-Quality Generative Text-to-Speech
NIPS 2021
A Study of Non-autoregressive Model for Sequence Generation
ACL 2020
Task-Level Curriculum Learning for Non-Autoregressive Neural Machine Translation
IJCAI 2020
SimulSpeech: End-to-End Simultaneous Speech to Text Translation
ACL 2020