Haohan Guo
10 papers · 2019–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π§ Keyword Pioneer π Conference Polyglot (4) π Academic Marathon (6) π£ Hot Topic Early Bird π Cross-Pollinator (9)
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(18)
π
Century Club
(10)
Conferences
INTERSPEECH (6)
ICML (2)
ACL (1)
NIPS (1)
Top co-authors
Keywords
text-to-speech synthesis
(3)
neural vocoder
(2)
speech codec
(2)
generative adversarial network
(2)
speech synthesis
(2)
cross-modal learning
(1)
speech enhancement
(1)
prosody prediction
(1)
vector quantization
(1)
diffusion model
(1)
autoregressive model
(1)
end-to-end learning
(1)
audio codec
(1)
exposure bia
(1)
phrase structure
(1)
end-to-end model
(1)
speech generation
(1)
neural codec
(1)
non-autoregressive model
(1)
speech reconstruction
(1)
Papers
PodAgent: A Comprehensive Framework for Podcast Generation
ACL 2025
ALMTokenizer: A Low-bitrate and Semantic-rich Audio Codec Tokenizer for Audio Language Modeling
ICML 2025
UniAudio 1.5: Large Language Model-Driven Audio Codec is A Few-Shot Audio Task Learner
NIPS 2024
UniAudio: Towards Universal Audio Generation with Large Language Models
ICML 2024
Single-Codec: Single-Codebook Speech Codec towards High-Performance Speech Generation
INTERSPEECH 2024
SimpleSpeech: Towards Simple and Efficient Text-to-Speech with Scalar Latent Transformer Diffusion Models
INTERSPEECH 2024
A Multi-Scale Time-Frequency Spectrogram Discriminator for GAN-based Non-Autoregressive TTS
INTERSPEECH 2022
A Multi-Stage Multi-Codebook VQ-VAE Approach to High-Performance Neural TTS
INTERSPEECH 2022
A New GAN-Based End-to-End TTS Training Algorithm
INTERSPEECH 2019
Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS
INTERSPEECH 2019