Kai Shen
7 papers · 2020–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+1 more ↓ Show less ↑
🌍 Conference Polyglot (6) 🏃 Academic Marathon (5) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer
🐝
Cross-Pollinator
(15)
Conferences
ICLR (2)
ACL (1)
EMNLP (1)
ICCV (1)
ICML (1)
IJCAI (1)
Top co-authors
Keywords
video generation
(1)
vector quantization
(1)
diffusion model
(1)
autoregressive model
(1)
language model
(1)
text-to-video generation
(1)
hybrid architecture
(1)
sequence-to-sequence model
(1)
hierarchical attention
(1)
speech recognition error
(1)
masking strategy
(1)
grounded generation
(1)
non-autoregressive generation
(1)
video description
(1)
semantic tokenizer
(1)
coarse-to-fine generation
(1)
spatial-temporal graph
(1)
sign language production
(1)
dynamic encoding
(1)
text error correction
(1)
Papers
The Best of Both Worlds: Integrating Language Models and Diffusion Models for Video Generation
ICCV 2025
NaturalSpeech 2: Latent Diffusion Models are Natural and Zero-Shot Speech and Singing Synthesizers
ICLR 2024
PromptTTS 2: Describing and Generating Voices with Text Prompt
ICLR 2024
NaturalSpeech 3: Zero-Shot Speech Synthesis with Factorized Codec and Diffusion Models
ICML 2024
T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text
ACL 2024
Mask the Correct Tokens: An Embarrassingly Simple Approach for Error Correction
EMNLP 2022
Hierarchical Attention Based Spatial-Temporal Graph-to-Sequence Learning for Grounded Video Description
IJCAI 2020