Zhifang Guo
5 papers · 2024–2025 · 4 conferences · across top CS/AI conferences
Achievements
Interdisciplinary Bridge
Keyword Pioneer
Conference Polyglot (4)
Cross-Pollinator (14)
Renaissance Researcher (5)
Taxonomy Completionist (21)
Hot Topic Early Bird
Conferences
ACL (2)
AAAI (1)
ICLR (1)
INTERSPEECH (1)
Keywords
audio generation (2)
multimodal learning (2)
language model (2)
speech processing (2)
audio classification (1)
acoustic model (1)
diffusion model (1)
pre-trained model (1)
controllable generation (1)
discrete token (1)
discrete representation (1)
sound event detection (1)
speech tokenization (1)
speech language model (1)
speech generation (1)
discrete speech token (1)
speech instruction following (1)
speech-text alignment (1)
speech large language model (1)
neural codec (1)
Papers
InSerter: Speech Instruction Following with Unsupervised Interleaved Pre-training
ACL 2025
Analyzing and Mitigating Inconsistency in Discrete Speech Tokens for Neural Codec Language Models
ACL 2025
Audio Generation with Multiple Conditional Diffusion Model
AAAI 2024
PromptTTS 2: Describing and Generating Voices with Text Prompt
ICLR 2024
Leveraging Language Model Capabilities for Sound Event Detection
INTERSPEECH 2024