Yiwei Guo
8 papers · 2022–2026 · 4 conferences · across top CS/AI conferences
Achievements
Cross-Pollinator (15)
Conference Polyglot (4)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (15)
Keyword Pioneer
Hot Topic Early Bird
Unstoppable (5)
Conferences
INTERSPEECH (4)
AAAI (2)
ICML (1)
NIPS (1)
Keywords
text-to-speech synthesis (3)
vector quantization (2)
acoustic token (2)
self-supervised learning (1)
knowledge distillation (1)
multimodal learning (1)
taxonomy construction (1)
speaker embedding (1)
acoustic model (1)
diffusion model (1)
multimodal dataset (1)
heterogeneous agent (1)
prompt sensitivity (1)
vision-language foundation model (1)
neural vocoder (1)
acoustic feature (1)
byte-pair encoding (1)
spoken language modeling (1)
decoder-only model (1)
semantic token (1)
Papers
AHAMask: Reliable Task Specification for Large Audio Language Models Without Instructions
AAAI 2026
TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision
ICML 2025
TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration
NIPS 2024
UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding
AAAI 2024
On the Effectiveness of Acoustic BPE in Decoder-Only TTS
INTERSPEECH 2024
DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation
INTERSPEECH 2024
DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech
INTERSPEECH 2023
VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature
INTERSPEECH 2022