conftrace_

Yiwei Guo

8 papers · 2022–2026 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+3 more ↓

🐝 Cross-Pollinator (15) 🌍 Conference Polyglot (4) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (15)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🔥 Unstoppable (5)

Conferences

INTERSPEECH (4) AAAI (2) ICML (1) NIPS (1)

Top co-authors

Kai Yu (6) Xie Chen (5) Chenpeng Du (3) Shuai Wang (3) Feiyu Shen (2) Yali Wang (2) Shaobin Zhuang (2) Bohan Li (2) Kunchang Li (2) Baihan Li (1)

Keywords

text-to-speech synthesis (3) vector quantization (2) acoustic token (2) self-supervised learning (1) knowledge distillation (1) multimodal learning (1) taxonomy construction (1) speaker embedding (1) acoustic model (1) diffusion model (1) multimodal dataset (1) heterogeneous agent (1) prompt sensitivity (1) vision-language foundation model (1) neural vocoder (1) acoustic feature (1) byte-pair encoding (1) spoken language modeling (1) decoder-only model (1) semantic token (1)

Papers

AHAMask: Reliable Task Specification for Large Audio Language Models Without Instructions AAAI 2026 TimeStep Master: Asymmetrical Mixture of Timestep LoRA Experts for Versatile and Efficient Diffusion Models in Vision ICML 2025 TransAgent: Transfer Vision-Language Foundation Models with Heterogeneous Agent Collaboration NIPS 2024 UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding AAAI 2024 On the Effectiveness of Acoustic BPE in Decoder-Only TTS INTERSPEECH 2024 DiveSound: LLM-Assisted Automatic Taxonomy Construction for Diverse Audio Generation INTERSPEECH 2024 DSE-TTS: Dual Speaker Embedding for Cross-Lingual Text-to-Speech INTERSPEECH 2023 VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature INTERSPEECH 2022