Zhiyuan Zhu

17 papers · 2023–2026 · 8 top CS/AI conferences

Achievements


🐝 Cross-Pollinator (7) 🌈 Renaissance Researcher (7) 🗺️ Taxonomy Completionist (10) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge

🌍 Conference Polyglot (8) 🗃️ Keyword Collector (58) 💎 Century Club (16) ⚡ Prolific Year (12)

Conferences

ACL (4) · MICCAI (4) · AACL (2) · EMNLP (2) · IJCNLP (2) · COLING (1) · INTERSPEECH (1) · NIPS (1)

Top co-authors

Yu Zhang (8) · Changhao Pan (8) · Wenxiang Guo (8) · Zhou Zhao (8) · Yu Wang (5) · Yanfeng Wang (4) · Jingyu Lu (4) · Yusheng Liao (3) · Ruiqi Li (3) · Yuhao Huang (3)

Keywords

singing voice synthesis (5) · speech synthesis (4) · audio generation (3) · generative model (3) · contrastive learning (3) · style transfer (2) · flow matching (2) · audio understanding (2) · phoneme alignment (2) · spatial audio (2) · large language model (2) · deep learning (2) · augmented reality (2) · virtual reality (2) · neural network (2) · speech analysis (1) · speech enhancement (1) · temporal reasoning (1) · in-context learning (1) · zero-shot learning (1)

Papers

Cross-Modal Coreference Alignment: Enabling Reliable Information Transfer in Omni-LLMs · ACL 2026
Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches · AACL 2025
ASAudio: A Survey of Advanced Spatial Audio Research · AACL 2025
EvolveBench: A Comprehensive Benchmark for Assessing Temporal Awareness in LLMs on Evolving Knowledge · ACL 2025
TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis · ACL 2025
STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation · ACL 2025
Versatile Framework for Song Generation with Prompt-based Control · EMNLP 2025
ADAptation: Reconstruction-based Unsupervised Active Learning for Breast Ultrasound Diagnosis · MICCAI 2025
Hierarchical Corpus-View-Category Refinement for Carotid Plaque Risk Grading in Ultrasound · MICCAI 2025
MReg: A Novel Regression Model with MoE-based Video Feature Mining for Mitral Regurgitation Diagnosis · MICCAI 2025
Spatio-temporal Pre-trained Foundation Model for Neural Decoding with Fine-grained Optimization · MICCAI 2025
Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches · IJCNLP 2025
ASAudio: A Survey of Advanced Spatial Audio Research · IJCNLP 2025
RA2FD: Distilling Faithfulness into Efficient Dialogue Systems · EMNLP 2024
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks · NIPS 2024
CE-VDG: Counterfactual Entropy-based Bias Reduction for Video-grounded Dialogue Generation · COLING 2024
Contrastive Learning Based ASR Robust Knowledge Selection For Spoken Dialogue System · INTERSPEECH 2023