Zhiyuan Zhu
17 papers · 2023–2026 · 8 top CS/AI conferences
Achievements
Cross-Pollinator (7)
Renaissance Researcher (7)
Taxonomy Completionist (10)
Keyword Pioneer
Interdisciplinary Bridge
Conference Polyglot (8)
Keyword Collector (58)
Century Club (16)
Prolific Year (12)
Conferences
ACL (4)
MICCAI (4)
AACL (2)
EMNLP (2)
IJCNLP (2)
COLING (1)
INTERSPEECH (1)
NIPS (1)
Keywords
singing voice synthesis (5)
speech synthesis (4)
audio generation (3)
generative model (3)
contrastive learning (3)
style transfer (2)
flow matching (2)
audio understanding (2)
phoneme alignment (2)
spatial audio (2)
large language model (2)
deep learning (2)
augmented reality (2)
virtual reality (2)
neural network (2)
speech analysis (1)
speech enhancement (1)
temporal reasoning (1)
in-context learning (1)
zero-shot learning (1)
Papers
Cross-Modal Coreference Alignment: Enabling Reliable Information Transfer in Omni-LLMs
ACL 2026
Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches
AACL 2025
ASAudio: A Survey of Advanced Spatial Audio Research
AACL 2025
EvolveBench: A Comprehensive Benchmark for Assessing Temporal Awareness in LLMs on Evolving Knowledge
ACL 2025
TCSinger 2: Customizable Multilingual Zero-shot Singing Voice Synthesis
ACL 2025
STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation
ACL 2025
Versatile Framework for Song Generation with Prompt-based Control
EMNLP 2025
ADAptation: Reconstruction-based Unsupervised Active Learning for Breast Ultrasound Diagnosis
MICCAI 2025
Hierarchical Corpus-View-Category Refinement for Carotid Plaque Risk Grading in Ultrasound
MICCAI 2025
MReg: A Novel Regression Model with MoE-based Video Feature Mining for Mitral Regurgitation Diagnosis
MICCAI 2025
Spatio-temporal Pre-trained Foundation Model for Neural Decoding with Fine-grained Optimization
MICCAI 2025
Synthetic Singers: A Review of Deep-Learning-based Singing Voice Synthesis Approaches
IJCNLP 2025
ASAudio: A Survey of Advanced Spatial Audio Research
IJCNLP 2025
RA2FD: Distilling Faithfulness into Efficient Dialogue Systems
EMNLP 2024
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks
NIPS 2024
CE-VDG: Counterfactual Entropy-based Bias Reduction for Video-grounded Dialogue Generation
COLING 2024
Contrastive Learning Based ASR Robust Knowledge Selection For Spoken Dialogue System
INTERSPEECH 2023