Yongqi Wang
13 papers · 2023–2025 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🐝 Cross-Pollinator (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🌈 Renaissance Researcher (7)
🌉
Interdisciplinary Bridge
🧭
Keyword Pioneer
🤝
Dynamic Duo
(11)
⚡
Prolific Year
(10)
🗃️
Keyword Collector
(70)
💎
Century Club
(13)
Conferences
ACL (5)
NIPS (3)
AAAI (2)
ICML (1)
IJCAI (1)
NAACL (1)
Top co-authors
Keywords
singing voice synthesis
(5)
self-supervised learning
(2)
visual relationship detection
(2)
video understanding
(2)
multimodal learning
(2)
multi-modal learning
(2)
contrastive learning
(2)
discrete representation
(2)
object detection
(1)
multilingual nlp
(1)
style transfer
(1)
voice conversion
(1)
speech synthesis
(1)
zero-shot conversion
(1)
zero-shot learning
(1)
attention mechanism
(1)
semantic alignment
(1)
embedding space
(1)
cross-modal retrieval
(1)
embedding learning
(1)
Papers
TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching
AAAI 2025
METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection
IJCAI 2025
Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching
NIPS 2024
Multi-Modal Prompting for Open-Vocabulary Video Visual Relationship Detection
AAAI 2024
Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment
ACL 2024
Robust Singing Voice Transcription Serves Synthesis
ACL 2024
Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners
ACL 2024
Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer
ACL 2024
Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion
ACL 2024
InstructSpeech: Following Speech Editing Instructions via Large Language Models
ICML 2024
Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt
NAACL 2024
MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence
NIPS 2024
Connecting Multi-modal Contrastive Representations
NIPS 2023