conftrace_

Yongqi Wang

13 papers · 2023–2025 · 6 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+6 more ↓

🐝 Cross-Pollinator (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (6) 🌈 Renaissance Researcher (7)

🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🤝 Dynamic Duo (11) ⚡ Prolific Year (10) 🗃️ Keyword Collector (70) 💎 Century Club (13)

Conferences

ACL (5) NIPS (3) AAAI (2) ICML (1) IJCAI (1) NAACL (1)

Top co-authors

Zhou Zhao (11) Rongjie Huang (10) Ruiqi Li (7) Zhiqing Hong (6) Zehan Wang (4) Fuming You (4) Li Tang (3) Luping Liu (2) Dongchao Yang (2) Xize Cheng (2)

Keywords

singing voice synthesis (5) self-supervised learning (2) visual relationship detection (2) video understanding (2) multimodal learning (2) multi-modal learning (2) contrastive learning (2) discrete representation (2) object detection (1) multilingual nlp (1) style transfer (1) voice conversion (1) speech synthesis (1) zero-shot conversion (1) zero-shot learning (1) attention mechanism (1) semantic alignment (1) embedding space (1) cross-modal retrieval (1) embedding learning (1)

Papers

TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching AAAI 2025 METOR: A Unified Framework for Mutual Enhancement of Objects and Relationships in Open-vocabulary Video Visual Relationship Detection IJCAI 2025 Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching NIPS 2024 Multi-Modal Prompting for Open-Vocabulary Video Visual Relationship Detection AAAI 2024 Text-to-Song: Towards Controllable Music Generation Incorporating Vocal and Accompaniment ACL 2024 Robust Singing Voice Transcription Serves Synthesis ACL 2024 Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners ACL 2024 Speech-to-Speech Translation with Discrete-Unit-Based Style Transfer ACL 2024 Self-Supervised Singing Voice Pre-Training towards Speech-to-Singing Conversion ACL 2024 InstructSpeech: Following Speech Editing Instructions via Large Language Models ICML 2024 Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt NAACL 2024 MoMu-Diffusion: On Learning Long-Term Motion-Music Synchronization and Correspondence NIPS 2024 Connecting Multi-modal Contrastive Representations NIPS 2023