Xubo Liu
20 papers · 2021–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
🌍 Conference Polyglot (8) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🐝 Cross-Pollinator (12)
🌈
Renaissance Researcher
(9)
🌍
Conference Polyglot
(8)
🤝
Dynamic Duo
(12)
🔥
Unstoppable
(5)
💎
Century Club
(20)
📈
Trend Setter
⚡
Prolific Year
(5)
🗃️
Keyword Collector
(103)
Conferences
INTERSPEECH (10)
AAAI (2)
EMNLP (2)
ICML (2)
ACL (1)
COLING (1)
CVPR (1)
ICLR (1)
Top co-authors
Research topics
Keywords
multimodal learning
(3)
neural vocoder
(2)
text-to-audio generation
(2)
prompt tuning
(2)
dialogue generation
(2)
contrastive learning
(2)
audio classification
(2)
personalized dialogue
(2)
audio representation
(2)
source localization
(1)
question answering
(1)
natural language generation
(1)
transfer learning
(1)
response generation
(1)
metric learning
(1)
speech recognition
(1)
speech enhancement
(1)
machine translation
(1)
few-shot learning
(1)
low-resource learning
(1)
Papers
RiTTA: Modeling Event Relations in Text-to-Audio Generation
EMNLP 2025
Scaling Transformers for Low-Bitrate High-Quality Speech Coding
ICLR 2025
ALMTokenizer: A Low-bitrate and Semantic-rich Audio Codec Tokenizer for Audio Language Modeling
ICML 2025
Look before You Leap: Dual Logical Verification for Knowledge-based Visual Question Generation
COLING 2024
Learning Temporal Resolution in Spectrogram for Audio Classification
AAAI 2024
Selective Prompting Tuning for Personalized Conversations with LLMs
ACL 2024
Personalized Dialogue Generation with Persona-Adaptive Attention
AAAI 2023
SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
CVPR 2023
Learning Retrieval Augmentation for Personalized Dialogue Generation
EMNLP 2023
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
ICML 2023
Adapting Language-Audio Models as Few-Shot Audio Learners
INTERSPEECH 2023
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention
INTERSPEECH 2023
Ontology-aware Learning and Evaluation for Audio Tagging
INTERSPEECH 2023
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
INTERSPEECH 2023
Neural Vocoder is All You Need for Speech Super-resolution
INTERSPEECH 2022
VoiceFixer: A Unified Framework for High-Fidelity Speech Restoration
INTERSPEECH 2022
Separate What You Describe: Language-Queried Audio Source Separation
INTERSPEECH 2022
Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter
INTERSPEECH 2022
On Metric Learning for Audio-Text Cross-Modal Retrieval
INTERSPEECH 2022
Token-Level Supervised Contrastive Learning for Punctuation Restoration
INTERSPEECH 2021