Wenwu Wang
28 papers · 2015–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
🏃 Academic Marathon (10) 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (14)
🌈
Renaissance Researcher
(10)
🌍
Conference Polyglot
(7)
🏃
Academic Marathon
(10)
🤝
Dynamic Duo
(12)
🏆
Keyword Champion
(3)
💎
Century Club
(28)
📈
Trend Setter
🚀
Conference Pioneer
⚡
Prolific Year
(8)
🗃️
Keyword Collector
(143)
🔥
Unstoppable
(7)
Conferences
INTERSPEECH (18)
AAAI (2)
EMNLP (2)
ICML (2)
JMLR (2)
ACL (1)
IJCAI (1)
Top co-authors
Research topics
Keywords
attention mechanism
(4)
audio tagging
(3)
audio captioning
(3)
generative adversarial network
(2)
acoustic scene classification
(2)
derivative estimation
(2)
attention pooling
(2)
feature fusion
(2)
personalized dialogue
(2)
source separation
(2)
contrastive learning
(2)
embedding learning
(2)
dialogue generation
(2)
nonparametric regression
(2)
audio representation
(2)
multimodal learning
(2)
audio classification
(2)
graph representation learning
(1)
blind source separation
(1)
sparse representation
(1)
Papers
Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey
EMNLP 2025
PFCA-Net: Pyramid Feature Fusion and Cross Content Attention Network for Automated Audio Captioning
INTERSPEECH 2024
Selective Prompting Tuning for Personalized Conversations with LLMs
ACL 2024
Efficient Audio Captioning with Encoder-Level Knowledge Distillation
INTERSPEECH 2024
Learning Temporal Resolution in Spectrogram for Audio Classification
AAAI 2024
Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning
INTERSPEECH 2023
Ontology-aware Learning and Evaluation for Audio Tagging
INTERSPEECH 2023
Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention
INTERSPEECH 2023
Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning
INTERSPEECH 2023
Personalized Dialogue Generation with Persona-Adaptive Attention
AAAI 2023
Learning Retrieval Augmentation for Personalized Dialogue Generation
EMNLP 2023
Adapting Language-Audio Models as Few-Shot Audio Learners
INTERSPEECH 2023
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models
ICML 2023
Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter
INTERSPEECH 2022
DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting
ICML 2022
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection
INTERSPEECH 2022
Separate What You Describe: Language-Queried Audio Source Separation
INTERSPEECH 2022
On Metric Learning for Audio-Text Cross-Modal Retrieval
INTERSPEECH 2022
Crossfire Conditional Generative Adversarial Networks for Singing Voice Extraction
INTERSPEECH 2021
SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
INTERSPEECH 2021
Environmental Sound Classification with Parallel Temporal-Spectral Attention
INTERSPEECH 2020
Gated Multi-Head Attention Pooling for Weakly Labelled Audio Tagging
INTERSPEECH 2020
Robust Estimation of Derivatives Using Locally Weighted Least Absolute Deviation Regression
JMLR 2019
Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks
IJCAI 2019
Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging
INTERSPEECH 2017
Matrix of Polynomials Model Based Polynomial Dictionary Learning Method for Acoustic Impulse Response Modeling
INTERSPEECH 2017
Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm
INTERSPEECH 2016
Derivative Estimation Based on Difference Sequence via Locally Weighted Least Squares Regression
JMLR 2015