Wenwu Wang

28 papers · 2015–2025 · 7 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🏃 Academic Marathon (10) 🌍 Conference Polyglot (7) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🐝 Cross-Pollinator (14)

🌈 Renaissance Researcher (10) 🌍 Conference Polyglot (7) 🏃 Academic Marathon (10) 🤝 Dynamic Duo (12) 🏆 Keyword Champion (3) 💎 Century Club (28) 📈 Trend Setter 🚀 Conference Pioneer ⚡ Prolific Year (8) 🗃️ Keyword Collector (143) 🔥 Unstoppable (7)

Conferences

INTERSPEECH (18) AAAI (2) EMNLP (2) ICML (2) JMLR (2) ACL (1) IJCAI (1)

Top co-authors

Xubo Liu (12) Mark D. Plumbley (10) Haohe Liu (8) Qiuqiang Kong (6) Xinhao Mei (6) Qiushi Huang (5) Jianyuan Sun (4) Tom Ko (4) Yu Zhang (4) Yuexian Zou (4)

Research topics

Techniques (1) Analysis (1)

Keywords

attention mechanism (4) audio tagging (3) audio captioning (3) generative adversarial network (2) acoustic scene classification (2) derivative estimation (2) attention pooling (2) feature fusion (2) personalized dialogue (2) source separation (2) contrastive learning (2) embedding learning (2) dialogue generation (2) nonparametric regression (2) audio representation (2) multimodal learning (2) audio classification (2) graph representation learning (1) blind source separation (1) sparse representation (1)

Papers

Towards Controllable Speech Synthesis in the Era of Large Language Models: A Systematic Survey EMNLP 2025 PFCA-Net: Pyramid Feature Fusion and Cross Content Attention Network for Automated Audio Captioning INTERSPEECH 2024 Selective Prompting Tuning for Personalized Conversations with LLMs ACL 2024 Efficient Audio Captioning with Encoder-Level Knowledge Distillation INTERSPEECH 2024 Learning Temporal Resolution in Spectrogram for Audio Classification AAAI 2024 Dual Transformer Decoder based Features Fusion Network for Automated Audio Captioning INTERSPEECH 2023 Ontology-aware Learning and Evaluation for Audio Tagging INTERSPEECH 2023 Visually-Aware Audio Captioning With Adaptive Audio-Visual Attention INTERSPEECH 2023 Joint Prediction of Audio Event and Annoyance Rating in an Urban Soundscape by Hierarchical Graph Representation Learning INTERSPEECH 2023 Personalized Dialogue Generation with Persona-Adaptive Attention AAAI 2023 Learning Retrieval Augmentation for Personalized Dialogue Generation EMNLP 2023 Adapting Language-Audio Models as Few-Shot Audio Learners INTERSPEECH 2023 AudioLDM: Text-to-Audio Generation with Latent Diffusion Models ICML 2023 Audio Visual Multi-Speaker Tracking with Improved GCF and PMBM Filter INTERSPEECH 2022 DSTAGNN: Dynamic Spatial-Temporal Aware Graph Neural Network for Traffic Flow Forecasting ICML 2022 RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection INTERSPEECH 2022 Separate What You Describe: Language-Queried Audio Source Separation INTERSPEECH 2022 On Metric Learning for Audio-Text Cross-Modal Retrieval INTERSPEECH 2022 Crossfire Conditional Generative Adversarial Networks for Singing Voice Extraction INTERSPEECH 2021 SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification INTERSPEECH 2021 Environmental Sound Classification with Parallel Temporal-Spectral Attention INTERSPEECH 2020 Gated Multi-Head Attention Pooling for Weakly Labelled Audio Tagging INTERSPEECH 2020 Robust Estimation of Derivatives Using Locally Weighted Least Absolute Deviation Regression JMLR 2019 Single-Channel Signal Separation and Deconvolution with Generative Adversarial Networks IJCAI 2019 Attention and Localization Based on a Deep Convolutional Recurrent Model for Weakly Supervised Audio Tagging INTERSPEECH 2017 Matrix of Polynomials Model Based Polynomial Dictionary Learning Method for Acoustic Impulse Response Modeling INTERSPEECH 2017 Predicting Binaural Speech Intelligibility from Signals Estimated by a Blind Source Separation Algorithm INTERSPEECH 2016 Derivative Estimation Based on Difference Sequence via Locally Weighted Least Squares Regression JMLR 2015