Helin Wang
17 papers · 2020–2025 · 7 top CS/AI conferences
Achievements
🧭 Keyword Pioneer
🐝 Cross-Pollinator (10)
🌍 Conference Polyglot (7)
🏃 Academic Marathon (5)
🌈 Renaissance Researcher (6)
🏆 Grand Slam
🧬 Topic Evolution
💎 Century Club (17)
🗃️ Keyword Collector (71)
🔥 Unstoppable (6)
Conferences
INTERSPEECH (11)
AAAI (1)
COLING (1)
ICCV (1)
ICLR (1)
ICML (1)
NeurIPS (1)
Keywords
diffusion model (3)
acoustic scene classification (3)
voice conversion (2)
knowledge distillation (2)
neural network (2)
attention mechanism (2)
temporal attention (2)
unsupervised domain adaptation (1)
dataset creation (1)
benchmark evaluation (1)
multimodal learning (1)
video understanding (1)
speech processing (1)
machine reading comprehension (1)
noise robustness (1)
speech separation (1)
audio source separation (1)
speech dereverberation (1)
data augmentation (1)
token efficiency (1)
Papers
DisCo: Towards Distinct and Coherent Visual Encapsulation in Video MLLMs
ICCV 2025
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators
ICLR 2025
ALMTokenizer: A Low-bitrate and Semantic-rich Audio Codec Tokenizer for Audio Language Modeling
ICML 2025
Noise-robust Speech Separation with Fast Generative Correction
INTERSPEECH 2024
DreamVoice: Text-Guided Voice Conversion
INTERSPEECH 2024
Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline
COLING 2024
Benchmarking Large Language Models on CMExam - A comprehensive Chinese Medical Exam Dataset
NeurIPS 2023
DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model
INTERSPEECH 2023
NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS
INTERSPEECH 2023
Improving Target Sound Extraction with Timestamp Information
INTERSPEECH 2022
Calibrate and Refine! A Novel and Agile Framework for ASR Error Robust Intent Detection
INTERSPEECH 2022
RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection
INTERSPEECH 2022
Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification
INTERSPEECH 2021
TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation
INTERSPEECH 2021
Audio-Oriented Multimodal Machine Comprehension via Dynamic Inter- and Intra-modality Attention
AAAI 2021
SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification
INTERSPEECH 2021
Environmental Sound Classification with Parallel Temporal-Spectral Attention
INTERSPEECH 2020