Helin Wang

17 papers · 2020–2025 · 7 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🧭 Keyword Pioneer 🐝 Cross-Pollinator (10) 🌍 Conference Polyglot (7) 🏃 Academic Marathon (5) 🌈 Renaissance Researcher (6)

🧭 Keyword Pioneer 🌍 Conference Polyglot (7) 🏃 Academic Marathon (5) 🏆 Grand Slam 🧬 Topic Evolution 💎 Century Club (17) 🗃️ Keyword Collector (71) 🔥 Unstoppable (6)

Conferences

INTERSPEECH (11) AAAI (1) COLING (1) ICCV (1) ICLR (1) ICML (1) NIPS (1)

Top co-authors

Yuexian Zou (7) Dongchao Yang (5) Jianwei Yu (3) Najim Dehak (3) Dading Chong (3) Thomas Thebaud (3) Chao Weng (3) Wenwu Wang (3) Jiarui Hai (2) Jesus Villalba (2)

Keywords

diffusion model (3) acoustic scene classification (3) voice conversion (2) knowledge distillation (2) neural network (2) attention mechanism (2) temporal attention (2) unsupervised domain adaptation (1) dataset creation (1) benchmark evaluation (1) multimodal learning (1) video understanding (1) speech processing (1) machine reading comprehension (1) noise robustness (1) speech separation (1) audio source separation (1) speech dereverberation (1) data augmentation (1) token efficiency (1)

Papers

DisCo: Towards Distinct and Coherent Visual Encapsulation in Video MLLMs ICCV 2025 Audio Large Language Models Can Be Descriptive Speech Quality Evaluators ICLR 2025 ALMTokenizer: A Low-bitrate and Semantic-rich Audio Codec Tokenizer for Audio Language Modeling ICML 2025 Noise-robust Speech Separation with Fast Generative Correction INTERSPEECH 2024 DreamVoice: Text-Guided Voice Conversion INTERSPEECH 2024 Finding Spoken Identifications: Using GPT-4 Annotation for an Efficient and Fast Dataset Creation Pipeline COLING 2024 Benchmarking Large Language Models on CMExam - A comprehensive Chinese Medical Exam Dataset NIPS 2023 DuTa-VC: A Duration-aware Typical-to-atypical Voice Conversion Approach with Diffusion Probabilistic Model INTERSPEECH 2023 NoreSpeech: Knowledge Distillation based Conditional Diffusion Model for Noise-robust Expressive TTS INTERSPEECH 2023 Improving Target Sound Extraction with Timestamp Information INTERSPEECH 2022 Calibrate and Refine! A Novel and Agile Framework for ASR Error Robust Intent Detection INTERSPEECH 2022 RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection INTERSPEECH 2022 Unsupervised Multi-Target Domain Adaptation for Acoustic Scene Classification INTERSPEECH 2021 TeCANet: Temporal-Contextual Attention Network for Environment-Aware Speech Dereverberation INTERSPEECH 2021 Audio-Oriented Multimodal Machine Comprehension via Dynamic Inter- and Intra-modality Attention AAAI 2021 SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification INTERSPEECH 2021 Environmental Sound Classification with Parallel Temporal-Spectral Attention INTERSPEECH 2020