Yujun Wang

18 papers · 2018–2026 · 3 top CS/AI conferences

Achievements

🌍 Conference Polyglot (2) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (10) 🧭 Keyword Pioneer 🏃 Academic Marathon (7)

🐝 Cross-Pollinator (8) 🌈 Renaissance Researcher (6) 🧬 Topic Evolution 🤝 Dynamic Duo (10) ⚡ Prolific Year (6) 🔥 Unstoppable (6) 🗃️ Keyword Collector (95) 💎 Century Club (17)

Conferences

INTERSPEECH (16) AAAI (1) ACL (1)

Top co-authors

Junbo Zhang (10) Zhiyong Yan (7) Yongqing Wang (7) Heinrich Dinkel (6) Lei Xie (5) Bin Wang (4) Fengyu Yang (3) Meng Meng (2) Yunpu Ma (2) peng gao (2)

Keywords

speech recognition (4) transfer learning (3) keyword spotting (3) model compression (3) self-supervised learning (2) text-to-speech synthesis (2) speaker adaptation (2) audio encoder (2) audio tagging (2) knowledge distillation (2) domain adaptation (1) speech synthesis (1) non-native speech (1) multimodal learning (1) contrastive learning (1) attention mechanism (1) few-shot learning (1) cross-modal retrieval (1) visual representation (1) model architecture (1)

Papers

ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM · AAAI 2026
LLaVA Steering: Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering · ACL 2025
Bridging Language Gaps in Audio-Text Retrieval · INTERSPEECH 2024
Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling · INTERSPEECH 2024
Scaling up masked audio encoder learning for general audio classification · INTERSPEECH 2024
Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding · INTERSPEECH 2024
Streaming Audio Transformers for Online Audio Tagging · INTERSPEECH 2024
Speaker Change Detection with Weighted-sum Knowledge Distillation based on Self-supervised Pre-trained Models · INTERSPEECH 2024
LightClone: Speaker-guided Parallel Subnet Selection for Few-shot Voice Cloning · INTERSPEECH 2023
Improving Bilingual TTS Using Language And Phonology Embedding With Embedding Strength Modulator · INTERSPEECH 2023
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information · INTERSPEECH 2023
Exploring representation learning for small-footprint keyword spotting · INTERSPEECH 2022
UniKW-AT: Unified Keyword Spotting and Audio Tagging · INTERSPEECH 2022
speechocean762: An Open-Source Non-Native English Speech Corpus for Pronunciation Assessment · INTERSPEECH 2021
Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis · INTERSPEECH 2020
Investigating Generative Adversarial Networks Based Speech Dereverberation for Robust Speech Recognition · INTERSPEECH 2018
Empirical Evaluation of Speaker Adaptation on DNN Based Acoustic Model · INTERSPEECH 2018
Attention-based End-to-End Models for Small-Footprint Keyword Spotting · INTERSPEECH 2018