Yujun Wang
18 papers · 2018–2026 · 3 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (2)
Interdisciplinary Bridge
Taxonomy Completionist (10)
Keyword Pioneer
Academic Marathon (7)
Cross-Pollinator (8)
Renaissance Researcher (6)
Topic Evolution
Dynamic Duo (10)
Prolific Year (6)
Unstoppable (6)
Keyword Collector (95)
Century Club (17)
Conferences
INTERSPEECH (16)
AAAI (1)
ACL (1)
Keywords
speech recognition (4)
transfer learning (3)
keyword spotting (3)
model compression (3)
self-supervised learning (2)
text-to-speech synthesis (2)
speaker adaptation (2)
audio encoder (2)
audio tagging (2)
knowledge distillation (2)
domain adaptation (1)
speech synthesis (1)
non-native speech (1)
multimodal learning (1)
contrastive learning (1)
attention mechanism (1)
few-shot learning (1)
cross-modal retrieval (1)
visual representation (1)
model architecture (1)
Papers
ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM
AAAI 2026
LLaVA Steering: Visual Instruction Tuning with 500x Fewer Parameters through Modality Linear Representation-Steering
ACL 2025
Bridging Language Gaps in Audio-Text Retrieval
INTERSPEECH 2024
Towards Expressive Zero-Shot Speech Synthesis with Hierarchical Prosody Modeling
INTERSPEECH 2024
Scaling up masked audio encoder learning for general audio classification
INTERSPEECH 2024
Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding
INTERSPEECH 2024
Streaming Audio Transformers for Online Audio Tagging
INTERSPEECH 2024
Speaker Change Detection with Weighted-sum Knowledge Distillation based on Self-supervised Pre-trained Models
INTERSPEECH 2024
LightClone: Speaker-guided Parallel Subnet Selection for Few-shot Voice Cloning
INTERSPEECH 2023
Improving Bilingual TTS Using Language And Phonology Embedding With Embedding Strength Modulator
INTERSPEECH 2023
Focus on the Sound around You: Monaural Target Speaker Extraction via Distance and Speaker Information
INTERSPEECH 2023
Exploring representation learning for small-footprint keyword spotting
INTERSPEECH 2022
UniKW-AT: Unified Keyword Spotting and Audio Tagging
INTERSPEECH 2022
speechocean762: An Open-Source Non-Native English Speech Corpus for Pronunciation Assessment
INTERSPEECH 2021
Exploiting Deep Sentential Context for Expressive End-to-End Speech Synthesis
INTERSPEECH 2020
Investigating Generative Adversarial Networks Based Speech Dereverberation for Robust Speech Recognition
INTERSPEECH 2018
Empirical Evaluation of Speaker Adaptation on DNN Based Acoustic Model
INTERSPEECH 2018
Attention-based End-to-End Models for Small-Footprint Keyword Spotting
INTERSPEECH 2018