Joon-Hyuk Chang
39 papers · 2019–2026 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π§ Keyword Pioneer π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (22) π Renaissance Researcher (6) π Conference Polyglot (3)
π£
Hot Topic Early Bird
π§
Keyword Pioneer
π
Renaissance Researcher
(6)
π
Conference Loyalist
(36)
π§¬
Topic Evolution
π
Keyword Champion
(2)
π
Century Club
(38)
ποΈ
Keyword Collector
(61)
β‘
Prolific Year
(15)
π₯
Unstoppable
(6)
Conferences
INTERSPEECH (36)
AAAI (1)
ICLR (1)
ICML (1)
Top co-authors
Keywords
automatic speech recognition
(7)
speaker embedding
(6)
wav2vec 2.0
(3)
speaker verification
(3)
knowledge distillation
(3)
continual learning
(3)
speech enhancement
(3)
speech synthesis
(3)
deep neural network
(3)
transformer encoder
(2)
sound source localization
(2)
diffusion model
(2)
domain adaptation
(2)
self-supervised learning
(2)
speaker diarization
(2)
representation learning
(2)
adversarial training
(2)
online learning
(2)
generative model
(2)
speaker adaptation
(2)
Papers
OnEDIT: Online Editing with Decoupled Implicit Task for Large Language Models
AAAI 2026
Stationary Latent Weight Inference for Unreliable Observations from Online Test-Time Adaptation
ICML 2024
Continual Momentum Filtering on Parameter Space for Online Test-time Adaptation
ICLR 2024
Guided conditioning with predictive network on score-based diffusion model for speech enhancement
INTERSPEECH 2024
TSP-TTS: Text-based Style Predictor with Residual Vector Quantization for Expressive Text-to-Speech
INTERSPEECH 2024
Whisper Multilingual Downstream Task Tuning Using Task Vectors
INTERSPEECH 2024
Online Subloop Search via Uncertainty Quantization for Efficient Test-Time Adaptation
INTERSPEECH 2024
Sound of Vision: Audio Generation from Visual Text Embedding through Training Domain Discriminator
INTERSPEECH 2024
Retrieval-Augmented Classifier Guidance for Audio Generation
INTERSPEECH 2024
Efficient Speaker Embedding Extraction Using a Twofold Sliding Window Algorithm for Speaker Diarization
INTERSPEECH 2024
Mitigating Overfitting in Structured Pruning of ASR Models with Gradient-Guided Parameter Regularization
INTERSPEECH 2024
Enhancing Multimodal Emotion Recognition through ASR Error Compensation and LLM Fine-Tuning
INTERSPEECH 2024
Neural ATSM: Fully Neural Network-based Adaptive Time-Scale Modification Using Sentence-Specific Dynamic Control
INTERSPEECH 2024
H4C-TTS: Leveraging Multi-Modal Historical Context for Conversational Text-to-Speech
INTERSPEECH 2024
Balanced-Wav2Vec: Enhancing Stability and Robustness of Representation Learning Through Sample Reweighting Techniques
INTERSPEECH 2024
Few-Shot Keyword-Incremental Learning with Total Calibration
INTERSPEECH 2024
Resolution Consistency Training on Time-Frequency Domain for Semi-Supervised Sound Event Detection
INTERSPEECH 2023
General-purpose Adversarial Training for Enhanced Automatic Speech Recognition Model Generalization
INTERSPEECH 2023
Intra-ensemble: A New Method for Combining Intermediate Outputs in Transformer-based Automatic Speech Recognition
INTERSPEECH 2023
HAD-ANC: A Hybrid System Comprising an Adaptive Filter and Deep Neural Networks for Active Noise Control
INTERSPEECH 2023
Self-Distillation into Self-Attention Heads for Improving Transformer-based End-to-End Neural Speaker Diarization
INTERSPEECH 2023
Deeply Supervised Curriculum Learning for Deep Neural Network-based Sound Source Localization
INTERSPEECH 2023
SR-SRP: Super-Resolution based SRP-PHAT for Sound Source Localization and Tracking
INTERSPEECH 2023
Prior-free Guided TTS: An Improved and Efficient Diffusion-based Text-Guided Speech Synthesis
INTERSPEECH 2023
Improving Joint Speech and Emotion Recognition Using Global Style Tokens
INTERSPEECH 2023
Adversarial and Sequential Training for Cross-lingual Prosody Transfer TTS
INTERSPEECH 2022
Advanced Speaker Embedding with Predictive Variance of Gaussian Distribution for Speaker Adaptation in TTS
INTERSPEECH 2022
W2V2-Light: A Lightweight Version of Wav2vec 2.0 for Automatic Speech Recognition
INTERSPEECH 2022
CTRL: Continual Representation Learning to Transfer Information of Pre-trained for WAV2VEC 2.0
INTERSPEECH 2022
FiLM Conditioning with Enhanced Feature to the Transformer-based End-to-End Noisy Speech Recognition
INTERSPEECH 2022
Regularizing Transformer-based Acoustic Models by Penalizing Attention Weights
INTERSPEECH 2022
Improved CNN-Transformer using Broadcasted Residual Learning for Text-Independent Speaker Verification
INTERSPEECH 2022
Convolutional Recurrent Neural Network with Auxiliary Stream for Robust Variable-Length Acoustic Scene Classification
INTERSPEECH 2022
HYU Submission for the SASV Challenge 2022: Reforming Speaker Embeddings with Spoofing-Aware Conditioning
INTERSPEECH 2022
One-Shot Speaker Adaptation Based on Initialization by Generative Adversarial Networks for TTS
INTERSPEECH 2022
Deep Neural Network Calibration for E2E Speech Recognition System
INTERSPEECH 2021
Virtual Acoustic Channel Expansion Based on Neural Networks for Weighted Prediction Error-Based Speech Dereverberation
INTERSPEECH 2020
Attention Wave-U-Net for Acoustic Echo Cancellation
INTERSPEECH 2020
Joint Optimization of Neural Acoustic Beamforming and Dereverberation with x-Vectors for Robust Speaker Verification
INTERSPEECH 2019