conftrace_

Joon-Hyuk Chang

39 papers · 2019–2026 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+10 more ↓

🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (22) 🌈 Renaissance Researcher (6) 🌍 Conference Polyglot (3)

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌈 Renaissance Researcher (6) 🏠 Conference Loyalist (36) 🧬 Topic Evolution 🏆 Keyword Champion (2) 💎 Century Club (38) 🗃️ Keyword Collector (61) ⚡ Prolific Year (15) 🔥 Unstoppable (6)

Conferences

INTERSPEECH (36) AAAI (1) ICLR (1) ICML (1)

Top co-authors

Jae-Hong Lee (8) Jeong-Hwan Choi (6) Ye-Rin Jeoung (5) Won-Gook Choi (5) Joon-Young Yang (5) Dohee Kim (4) Ju-Seok Seong (4) Mun-Hak Lee (3) Jaeuk Lee (3) Jehyun Kyung (3)

Keywords

automatic speech recognition (7) speaker embedding (6) wav2vec 2.0 (3) speaker verification (3) knowledge distillation (3) continual learning (3) speech enhancement (3) speech synthesis (3) deep neural network (3) transformer encoder (2) sound source localization (2) diffusion model (2) domain adaptation (2) self-supervised learning (2) speaker diarization (2) representation learning (2) adversarial training (2) online learning (2) generative model (2) speaker adaptation (2)

Papers

OnEDIT: Online Editing with Decoupled Implicit Task for Large Language Models AAAI 2026 Stationary Latent Weight Inference for Unreliable Observations from Online Test-Time Adaptation ICML 2024 Continual Momentum Filtering on Parameter Space for Online Test-time Adaptation ICLR 2024 Guided conditioning with predictive network on score-based diffusion model for speech enhancement INTERSPEECH 2024 TSP-TTS: Text-based Style Predictor with Residual Vector Quantization for Expressive Text-to-Speech INTERSPEECH 2024 Whisper Multilingual Downstream Task Tuning Using Task Vectors INTERSPEECH 2024 Online Subloop Search via Uncertainty Quantization for Efficient Test-Time Adaptation INTERSPEECH 2024 Sound of Vision: Audio Generation from Visual Text Embedding through Training Domain Discriminator INTERSPEECH 2024 Retrieval-Augmented Classifier Guidance for Audio Generation INTERSPEECH 2024 Efficient Speaker Embedding Extraction Using a Twofold Sliding Window Algorithm for Speaker Diarization INTERSPEECH 2024 Mitigating Overfitting in Structured Pruning of ASR Models with Gradient-Guided Parameter Regularization INTERSPEECH 2024 Enhancing Multimodal Emotion Recognition through ASR Error Compensation and LLM Fine-Tuning INTERSPEECH 2024 Neural ATSM: Fully Neural Network-based Adaptive Time-Scale Modification Using Sentence-Specific Dynamic Control INTERSPEECH 2024 H4C-TTS: Leveraging Multi-Modal Historical Context for Conversational Text-to-Speech INTERSPEECH 2024 Balanced-Wav2Vec: Enhancing Stability and Robustness of Representation Learning Through Sample Reweighting Techniques INTERSPEECH 2024 Few-Shot Keyword-Incremental Learning with Total Calibration INTERSPEECH 2024 Resolution Consistency Training on Time-Frequency Domain for Semi-Supervised Sound Event Detection INTERSPEECH 2023 General-purpose Adversarial Training for Enhanced Automatic Speech Recognition Model Generalization INTERSPEECH 2023 Intra-ensemble: A New Method for Combining Intermediate Outputs in Transformer-based Automatic Speech Recognition INTERSPEECH 2023 HAD-ANC: A Hybrid System Comprising an Adaptive Filter and Deep Neural Networks for Active Noise Control INTERSPEECH 2023 Self-Distillation into Self-Attention Heads for Improving Transformer-based End-to-End Neural Speaker Diarization INTERSPEECH 2023 Deeply Supervised Curriculum Learning for Deep Neural Network-based Sound Source Localization INTERSPEECH 2023 SR-SRP: Super-Resolution based SRP-PHAT for Sound Source Localization and Tracking INTERSPEECH 2023 Prior-free Guided TTS: An Improved and Efficient Diffusion-based Text-Guided Speech Synthesis INTERSPEECH 2023 Improving Joint Speech and Emotion Recognition Using Global Style Tokens INTERSPEECH 2023 Adversarial and Sequential Training for Cross-lingual Prosody Transfer TTS INTERSPEECH 2022 Advanced Speaker Embedding with Predictive Variance of Gaussian Distribution for Speaker Adaptation in TTS INTERSPEECH 2022 W2V2-Light: A Lightweight Version of Wav2vec 2.0 for Automatic Speech Recognition INTERSPEECH 2022 CTRL: Continual Representation Learning to Transfer Information of Pre-trained for WAV2VEC 2.0 INTERSPEECH 2022 FiLM Conditioning with Enhanced Feature to the Transformer-based End-to-End Noisy Speech Recognition INTERSPEECH 2022 Regularizing Transformer-based Acoustic Models by Penalizing Attention Weights INTERSPEECH 2022 Improved CNN-Transformer using Broadcasted Residual Learning for Text-Independent Speaker Verification INTERSPEECH 2022 Convolutional Recurrent Neural Network with Auxiliary Stream for Robust Variable-Length Acoustic Scene Classification INTERSPEECH 2022 HYU Submission for the SASV Challenge 2022: Reforming Speaker Embeddings with Spoofing-Aware Conditioning INTERSPEECH 2022 One-Shot Speaker Adaptation Based on Initialization by Generative Adversarial Networks for TTS INTERSPEECH 2022 Deep Neural Network Calibration for E2E Speech Recognition System INTERSPEECH 2021 Virtual Acoustic Channel Expansion Based on Neural Networks for Weighted Prediction Error-Based Speech Dereverberation INTERSPEECH 2020 Attention Wave-U-Net for Acoustic Echo Cancellation INTERSPEECH 2020 Joint Optimization of Neural Acoustic Beamforming and Dereverberation with x-Vectors for Robust Speaker Verification INTERSPEECH 2019