Jiangyan Yi

34 papers · 2017–2025 · 4 top CS/AI conferences

Achievements

🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (14) 🌍 Conference Polyglot (4)

🏃 Academic Marathon (8) 🏠 Conference Loyalist (27) 🤝 Dynamic Duo (32) 🏆 Keyword Champion (3) 🔥 Unstoppable (7) ⚡ Prolific Year (10) ❓ The Questioner 🗃️ Keyword Collector (129) 💎 Century Club (34)

Conferences

INTERSPEECH (27) AAAI (3) ICML (3) COLING (1)

Top co-authors

Jianhua Tao (32) Zhengqi Wen (19) Zhengkun Tian (11) Ye Bai (11) Cunhang Fan (8) Chenglong Wang (8) Shuai Zhang (8) Ruibo Fu (8) Tao Wang (8) Chu Yuan Zhang (6)

Keywords

fake audio detection (5) catastrophic forgetting (4) automatic speech recognition (4) speech recognition (4) audio deepfake detection (4) knowledge distillation (4) attention mechanism (4) continual learning (4) end-to-end speech recognition (3) model compression (3) end-to-end model (3) speaker representation (3) deep clustering (3) punctuation prediction (2) character error rate (2) speaker adaptation (2) one-shot learning (2) deep embedding (2) speech synthesis (2) connectionist temporal classification (2)

Papers

OV-MER: Towards Open-Vocabulary Multimodal Emotion Recognition ICML 2025
AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models ICML 2025
Code-switching Mediated Sentence-level Semantic Learning AAAI 2025
Region-Based Optimization in Continual Learning for Audio Deepfake Detection AAAI 2025
What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection AAAI 2024
NLoPT: N-gram Enhanced Low-Rank Task Adaptive Pre-training for Efficient Language Model Adaption COLING 2024
Frequency-mix Knowledge Distillation for Fake Speech Detection INTERSPEECH 2024
TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking INTERSPEECH 2024
RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection INTERSPEECH 2024
Residual Speaker Representation for One-Shot Voice Conversion INTERSPEECH 2024
Enhancing Partially Spoofed Audio Localization with Boundary-aware Attention Mechanism INTERSPEECH 2024
Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection ICML 2023
TO-Rawnet: Improving RawNet with TCN and Orthogonal Regularization for Fake Audio Detection INTERSPEECH 2023
Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features INTERSPEECH 2023
Reducing Multilingual Context Confusion for End-to-End Code-Switching Automatic Speech Recognition INTERSPEECH 2022
Half-Truth: A Partially Fake Audio Detection Dataset INTERSPEECH 2021
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization INTERSPEECH 2021
End-to-End Spelling Correction Conditioned on Acoustic Feature for Code-Switching Speech Recognition INTERSPEECH 2021
Continual Learning for Fake Audio Detection INTERSPEECH 2021
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations INTERSPEECH 2020
Dynamic Speaker Representations Adjustment and Decoder Factorization for Speaker Adaptation in End-to-End Speech Synthesis INTERSPEECH 2020
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition INTERSPEECH 2020
Focal Loss for Punctuation Prediction INTERSPEECH 2020
Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis INTERSPEECH 2020
Spoken Content and Voice Factorization for Few-Shot Speaker Adaptation INTERSPEECH 2020
Gated Recurrent Fusion of Spatial and Spectral Features for Multi-Channel Speech Separation with Deep Embedding Representations INTERSPEECH 2020
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition INTERSPEECH 2020
Non-Autoregressive End-to-End TTS with Coarse-to-Fine Decoding INTERSPEECH 2020
Bi-Level Speaker Supervision for One-Shot Speech Synthesis INTERSPEECH 2020
Self-Attention Transducers for End-to-End Speech Recognition INTERSPEECH 2019
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features INTERSPEECH 2019
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting INTERSPEECH 2019
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition INTERSPEECH 2019
Distilling Knowledge from an Ensemble of Models for Punctuation Prediction INTERSPEECH 2017