Jiangyan Yi
34 papers · 2017–2025 · 4 top CS/AI conferences
Achievements
Hot Topic Early Bird
Keyword Pioneer
Interdisciplinary Bridge
Taxonomy Completionist (14)
Conference Polyglot (4)
Academic Marathon (8)
Conference Loyalist (27)
Dynamic Duo (32)
Keyword Champion (3)
Unstoppable (7)
Prolific Year (10)
The Questioner
Keyword Collector (129)
Century Club (34)
Conferences
INTERSPEECH (27)
AAAI (3)
ICML (3)
COLING (1)
Keywords
fake audio detection (5)
catastrophic forgetting (4)
automatic speech recognition (4)
speech recognition (4)
audio deepfake detection (4)
knowledge distillation (4)
attention mechanism (4)
continual learning (4)
end-to-end speech recognition (3)
model compression (3)
end-to-end model (3)
speaker representation (3)
deep clustering (3)
punctuation prediction (2)
character error rate (2)
speaker adaptation (2)
one-shot learning (2)
deep embedding (2)
speech synthesis (2)
connectionist temporal classification (2)
Papers
OV-MER: Towards Open-Vocabulary Multimodal Emotion Recognition
ICML 2025
AffectGPT: A New Dataset, Model, and Benchmark for Emotion Understanding with Multimodal Large Language Models
ICML 2025
Code-switching Mediated Sentence-level Semantic Learning
AAAI 2025
Region-Based Optimization in Continual Learning for Audio Deepfake Detection
AAAI 2025
What to Remember: Self-Adaptive Continual Learning for Audio Deepfake Detection
AAAI 2024
NLoPT: N-gram Enhanced Low-Rank Task Adaptive Pre-training for Efficient Language Model Adaption
COLING 2024
Frequency-mix Knowledge Distillation for Fake Speech Detection
INTERSPEECH 2024
TraceableSpeech: Towards Proactively Traceable Text-to-Speech with Watermarking
INTERSPEECH 2024
RawBMamba: End-to-End Bidirectional State Space Model for Audio Deepfake Detection
INTERSPEECH 2024
Residual Speaker Representation for One-Shot Voice Conversion
INTERSPEECH 2024
Enhancing Partially Spoofed Audio Localization with Boundary-aware Attention Mechanism
INTERSPEECH 2024
Do You Remember? Overcoming Catastrophic Forgetting for Fake Audio Detection
ICML 2023
TO-Rawnet: Improving RawNet with TCN and Orthogonal Regularization for Fake Audio Detection
INTERSPEECH 2023
Detection of Cross-Dataset Fake Audio Based on Prosodic and Pronunciation Features
INTERSPEECH 2023
Reducing Multilingual Context Confusion for End-to-End Code-Switching Automatic Speech Recognition
INTERSPEECH 2022
Half-Truth: A Partially Fake Audio Detection Dataset
INTERSPEECH 2021
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization
INTERSPEECH 2021
End-to-End Spelling Correction Conditioned on Acoustic Feature for Code-Switching Speech Recognition
INTERSPEECH 2021
Continual Learning for Fake Audio Detection
INTERSPEECH 2021
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations
INTERSPEECH 2020
Dynamic Speaker Representations Adjustment and Decoder Factorization for Speaker Adaptation in End-to-End Speech Synthesis
INTERSPEECH 2020
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition
INTERSPEECH 2020
Focal Loss for Punctuation Prediction
INTERSPEECH 2020
Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis
INTERSPEECH 2020
Spoken Content and Voice Factorization for Few-Shot Speaker Adaptation
INTERSPEECH 2020
Gated Recurrent Fusion of Spatial and Spectral Features for Multi-Channel Speech Separation with Deep Embedding Representations
INTERSPEECH 2020
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition
INTERSPEECH 2020
Non-Autoregressive End-to-End TTS with Coarse-to-Fine Decoding
INTERSPEECH 2020
Bi-Level Speaker Supervision for One-Shot Speech Synthesis
INTERSPEECH 2020
Self-Attention Transducers for End-to-End Speech Recognition
INTERSPEECH 2019
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features
INTERSPEECH 2019
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting
INTERSPEECH 2019
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition
INTERSPEECH 2019
Distilling Knowledge from an Ensemble of Models for Punctuation Prediction
INTERSPEECH 2017