Yifan Gong

47 papers · 2016–2026 · 12 top CS/AI conferences

Achievements

🗺️ Taxonomy Completionist (22) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (5) 🐣 Hot Topic Early Bird 🏠 Conference Loyalist (26) 🤝 Dynamic Duo (20) 👑 Triple Crown 🧬 Topic Evolution 🏆 Grand Slam 🔬 Deep Specialist (13) 🏆 Keyword Champion (2) ❓ The Questioner 🚀 Conference Pioneer ⚡ Prolific Year (8) 🔥 Unstoppable (10) 🗃️ Keyword Collector (75) 💎 Century Club (46) 📈 Trend Setter

Conferences

INTERSPEECH (26) NIPS (6) ICLR (3) ACL (2) ECCV (2) ICML (2) AAAI (1) EMNLP (1) ICCV (1) IJCAI (1) NAACL (1) WACV (1)

Top co-authors

Jinyu Li (20) Yanzhi Wang (17) Zheng Zhan (13) Zhong Meng (10) Pu Zhao (9) Xue Lin (8) Wei Niu (8) Yushu Wu (7) Zhenglun Kong (7) Geng Yuan (7)

Keywords

speech recognition (9) word error rate (9) automatic speech recognition (9) recurrent neural network transducer (6) end-to-end speech recognition (6) model compression (5) acoustic model (4) deep neural network (4) long short-term memory (4) senone classification (3) domain adaptation (3) transformer transducer (3) adversarial learning (3) feature mapping (3) speaker adaptation (3) speech enhancement (3) semi-supervised learning (3) end-to-end model (3) generative model (2) image classification (2)

Papers

Influence-based Online Experience Selection for Effective RLHF · ACL 2026
LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers · AAAI 2025
Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation · ACL 2025
FairSMOE: Mitigating Multi-Attribute Fairness Problem with Sparse Mixture-of-Experts · IJCAI 2025
Can Adversarial Examples Be Parsed to Reveal Victim Model Information? · WACV 2025
Sparse Learning for State Space Models on Mobile · ICLR 2025
Value FULCRA: Mapping Large Language Models to the Multidimensional Spectrum of Basic Human Value · NAACL 2024
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference · NIPS 2024
Exploring Token Pruning in Vision State Space Models · NIPS 2024
Search for Efficient Large Language Models · NIPS 2024
Efficient Training with Denoised Neural Weights · ECCV 2024
Rethinking Token Reduction for State Space Models · EMNLP 2024
E$^2$GAN: Efficient Training of Efficient GANs for Image-to-Image Translation · ICML 2024
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription · INTERSPEECH 2024
HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception · NIPS 2023
DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning · ICML 2023
Self-Ensemble Protection: Training Checkpoints Are Good Data Protectors · ICLR 2023
Reverse Engineering of Imperceptible Adversarial Image Perturbations · ICLR 2022
Compiler-Aware Neural Architecture Search for On-Mobile Real-Time Super-Resolution · ECCV 2022
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition · INTERSPEECH 2022
SparCL: Sparse Continual Learning on the Edge · NIPS 2022
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions · INTERSPEECH 2021
Achieving On-Mobile Real-Time Super-Resolution With Neural Architecture and Pruning Search · ICCV 2021
Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS · INTERSPEECH 2021
Rapid Speaker Adaptation for Conformer Transducer: Attention and Bias Are All You Need · INTERSPEECH 2021
Multiple Softmax Architecture for Streaming Multilingual End-to-End ASR Systems · INTERSPEECH 2021
Streaming Multi-Talker Speech Recognition with Joint Speaker Identification · INTERSPEECH 2021
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition · INTERSPEECH 2021
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer · INTERSPEECH 2021
MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge · NIPS 2021
Bandpass Noise Generation and Augmentation for Unified ASR · INTERSPEECH 2020
Combination of End-to-End and Hybrid Models for Speech Recognition · INTERSPEECH 2020
1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM · INTERSPEECH 2020
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability · INTERSPEECH 2020
Exploring Transformers for Large-Scale Speech Recognition · INTERSPEECH 2020
Rapid RNN-T Adaptation Using Personalized Speech Synthesis and Neural Language Generator · INTERSPEECH 2020
Acoustic-to-Phrase Models for Speech Recognition · INTERSPEECH 2019
Self-Teaching Networks · INTERSPEECH 2019
Speaker Adaptation for Attention-Based End-to-End Speech Recognition · INTERSPEECH 2019
Layer Trajectory BLSTM · INTERSPEECH 2019
Cycle-Consistent Speech Enhancement · INTERSPEECH 2018
Layer Trajectory LSTM · INTERSPEECH 2018
Adversarial Feature-Mapping for Speech Enhancement · INTERSPEECH 2018
Large-Scale Domain Adaptation via Teacher-Student Learning · INTERSPEECH 2017
Don’t Count on ASR to Transcribe for You: Breaking Bias with Two Crowds · INTERSPEECH 2017
Improving Mask Learning Based Speech Enhancement System with Restoration Layers and Residual Connection · INTERSPEECH 2017
Semi-Supervised Training in Deep Learning Acoustic Model · INTERSPEECH 2016