Yifan Gong
47 papers · 2016–2026 · 12 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+17 more ↓ Show less ↑
πΊοΈ Taxonomy Completionist (22) π§ Keyword Pioneer π Interdisciplinary Bridge π Renaissance Researcher (5) π£ Hot Topic Early Bird
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(22)
π
Conference Loyalist
(26)
π€
Dynamic Duo
(20)
π
Triple Crown
π§¬
Topic Evolution
π
Grand Slam
π¬
Deep Specialist
(13)
π
Keyword Champion
(2)
β
The Questioner
π
Conference Pioneer
β‘
Prolific Year
(8)
π₯
Unstoppable
(10)
ποΈ
Keyword Collector
(75)
π
Century Club
(46)
π
Trend Setter
Conferences
INTERSPEECH (26)
NIPS (6)
ICLR (3)
ACL (2)
ECCV (2)
ICML (2)
AAAI (1)
EMNLP (1)
ICCV (1)
IJCAI (1)
NAACL (1)
WACV (1)
Top co-authors
Keywords
speech recognition
(9)
word error rate
(9)
automatic speech recognition
(9)
recurrent neural network transducer
(6)
end-to-end speech recognition
(6)
model compression
(5)
acoustic model
(4)
deep neural network
(4)
long short-term memory
(4)
senone classification
(3)
domain adaptation
(3)
transformer transducer
(3)
adversarial learning
(3)
feature mapping
(3)
speaker adaptation
(3)
speech enhancement
(3)
semi-supervised learning
(3)
end-to-end model
(3)
generative model
(2)
image classification
(2)
Papers
Influence-based Online Experience Selection for Effective RLHF
ACL 2026
LazyDiT: Lazy Learning for the Acceleration of Diffusion Transformers
AAAI 2025
Reasoning is All You Need for Video Generalization: A Counterfactual Benchmark with Sub-question Evaluation
ACL 2025
FairSMOE: Mitigating Multi-Attribute Fairness Problem with Sparse Mixture-of-Experts
IJCAI 2025
Can Adversarial Examples Be Parsed to Reveal Victim Model Information?
WACV 2025
Sparse Learning for State Space Models on Mobile
ICLR 2025
Value FULCRA: Mapping Large Language Models to the Multidimensional Spectrum of Basic Human Value
NAACL 2024
Fast and Memory-Efficient Video Diffusion Using Streamlined Inference
NIPS 2024
Exploring Token Pruning in Vision State Space Models
NIPS 2024
Search for Efficient Large Language Models
NIPS 2024
Efficient Training with Denoised Neural Weights
ECCV 2024
Rethinking Token Reduction for State Space Models
EMNLP 2024
E$^2$GAN: Efficient Training of Efficient GANs for Image-to-Image Translation
ICML 2024
NOTSOFAR-1 Challenge: New Datasets, Baseline, and Tasks for Distant Meeting Transcription
INTERSPEECH 2024
HotBEV: Hardware-oriented Transformer-based Multi-View 3D Detector for BEV Perception
NIPS 2023
DualHSIC: HSIC-Bottleneck and Alignment for Continual Learning
ICML 2023
Self-Ensemble Protection: Training Checkpoints Are Good Data Protectors
ICLR 2023
Reverse Engineering of Imperceptible Adversarial Image Perturbations
ICLR 2022
Compiler-Aware Neural Architecture Search for On-Mobile Real-Time Super-Resolution
ECCV 2022
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
INTERSPEECH 2022
SparCL: Sparse Continual Learning on the Edge
NIPS 2022
Improving Multilingual Transformer Transducer Models by Reducing Language Confusions
INTERSPEECH 2021
Achieving On-Mobile Real-Time Super-Resolution With Neural Architecture and Pruning Search
ICCV 2021
Improving RNN-T for Domain Scaling Using Semi-Supervised Training with Neural TTS
INTERSPEECH 2021
Rapid Speaker Adaptation for Conformer Transducer: Attention and Bias Are All You Need
INTERSPEECH 2021
Multiple Softmax Architecture for Streaming Multilingual End-to-End ASR Systems
INTERSPEECH 2021
Streaming Multi-Talker Speech Recognition with Joint Speaker Identification
INTERSPEECH 2021
Minimum Word Error Rate Training with Language Model Fusion for End-to-End Speech Recognition
INTERSPEECH 2021
On Minimum Word Error Rate Training of the Hybrid Autoregressive Transducer
INTERSPEECH 2021
MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge
NIPS 2021
Bandpass Noise Generation and Augmentation for Unified ASR
INTERSPEECH 2020
Combination of End-to-End and Hybrid Models for Speech Recognition
INTERSPEECH 2020
1-D Row-Convolution LSTM: Fast Streaming ASR at Accuracy Parity with LC-BLSTM
INTERSPEECH 2020
Developing RNN-T Models Surpassing High-Performance Hybrid Models with Customization Capability
INTERSPEECH 2020
Exploring Transformers for Large-Scale Speech Recognition
INTERSPEECH 2020
Rapid RNN-T Adaptation Using Personalized Speech Synthesis and Neural Language Generator
INTERSPEECH 2020
Acoustic-to-Phrase Models for Speech Recognition
INTERSPEECH 2019
Self-Teaching Networks
INTERSPEECH 2019
Speaker Adaptation for Attention-Based End-to-End Speech Recognition
INTERSPEECH 2019
Layer Trajectory BLSTM
INTERSPEECH 2019
Cycle-Consistent Speech Enhancement
INTERSPEECH 2018
Layer Trajectory LSTM
INTERSPEECH 2018
Adversarial Feature-Mapping for Speech Enhancement
INTERSPEECH 2018
Large-Scale Domain Adaptation via Teacher-Student Learning
INTERSPEECH 2017
Donβt Count on ASR to Transcribe for You: Breaking Bias with Two Crowds
INTERSPEECH 2017
Improving Mask Learning Based Speech Enhancement System with Restoration Layers and Residual Connection
INTERSPEECH 2017
Semi-Supervised Training in Deep Learning Acoustic Model
INTERSPEECH 2016