Zhifu Gao

12 papers · 2018–2025 · 3 top CS/AI conferences

Achievements

🏃 Academic Marathon (7) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (3) 🐝 Cross-Pollinator (11) 🏆 Keyword Champion (2) 🧬 Topic Evolution 🗃️ Keyword Collector (56) 💎 Century Club (12) 🔥 Unstoppable (8)

Conferences

INTERSPEECH (10) AAAI (1) ACL (1)

Top co-authors

Shiliang Zhang (9) Ian McLoughlin (6) Ziyang Ma (3) Haoneng Luo (3) Yan Song (3) Zhijie Yan (3) Xie Chen (3) Ming Lei (3) Guanrou Yang (2) Fan Yu (2)

Keywords

speech recognition (3) speaker verification (3) end-to-end speech recognition (3) automatic speech recognition (3) embedding learning (3) dilated convolution (2) large language model (2) end-to-end model (2) character error rate (2) continuous integrate-and-fire (2) discriminant analysis (1) Mandarin speech (1) speech processing (1) weight sharing (1) speaker embedding (1) pretrained model (1) attention mechanism (1) parameter efficiency (1) cross-modal alignment (1) channel attention (1)

Papers

Speech Recognition Meets Large Language Model: Benchmarking, Models, and Exploration · AAAI 2025
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation · ACL 2024
MaLa-ASR: Multimedia-Assisted LLM-Based ASR · INTERSPEECH 2024
FunASR: A Fundamental End-to-End Speech Recognition Toolkit · INTERSPEECH 2023
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System · INTERSPEECH 2023
Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition · INTERSPEECH 2022
Extremely Low Footprint End-to-End ASR System for Smart Device · INTERSPEECH 2021
SAN-M: Memory Equipped Self-Attention for End-to-End Speech Recognition · INTERSPEECH 2020
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition · INTERSPEECH 2020
Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System · INTERSPEECH 2019
An Effective Deep Embedding Learning Architecture for Speaker Verification · INTERSPEECH 2019
An Improved Deep Embedding Learning Method for Short Duration Speaker Verification · INTERSPEECH 2018