Zhifu Gao
12 papers · 2018–2025 · 3 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
🏃 Academic Marathon (7) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (3) 🐝 Cross-Pollinator (11)
🌍
Conference Polyglot
(3)
🏃
Academic Marathon
(7)
🏆
Keyword Champion
(2)
🧬
Topic Evolution
🗃️
Keyword Collector
(56)
💎
Century Club
(12)
🔥
Unstoppable
(8)
Conferences
INTERSPEECH (10)
AAAI (1)
ACL (1)
Top co-authors
Keywords
speech recognition
(3)
speaker verification
(3)
end-to-end speech recognition
(3)
automatic speech recognition
(3)
embedding learning
(3)
dilated convolution
(2)
large language model
(2)
end-to-end model
(2)
character error rate
(2)
continuous integrate-and-fire
(2)
discriminant analysis
(1)
mandarin speech
(1)
speech processing
(1)
weight sharing
(1)
speaker embedding
(1)
pretrained model
(1)
attention mechanism
(1)
parameter efficiency
(1)
cross-modal alignment
(1)
channel attention
(1)
Papers
Speech Recognition Meets Large Language Model: Benchmarking, Models, and Exploration
AAAI 2025
emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation
ACL 2024
MaLa-ASR: Multimedia-Assisted LLM-Based ASR
INTERSPEECH 2024
FunASR: A Fundamental End-to-End Speech Recognition Toolkit
INTERSPEECH 2023
Accurate and Reliable Confidence Estimation Based on Non-Autoregressive End-to-End Speech Recognition System
INTERSPEECH 2023
Paraformer: Fast and Accurate Parallel Transformer for Non-autoregressive End-to-End Speech Recognition
INTERSPEECH 2022
Extremely Low Footprint End-to-End ASR System for Smart Device
INTERSPEECH 2021
SAN-M: Memory Equipped Self-Attention for End-to-End Speech Recognition
INTERSPEECH 2020
Streaming Chunk-Aware Multihead Attention for Online End-to-End Speech Recognition
INTERSPEECH 2020
Improving Aggregation and Loss Function for Better Embedding Learning in End-to-End Speaker Verification System
INTERSPEECH 2019
An Effective Deep Embedding Learning Architecture for Speaker Verification
INTERSPEECH 2019
An Improved Deep Embedding Learning Method for Short Duration Speaker Verification
INTERSPEECH 2018