Yashesh Gaur
18 papers · 2016–2024 · 1 conference · across top CS/AI conferences
Achievements
Jump to papers ↓+10 more ↓ Show less ↑
π§ Keyword Pioneer πΊοΈ Taxonomy Completionist (12) π Renaissance Researcher (5) π Interdisciplinary Bridge π£ Hot Topic Early Bird
π£
Hot Topic Early Bird
πΊοΈ
Taxonomy Completionist
(12)
π€
Dynamic Duo
(11)
π¬
Deep Specialist
(11)
π
Century Club
(18)
π
Conference Pioneer
π₯
Unstoppable
(6)
β‘
Prolific Year
(6)
ποΈ
Keyword Collector
(66)
π
Trend Setter
Conferences
INTERSPEECH (18)
Top co-authors
Keywords
automatic speech recognition
(9)
end-to-end model
(4)
word error rate
(3)
transformer transducer
(3)
end-to-end speech recognition
(3)
speaker counting
(3)
speech recognition
(3)
speech translation
(3)
speaker identification
(3)
attention-based encoder-decoder
(2)
serialized output training
(2)
multi-task learning
(2)
multimodal learning
(2)
overlapped speech recognition
(2)
federated learning
(2)
speaker diarization
(2)
neural transducer
(2)
language model
(2)
connectionist temporal classification
(1)
kullback-leibler divergence
(1)
Papers
Speech ReaLLM β Real-time Speech Recognition with Multimodal Language Models by Teaching the Flow of Time
INTERSPEECH 2024
COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning
INTERSPEECH 2024
LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers
INTERSPEECH 2023
Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings
INTERSPEECH 2022
Streaming Multi-Talker ASR with Token-Level Serialized Output Training
INTERSPEECH 2022
Large-Scale Streaming End-to-End Speech Translation with Neural Transducers
INTERSPEECH 2022
Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition
INTERSPEECH 2022
Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone
INTERSPEECH 2021
End-to-End Speaker-Attributed ASR with Transformer
INTERSPEECH 2021
Sequence-Level Self-Learning with Multiple Hypotheses
INTERSPEECH 2020
A Federated Approach in Training Acoustic Models
INTERSPEECH 2020
Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of any Number of Speakers
INTERSPEECH 2020
On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition
INTERSPEECH 2020
Serialized Output Training for End-to-End Overlapped Speech Recognition
INTERSPEECH 2020
Combination of End-to-End and Hybrid Models for Speech Recognition
INTERSPEECH 2020
Speaker Adaptation for Attention-Based End-to-End Speech Recognition
INTERSPEECH 2019
Acoustic-to-Phrase Models for Speech Recognition
INTERSPEECH 2019
Manipulating Word Lattices to Incorporate Human Corrections
INTERSPEECH 2016