Yashesh Gaur

18 papers · 2016–2024 · 1 conference · across top CS/AI conferences

Achievements

+10 more ↓

🧭 Keyword Pioneer 🗺️ Taxonomy Completionist (12) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐣 Hot Topic Early Bird

🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (12) 🤝 Dynamic Duo (11) 🔬 Deep Specialist (11) 💎 Century Club (18) 🚀 Conference Pioneer 🔥 Unstoppable (6) ⚡ Prolific Year (6) 🗃️ Keyword Collector (66) 📈 Trend Setter

Conferences

INTERSPEECH (18)

Top co-authors

Jinyu Li (11) Zhong Meng (9) Naoyuki Kanda (7) Zhuo Chen (6) Yu Wu (6) Takuya Yoshioka (6) Xiaofei Wang (6) Yifan Gong (4) Jian Wu (3) Shujie LIU (3)

Keywords

automatic speech recognition (9) end-to-end model (4) word error rate (3) transformer transducer (3) end-to-end speech recognition (3) speaker counting (3) speech recognition (3) speech translation (3) speaker identification (3) attention-based encoder-decoder (2) serialized output training (2) multi-task learning (2) multimodal learning (2) overlapped speech recognition (2) federated learning (2) speaker diarization (2) neural transducer (2) language model (2) connectionist temporal classification (1) kullback-leibler divergence (1)

Papers

Speech ReaLLM – Real-time Speech Recognition with Multimodal Language Models by Teaching the Flow of Time INTERSPEECH 2024 COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning INTERSPEECH 2024 LAMASSU: A Streaming Language-Agnostic Multilingual Speech Recognition and Translation Model Using Neural Transducers INTERSPEECH 2023 Streaming Speaker-Attributed ASR with Token-Level Speaker Embeddings INTERSPEECH 2022 Streaming Multi-Talker ASR with Token-Level Serialized Output Training INTERSPEECH 2022 Large-Scale Streaming End-to-End Speech Translation with Neural Transducers INTERSPEECH 2022 Internal Language Model Adaptation with Text-Only Data for End-to-End Speech Recognition INTERSPEECH 2022 Large-Scale Pre-Training of End-to-End Multi-Talker ASR for Meeting Transcription with Single Distant Microphone INTERSPEECH 2021 End-to-End Speaker-Attributed ASR with Transformer INTERSPEECH 2021 Sequence-Level Self-Learning with Multiple Hypotheses INTERSPEECH 2020 A Federated Approach in Training Acoustic Models INTERSPEECH 2020 Joint Speaker Counting, Speech Recognition, and Speaker Identification for Overlapped Speech of any Number of Speakers INTERSPEECH 2020 On the Comparison of Popular End-to-End Models for Large Scale Speech Recognition INTERSPEECH 2020 Serialized Output Training for End-to-End Overlapped Speech Recognition INTERSPEECH 2020 Combination of End-to-End and Hybrid Models for Speech Recognition INTERSPEECH 2020 Speaker Adaptation for Attention-Based End-to-End Speech Recognition INTERSPEECH 2019 Acoustic-to-Phrase Models for Speech Recognition INTERSPEECH 2019 Manipulating Word Lattices to Incorporate Human Corrections INTERSPEECH 2016