Jagadeesh Balam
14 papers · 2021–2025 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
🐝 Cross-Pollinator (14) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🌈 Renaissance Researcher (5)
🌍
Conference Polyglot
(4)
🤝
Dynamic Duo
(14)
🔥
Unstoppable
(5)
💎
Century Club
(14)
⚡
Prolific Year
(5)
🗃️
Keyword Collector
(65)
Conferences
INTERSPEECH (9)
ACL (2)
NAACL (2)
ICML (1)
Top co-authors
Keywords
automatic speech recognition
(5)
large language model
(5)
speaker verification
(2)
synthetic data generation
(2)
speech translation
(2)
speech recognition
(2)
speaker diarization
(2)
multimodal learning
(2)
end-to-end model
(2)
transfer learning
(2)
conversational ai
(1)
intent classification
(1)
speech enhancement
(1)
self-supervised learning
(1)
cross-modal learning
(1)
code generation
(1)
natural language inference
(1)
speech dereverberation
(1)
instruction tuning
(1)
speaker recognition
(1)
Papers
Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models
ACL 2025
NeKo: Cross-Modality Post-Recognition Error Correction with Tasks-Guided Mixture-of-Experts Language Model
ACL 2025
Sortformer: A Novel Approach for Permutation-Resolved Speaker Supervision in Speech-to-Text Systems
ICML 2025
Anticipating Future with Large Language Model for Simultaneous Machine Translation
NAACL 2025
VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning
NAACL 2025
Instruction Data Generation and Unsupervised Adaptation for Speech Language Models
INTERSPEECH 2024
Schrödinger Bridge for Generative Speech Enhancement
INTERSPEECH 2024
Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations
INTERSPEECH 2024
Less is More: Accurate Speech Recognition & Translation without Web-Scale Data
INTERSPEECH 2024
Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling
INTERSPEECH 2023
A Compact End-to-End Model with Local and Global Context for Spoken Language Identification
INTERSPEECH 2023
NeMo Open Source Speaker Diarization System
INTERSPEECH 2022
Multi-scale Speaker Diarization with Dynamic Scale Weighting
INTERSPEECH 2022
SPGISpeech: 5,000 Hours of Transcribed Financial Audio for Fully Formatted End-to-End Speech Recognition
INTERSPEECH 2021