conftrace_

Jagadeesh Balam

14 papers · 2021–2025 · 4 conferences · across top CS/AI conferences

Achievements

Jump to papers ↓

+6 more ↓

🐝 Cross-Pollinator (14) 🌉 Interdisciplinary Bridge 🧭 Keyword Pioneer 🌍 Conference Polyglot (4) 🌈 Renaissance Researcher (5)

🌍 Conference Polyglot (4) 🤝 Dynamic Duo (14) 🔥 Unstoppable (5) 💎 Century Club (14) ⚡ Prolific Year (5) 🗃️ Keyword Collector (65)

Conferences

INTERSPEECH (9) ACL (2) NAACL (2) ICML (1)

Top co-authors

Boris Ginsburg (14) Nithin Rao Koluguri (6) Zhehuai Chen (5) Somshubra Majumdar (4) Kunal Dhawan (4) He Huang (4) Vitaly Lavrukhin (3) Piotr Żelasko (3) Vahid Noroozi (3) Krishna C Puvvada (3)

Keywords

automatic speech recognition (5) large language model (5) speaker verification (2) synthetic data generation (2) speech translation (2) speech recognition (2) speaker diarization (2) multimodal learning (2) end-to-end model (2) transfer learning (2) conversational ai (1) intent classification (1) speech enhancement (1) self-supervised learning (1) cross-modal learning (1) code generation (1) natural language inference (1) speech dereverberation (1) instruction tuning (1) speaker recognition (1)

Papers

Genetic Instruct: Scaling up Synthetic Generation of Coding Instructions for Large Language Models ACL 2025 NeKo: Cross-Modality Post-Recognition Error Correction with Tasks-Guided Mixture-of-Experts Language Model ACL 2025 Sortformer: A Novel Approach for Permutation-Resolved Speaker Supervision in Speech-to-Text Systems ICML 2025 Anticipating Future with Large Language Model for Simultaneous Machine Translation NAACL 2025 VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning NAACL 2025 Instruction Data Generation and Unsupervised Adaptation for Speech Language Models INTERSPEECH 2024 Schrödinger Bridge for Generative Speech Enhancement INTERSPEECH 2024 Codec-ASR: Training Performant Automatic Speech Recognition Systems with Discrete Speech Representations INTERSPEECH 2024 Less is More: Accurate Speech Recognition & Translation without Web-Scale Data INTERSPEECH 2024 Leveraging Pretrained ASR Encoders for Effective and Efficient End-to-End Speech Intent Classification and Slot Filling INTERSPEECH 2023 A Compact End-to-End Model with Local and Global Context for Spoken Language Identification INTERSPEECH 2023 NeMo Open Source Speaker Diarization System INTERSPEECH 2022 Multi-scale Speaker Diarization with Dynamic Scale Weighting INTERSPEECH 2022 SPGISpeech: 5,000 Hours of Transcribed Financial Audio for Fully Formatted End-to-End Speech Recognition INTERSPEECH 2021