Utkarsh Tyagi

18 papers · 2023–2026 · 6 conferences · across top CS/AI conferences

Achievements

🌍 Conference Polyglot (6) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (14)

🤝 Dynamic Duo (17) 💎 Century Club (17) ⚡ Prolific Year (7) 🗃️ Keyword Collector (74) ❓ The Questioner

Conferences

EMNLP (5) ACL (4) ICLR (3) NAACL (3) INTERSPEECH (2) ICCV (1)

Top co-authors

Sreyan Ghosh (17) Dinesh Manocha (17) Sonal Kumar (15) S Sakshi (7) Ashish Seth (6) Ramani Duraiswami (6) Chandra Kiran Reddy Evuru (5) Oriol Nieto (4) Ramaneswaran S (4) Chandra Kiran Evuru (3)

Keywords

data augmentation (5) multimodal learning (3) large language model (3) benchmark evaluation (2) low-resource setting (2) automatic speech recognition (2) visual cue (2) speech enhancement (2) abstract meaning representation (1) conditional generation (1) constrained generation (1) video understanding (1) speaker verification (1) natural language understanding (1) text classification (1) speech analysis (1) text generation (1) multimodal interaction (1) low-resource learning (1) named entity recognition (1)

Papers

Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction · ACL 2026
EGOILLUSION: Benchmarking Hallucinations in Egocentric Video Understanding · EMNLP 2025
MULTIVOX: A Benchmark for Evaluating Voice Assistants for Multimodal Interactions · EMNLP 2025
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark · ICLR 2025
Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs · ICLR 2025
ProSE: Diffusion Priors for Speech Enhancement · NAACL 2025
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models · ICLR 2024
ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions · ACL 2024
ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations · ACL 2024
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities · EMNLP 2024
LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition · INTERSPEECH 2024
Do Vision-Language Models Understand Compound Nouns? · NAACL 2024
CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP · NAACL 2024
CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network · EMNLP 2023
DALE: Generative Data Augmentation for Low-Resource Legal NLP · EMNLP 2023
ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER · ACL 2023
AdVerb: Visually Guided Audio Dereverberation · ICCV 2023
MMER: Multimodal Multi-task Learning for Speech Emotion Recognition · INTERSPEECH 2023