Utkarsh Tyagi
18 papers · 2023–2026 · 6 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+7 more ↓ Show less ↑
🌍 Conference Polyglot (6) 🧭 Keyword Pioneer 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (14)
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(14)
🤝
Dynamic Duo
(17)
💎
Century Club
(17)
⚡
Prolific Year
(7)
🗃️
Keyword Collector
(74)
❓
The Questioner
Conferences
EMNLP (5)
ACL (4)
ICLR (3)
NAACL (3)
INTERSPEECH (2)
ICCV (1)
Top co-authors
Keywords
data augmentation
(5)
multimodal learning
(3)
large language model
(3)
benchmark evaluation
(2)
low-resource setting
(2)
automatic speech recognition
(2)
visual cue
(2)
speech enhancement
(2)
abstract meaning representation
(1)
conditional generation
(1)
constrained generation
(1)
video understanding
(1)
speaker verification
(1)
natural language understanding
(1)
text classification
(1)
speech analysis
(1)
text generation
(1)
multimodal interaction
(1)
low-resource learning
(1)
named entity recognition
(1)
Papers
Audio MultiChallenge: A Multi-Turn Evaluation of Spoken Dialogue Systems on Natural Human Interaction
ACL 2026
EGOILLUSION: Benchmarking Hallucinations in Egocentric Video Understanding
EMNLP 2025
MULTIVOX: A Benchmark for Evaluating Voice Assistants for Multimodal Interactions
EMNLP 2025
MMAU: A Massive Multi-Task Audio Understanding and Reasoning Benchmark
ICLR 2025
Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs
ICLR 2025
ProSE: Diffusion Priors for Speech Enhancement
NAACL 2025
CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
ICLR 2024
ABEX: Data Augmentation for Low-Resource NLU via Expanding Abstract Descriptions
ACL 2024
ASPIRE: Language-Guided Data Augmentation for Improving Robustness Against Spurious Correlations
ACL 2024
GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities
EMNLP 2024
LipGER: Visually-Conditioned Generative Error Correction for Robust Automatic Speech Recognition
INTERSPEECH 2024
Do Vision-Language Models Understand Compound Nouns?
NAACL 2024
CoDa: Constrained Generation based Data Augmentation for Low-Resource NLP
NAACL 2024
CoSyn: Detecting Implicit Hate Speech in Online Conversations Using a Context Synergized Hyperbolic Network
EMNLP 2023
DALE: Generative Data Augmentation for Low-Resource Legal NLP
EMNLP 2023
ACLM: A Selective-Denoising based Generative Data Augmentation Approach for Low-Resource Complex NER
ACL 2023
AdVerb: Visually Guided Audio Dereverberation
ICCV 2023
MMER: Multimodal Multi-task Learning for Speech Emotion Recognition
INTERSPEECH 2023