Karan Dua
5 papers · 2025–2026 · 2 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (2)
Renaissance Researcher (6)
Interdisciplinary Bridge
Taxonomy Completionist (21)
Keyword Pioneer
Cross-Pollinator (15)
Conferences
EMNLP (3)
ACL (2)
Keywords
large language model (2)
synthetic data generation (2)
multimodal large language model (2)
multimodal learning (2)
document understanding (1)
visual reasoning (1)
speech synthesis (1)
multilingual corpus (1)
vision-language model (1)
visual context (1)
multilingual data (1)
evaluation benchmark (1)
multimodal reasoning (1)
multimodal model (1)
visual understanding (1)
language identification (1)
text-to-speech synthesis (1)
human annotation (1)
model evaluation (1)
data augmentation (1)
Papers
CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data
ACL 2026
SpeechWeave: Diverse Multilingual Synthetic Text & Audio Data Generation Pipeline for Training Text to Speech Models
ACL 2025
RCI: A Score for Evaluating Global and Local Reasoning in Multimodal Benchmarks
EMNLP 2025
PCRI: Measuring Context Robustness in Multimodal Models for Enterprise Applications
EMNLP 2025
FlexDoc: Parameterized Sampling for Diverse Multilingual Synthetic Documents for Training Document Understanding Models
EMNLP 2025