Ajay Divakaran

25 papers · 2013–2025 · 11 conferences · across top CS/AI conferences

Achievements

+13 more ↓

🌉 Interdisciplinary Bridge 🌈 Renaissance Researcher (6) 🏃 Academic Marathon (12) 🌍 Conference Polyglot (10) 🗺️ Taxonomy Completionist (46)

🌍 Conference Polyglot (10) 🏃 Academic Marathon (12) 🌉 Interdisciplinary Bridge 🔬 Deep Specialist (10) 🤝 Dynamic Duo (14) 🧬 Topic Evolution 📈 Trend Setter ❓ The Questioner (2) 💎 Century Club (24) 🚀 Conference Pioneer 🗃️ Keyword Collector (125) ⚡ Prolific Year (5) 🔥 Unstoppable (8)

Conferences

ICCV (5) IJCNLP (4) ACL (3) EMNLP (3) WACV (3) CVPR (2) AACL (1) EACL (1) ECCV (1) IJCAI (1) NAACL (1)

Top co-authors

Karan Sikka (14) Michael Cogswell (7) Pritish Sahu (6) Anirban Roy (4) Xiao Lin (4) Arijit Ray (3) Dimitra Vergyri (3) Anirudh Som (3) Giedrius Burachas (2) Stefan Lee (2)

Keywords

multimodal learning (5) visual question answering (4) zero-shot learning (3) large language model (3) answer consistency (2) graph neural network (2) deep multimodal classifier (2) intent detection (2) large vision-language model (2) vision-language model (2) content moderation (2) benchmark evaluation (2) question answering (2) multilingual nlp (2) data augmentation (2) chain-of-thought reasoning (2) text classification (2) question generation (2) dialogue system (2) weakly supervised learning (1)

Papers

Punching Bag vs. Punching Person: Motion Transferability in Videos ICCV 2025 MINDS: A Cross-Cultural Dialogue Corpus for Social Norm Classification and Adherence Detection AACL 2025 A Video is Worth 10000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval WACV 2025 MINDS: A Cross-Cultural Dialogue Corpus for Social Norm Classification and Adherence Detection IJCNLP 2025 DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback CVPR 2024 Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning ACL 2024 Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification EMNLP 2024 Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models NAACL 2024 BloomVQA: Assessing Hierarchical Multi-modal Comprehension ACL 2024 Multilingual Content Moderation: A Case Study on Reddit EACL 2023 TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models ICCV 2023 Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos CVPR 2023 Detecting Out-Of-Context Objects Using Graph Contextual Reasoning Network IJCAI 2022 Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark WACV 2022 Confidence Calibration for Domain Generalization Under Covariate Shift ICCV 2021 Comprehension Based Question Answering using Bloom’s Taxonomy ACL 2021 Comprehension Based Question Answering using Bloom’s Taxonomy IJCNLP 2021 Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation WACV 2020 Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment ICCV 2019 Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts IJCNLP 2019 Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts EMNLP 2019 Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation IJCNLP 2019 Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation EMNLP 2019 Zero-Shot Object Detection ECCV 2018 Dynamic Pooling for Complex Event Recognition ICCV 2013