Ajay Divakaran
25 papers · 2013–2025 · 11 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+13 more ↓ Show less ↑
π Interdisciplinary Bridge π Renaissance Researcher (6) π Academic Marathon (12) π Conference Polyglot (10) πΊοΈ Taxonomy Completionist (46)
π
Conference Polyglot
(10)
π
Academic Marathon
(12)
π
Interdisciplinary Bridge
π¬
Deep Specialist
(10)
π€
Dynamic Duo
(14)
π§¬
Topic Evolution
π
Trend Setter
β
The Questioner
(2)
π
Century Club
(24)
π
Conference Pioneer
ποΈ
Keyword Collector
(125)
β‘
Prolific Year
(5)
π₯
Unstoppable
(8)
Conferences
ICCV (5)
IJCNLP (4)
ACL (3)
EMNLP (3)
WACV (3)
CVPR (2)
AACL (1)
EACL (1)
ECCV (1)
IJCAI (1)
NAACL (1)
Top co-authors
Keywords
multimodal learning
(5)
visual question answering
(4)
zero-shot learning
(3)
large language model
(3)
answer consistency
(2)
graph neural network
(2)
deep multimodal classifier
(2)
intent detection
(2)
large vision-language model
(2)
vision-language model
(2)
content moderation
(2)
benchmark evaluation
(2)
question answering
(2)
multilingual nlp
(2)
data augmentation
(2)
chain-of-thought reasoning
(2)
text classification
(2)
question generation
(2)
dialogue system
(2)
weakly supervised learning
(1)
Papers
Punching Bag vs. Punching Person: Motion Transferability in Videos
ICCV 2025
MINDS: A Cross-Cultural Dialogue Corpus for Social Norm Classification and Adherence Detection
AACL 2025
A Video is Worth 10000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval
WACV 2025
MINDS: A Cross-Cultural Dialogue Corpus for Social Norm Classification and Adherence Detection
IJCNLP 2025
DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback
CVPR 2024
Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning
ACL 2024
Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification
EMNLP 2024
Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models
NAACL 2024
BloomVQA: Assessing Hierarchical Multi-modal Comprehension
ACL 2024
Multilingual Content Moderation: A Case Study on Reddit
EACL 2023
TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models
ICCV 2023
Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos
CVPR 2023
Detecting Out-Of-Context Objects Using Graph Contextual Reasoning Network
IJCAI 2022
Challenges in Procedural Multimodal Machine Comprehension: A Novel Way To Benchmark
WACV 2022
Confidence Calibration for Domain Generalization Under Covariate Shift
ICCV 2021
Comprehension Based Question Answering using Bloomβs Taxonomy
ACL 2021
Comprehension Based Question Answering using Bloomβs Taxonomy
IJCNLP 2021
Stacked Spatio-Temporal Graph Convolutional Networks for Action Segmentation
WACV 2020
Align2Ground: Weakly Supervised Phrase Grounding Guided by Image-Caption Alignment
ICCV 2019
Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts
IJCNLP 2019
Integrating Text and Image: Determining Multimodal Document Intent in Instagram Posts
EMNLP 2019
Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation
IJCNLP 2019
Sunny and Dark Outside?! Improving Answer Consistency in VQA through Entailed Question Generation
EMNLP 2019
Zero-Shot Object Detection
ECCV 2018
Dynamic Pooling for Complex Event Recognition
ICCV 2013