Chitta Baral
99 papers · 2009–2026 · 14 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+15 more ↓ Show less ↑
🌍 Conference Polyglot (14) 🏃 Academic Marathon (17) 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🐝 Cross-Pollinator (12)
🧭
Keyword Pioneer
🐝
Cross-Pollinator
(12)
🏃
Academic Marathon
(17)
🏠
Conference Loyalist
(25)
🤝
Dynamic Duo
(24)
🔬
Deep Specialist
(20)
🧬
Topic Evolution
🏆
Keyword Champion
(6)
❓
The Questioner
(7)
📈
Trend Setter
🗃️
Keyword Collector
(380)
⚡
Prolific Year
(8)
🔥
Unstoppable
(8)
💎
Century Club
(99)
🚀
Conference Pioneer
Conferences
ACL (30)
EMNLP (25)
NAACL (15)
AAAI (5)
EACL (5)
IJCNLP (5)
CVPR (2)
ICCV (2)
ICLR (2)
IJCAI (2)
NIPS (2)
WACV (2)
AACL (1)
ECCV (1)
Top co-authors
Keywords
large language model
(14)
question answering
(14)
visual question answering
(8)
multimodal learning
(7)
vision-language model
(7)
language model
(6)
logical reasoning
(6)
few-shot learning
(6)
benchmark evaluation
(5)
natural language processing
(5)
zero-shot learning
(5)
data augmentation
(5)
multi-task learning
(5)
knowledge distillation
(5)
natural language inference
(5)
information retrieval
(4)
self-supervised learning
(4)
instruction tuning
(4)
adversarial robustness
(4)
diffusion model
(4)
Papers
The Perceptual Observatory Characterizing Robustness and Grounding in MLLMs
WACV 2026
PlanGEN: A Multi-Agent Framework for Generating Planning and Reasoning Trajectories for Complex Problem Solving
EMNLP 2025
AcT2I: Evaluating and Improving Action Depiction in Text-to-Image Models
EMNLP 2025
UnSeenTimeQA: Time-Sensitive Question-Answering Beyond LLMs’ Memorization
ACL 2025
Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective
AACL 2025
ToW: Thoughts of Words Improve Reasoning in Large Language Models
NAACL 2025
GETReason: Enhancing Image Context Extraction through Hierarchical Multi-Agent Reasoning
ACL 2025
Map&Make: Schema Guided Text to Table Generation
ACL 2025
Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation
NAACL 2025
Investigating the Shortcomings of LLMs in Step-by-Step Legal Reasoning
NAACL 2025
Insights into Alignment: Evaluating DPO and its Variants Across Multiple Tasks
ACL 2025
Hypothesis Generation for Materials Discovery and Design Using Goal-Driven and Constraint-Guided LLM Agents
NAACL 2025
Rethinking Information Synthesis in Multimodal Question Answering A Multi-Agent Perspective
IJCNLP 2025
VOILA: Evaluation of MLLMs For Perceptual Understanding and Analogical Reasoning
ICLR 2025
ActionReasoningBench: Reasoning about Actions with and without Ramification Constraints
ICLR 2025
RefEdit: A Benchmark and Method for Improving Instruction-based Image Editing Model on Referring Expressions
ICCV 2025
How Can Input Reformulation Improve Tool Usage Accuracy in a Complex Dynamic Environment? A Study on tau-bench
EMNLP 2025
QA‐LIGN: Aligning LLMs through Constitutionally Decomposed QA
EMNLP 2025
PLAN-TUNING: Post-Training Language Models to Learn Step-by-Step Planning for Complex Problem Solving
EMNLP 2025
ThinkTuning: Instilling Cognitive Reflections without Distillation
EMNLP 2025
Chaos with Keywords: Exposing Large Language Models Sycophancy to Misleading Keywords and Evaluating Defense Strategies
ACL 2024
ConceptBed: Evaluating Concept Learning Abilities of Text-to-Image Diffusion Models
AAAI 2024
EDM3: Event Detection as Multi-task Text Generation
NAACL 2024
Investigating Acceleration of LLaMA Inference by Enabling Intermediate Layer Decoding via Instruction Tuning with ‘LITE’
NAACL 2024
InstructABSA: Instruction Learning for Aspect Based Sentiment Analysis
NAACL 2024
Lost in Translation? Translation Errors and Challenges for Fair Assessment of Text-to-Image Models on Multilingual Concepts
NAACL 2024
Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models
EMNLP 2024
Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter?
EMNLP 2024
On the Robustness of Language Guidance for Low-Level Vision Tasks: Findings from Depth Estimation
CVPR 2024
ECLIPSE: A Resource-Efficient Text-to-Image Prior for Image Generations
CVPR 2024
TripletCLIP: Improving Compositional Reasoning of CLIP via Synthetic Vision-Language Negatives
NIPS 2024
LogicBench: Towards Systematic Evaluation of Logical Reasoning Ability of Large Language Models
ACL 2024
The Art of Defending: A Systematic Evaluation and Analysis of LLM Defense Strategies on Safety and Over-Defensiveness
ACL 2024
InstructExcel: A Benchmark for Natural Language Instruction in Excel
EMNLP 2023
Post-Abstention: Towards Reliably Re-Attempting the Abstained Instances in QA
ACL 2023
End-to-end Knowledge Retrieval with Multi-modal Queries
ACL 2023
A Study on the Efficiency and Generalization of Light Hybrid Retrievers
ACL 2023
A Unified Evaluation Framework for Novelty Detection and Accommodation in NLP with an Instantiation in Authorship Attribution
ACL 2023
“John is 50 years old, can his son be 65?” Evaluating NLP Models’ Understanding of Feasibility
EACL 2023
Don’t Blame the Annotator: Bias Already Starts in the Annotation Instructions
EACL 2023
Real-Time Visual Feedback to Guide Benchmark Creation: A Human-and-Metric-in-the-Loop Workflow
EACL 2023
How Many Data Samples is an Additional Instruction Worth?
EACL 2023
LogicAttack: Adversarial Attacks for Evaluating Logical Consistency of Natural Language Inference
EMNLP 2023
Improving Diversity With Adversarially Learned Transformations for Domain Generalization
WACV 2023
Semantically Distributed Robust Optimization for Vision-and-Language Inference
ACL 2022
Reframing Instructional Prompts to GPTk’s Language
ACL 2022
To Find Waldo You Need Contextual Cues: Debiasing Who’s Waldo
ACL 2022
Choose Your QA Model Wisely: A Systematic Study of Generative and Extractive Readers for Question Answering
ACL 2022
Let the Model Decide its Curriculum for Multitask Learning
NAACL 2022
In-BoXBART: Get Instructions into Biomedical Multi-Task Learning
NAACL 2022
A Simple Approach to Jointly Rank Passages and Select Relevant Sentences in the OBQA Context
NAACL 2022
ILDAE: Instance-Level Difficulty Analysis of Evaluation Data
ACL 2022
Improving Biomedical Information Retrieval with Neural Retrievers
AAAI 2022
Towards Improving Selective Prediction Ability of NLP Systems
ACL 2022
Less is More: Summary of Long Instructions is Better for Program Synthesis
EMNLP 2022
Is a Question Decomposition Unit All We Need?
EMNLP 2022
LILA: A Unified Benchmark for Mathematical Reasoning
EMNLP 2022
CRIPP-VQA: Counterfactual Reasoning about Implicit Physical Properties via Video Question Answering
EMNLP 2022
Model Cascading: Towards Jointly Improving Efficiency and Accuracy of NLP Systems
EMNLP 2022
Learning Action-Effect Dynamics for Hypothetical Vision-Language Reasoning Task
EMNLP 2022
Lexi: Self-Supervised Learning of the UI Language
EMNLP 2022
Cross-Task Generalization via Natural Language Crowdsourcing Instructions
ACL 2022
NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning Tasks
ACL 2022
Generalized but not Robust? Comparing the Effects of Data Modification Methods on Out-of-Domain Generalization and Adversarial Robustness
ACL 2022
Unsupervised Natural Language Inference Using PHL Triplet Generation
ACL 2022
Investigating Selective Prediction Approaches Across Several Tasks in IID, OOD, and Adversarial Settings
ACL 2022
Attribute-Guided Adversarial Training for Robustness to Natural Perturbations
AAAI 2021
Weakly-Supervised Visual-Retriever-Reader for Knowledge-based Question Answering
EMNLP 2021
Investigating Numeracy Learning Ability of a Text-to-Text Transfer Model
EMNLP 2021
Weakly Supervised Relative Spatial Reasoning for Visual Question Answering
ICCV 2021
Constructing Flow Graphs from Procedural Cybersecurity Texts
ACL 2021
WeaQA: Weak Supervision via Captions for Visual Question Answering
ACL 2021
Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction
IJCNLP 2021
WeaQA: Weak Supervision via Captions for Visual Question Answering
IJCNLP 2021
Constructing Flow Graphs from Procedural Cybersecurity Texts
IJCNLP 2021
Unsupervised Pronoun Resolution via Masked Noun-Phrase Prediction
ACL 2021
Self-Supervised Test-Time Learning for Reading Comprehension
NAACL 2021
CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images
NAACL 2021
‘Just because you are right, doesn’t mean I am wrong’: Overcoming a bottleneck in development and evaluation of Open-Ended VQA tasks
EACL 2021
MUTANT: A Training Paradigm for Out-of-Distribution Generalization in Visual Question Answering
EMNLP 2020
Deeply Embedded Knowledge Representation & Reasoning For Natural Language Question Answering: A Practitioner’s Perspective
EMNLP 2020
Enhancing Natural Language Inference Using New and Expanded Training Data Sets and New Learning Models
AAAI 2020
Language-Conditioned Imitation Learning for Robot Manipulation Tasks
NIPS 2020
Self-Supervised Knowledge Triplet Learning for Zero-Shot Question Answering
EMNLP 2020
Video2Commonsense: Generating Commonsense Descriptions to Enrich Video Captioning
EMNLP 2020
VQA-LOL: Visual Question Answering under the Lens of Logic
ECCV 2020
Visuo-Linguistic Question Answering (VLQA) Challenge
EMNLP 2020
Declarative Question Answering over Knowledge Bases Containing Natural Language Text with Answer Set Programming
AAAI 2019
Careful Selection of Knowledge to Solve Open Book Question Answering
ACL 2019
Combining Knowledge Hunting and Neural Language Models to Solve the Winograd Schema Challenge
ACL 2019
Identification of Adverse Drug Reaction Mentions in Tweets – SMM4H Shared Task 2019
ACL 2019
Integrating Knowledge and Reasoning in Image Understanding
IJCAI 2019
Learning To Use Formulas To Solve Simple Arithmetic Problems
ACL 2016
The NL2KR Platform for building Natural Language Translation Systems
IJCNLP 2015
Learning to Automatically Solve Logic Grid Puzzles
EMNLP 2015
Recognizing Social Constructs from Textual Conversation
NAACL 2015
The NL2KR Platform for building Natural Language Translation Systems
ACL 2015
Towards Addressing the Winograd Schema Challenge — Building and Using a Semantic Parser and a Knowledge Hunting Module
IJCAI 2015
Towards Effective Sentence Simplification for Automatic Processing of Biomedical Text
NAACL 2009