Mohit Bansal
313 papers · 2008–2026 · 16 conferences · across top CS/AI conferences
Achievements
Keyword Pioneer
Taxonomy Completionist (24)
Interdisciplinary Bridge
Renaissance Researcher (6)
Hot Topic Early Bird
Keyword Trendsetter Combo (7)
Conference Loyalist (21)
The Namer
Mega-Team (71)
Topic Pioneer
Deep Specialist (63)
Keyword Champion (9)
Triple Crown
Grand Slam
Dynamic Duo (26)
Trend Setter
Keyword Collector (55)
Prolific Year (42)
Century Club (305)
The Questioner (16)
Unstoppable (13)
Conference Pioneer
Conferences
EMNLP (68)
ACL (66)
NAACL (51)
NIPS (21)
ICLR (20)
CVPR (18)
IJCNLP (15)
EACL (13)
AAAI (12)
ICCV (6)
ICML (5)
COLING (4)
CONLL (4)
ECCV (4)
WACV (4)
IJCAI (2)
Research topics
Keywords
multimodal learning (34)
large language model (27)
text generation (18)
natural language inference (16)
abstractive summarization (15)
video understanding (15)
question answering (14)
data augmentation (14)
visual question answering (12)
reinforcement learning (12)
knowledge graph (11)
language model (11)
transfer learning (11)
video question answering (11)
multi-task learning (11)
video captioning (9)
attention mechanism (9)
few-shot learning (9)
commonsense reasoning (9)
image captioning (8)
Papers
GrAInS: Gradient-based Attribution for Inference-Time Steering of LLMs and VLMs
ACL 2026
TimeRefine: Temporal Grounding with Time Refining Video LLM
WACV 2026
PRInTS: Reward Modeling for Long-Horizon Information Seeking
ACL 2026
Routing with Generated Data: Annotation-Free LLM Skill Estimation and Expert Selection
ACL 2026
PrefixNLI: Detecting Factual Inconsistencies as Soon as They Arise
ACL 2026
DreamRunner: Fine-Grained Compositional Story-to-Video Generation with Retrieval-Augmented Motion Adaptation
AAAI 2026
Instruction Tuning with and without Context: Behavioral Shifts and Downstream Impact
EACL 2026
RotBench: Evaluating Multi-modal Large Language Models on Identifying Image Rotation
EACL 2026
DART: Leveraging Multi-Agent Disagreement for Tool Recruitment in Multimodal Reasoning
EACL 2026
Teaching Models to Balance Resisting and Accepting Persuasion
NAACL 2025
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level
CVPR 2025
SAME: Learning Generic Language-Guided Visual Navigation with State-Adaptive Mixture of Experts
ICCV 2025
VEGGIE: Instructional Editing and Reasoning Video Concepts with Grounded Generation
ICCV 2025
CAPTURE: Evaluating Spatial Reasoning in Vision Language Models via Occluded Object Counting
ICCV 2025
FLAMES: Improving LLM Math Reasoning via a Fine-Grained Analysis of the Data Synthesis Pipeline
EMNLP 2025
MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
EMNLP 2025
Video-Skill-CoT: Skill-based Chain-of-Thoughts for Domain-Adaptive Video Reasoning
EMNLP 2025
Language Models Identify Ambiguities and Exploit Loopholes
EMNLP 2025
MAgICoRe: Multi-Agent, Iterative, Coarse-to-Fine Refinement for Reasoning
EMNLP 2025
Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning
EMNLP 2025
RACCooN: Versatile Instructional Video Editing with Auto-Generated Narratives
EMNLP 2025
Glider: Global and Local Instruction-Driven Expert Router
EMNLP 2025
Multi-Attribute Steering of Language Models via Targeted Intervention
ACL 2025
CoKe: Customizable Fine-Grained Story Evaluation via Chain-of-Keyword Rationalization
ACL 2025
LAQuer: Localized Attribution Queries in Content-grounded Generation
ACL 2025
Self-Consistency Preference Optimization
ICML 2025
VideoTree: Adaptive Tree-based Video Representation for LLM Reasoning on Long Videos
CVPR 2025
DAM: Dynamic Adapter Merging for Continual Video QA Learning
WACV 2025
VEDIT: Latent Prediction Architecture For Procedural Video Representation Learning
ICLR 2025
Bootstrapping Language-Guided Navigation Learning with Self-Refining Data Flywheel
ICLR 2025
Unbounded: A Generative Infinite Game of Character Life Simulation
ICLR 2025
Adapt-$\infty$: Scalable Continual Multimodal Instruction Tuning via Dynamic Data Selection
ICLR 2025
SAFREE: Training-Free and Adaptive Guard for Safe Text-to-Image And Video Generation
ICLR 2025
CREMA: Generalizable and Efficient Video-Language Reasoning via Multimodal Modular Fusion
ICLR 2025
Ctrl-Adapter: An Efficient and Versatile Framework for Adapting Diverse Controls to Any Diffusion Model
ICLR 2025
Anyprefer: An Agentic Framework for Preference Data Synthesis
ICLR 2025
System 1.x: Learning to Balance Fast and Slow Planning with Language Models
ICLR 2025
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback
ICLR 2025
See It from My Perspective: How Language Affects Cultural Bias in Image Understanding
ICLR 2025
Improving Faithfulness of Text-to-Image Diffusion Models through Inference Intervention
WACV 2025
AdaCAD: Adaptively Decoding to Balance Conflicts between Contextual and Parametric Knowledge
NAACL 2025
MAMM-Refine: A Recipe for Improving Faithfulness in Generation with Multi-Agent Collaboration
NAACL 2025
On Positional Bias of Faithfulness for Long-form Summarization
NAACL 2025
Reverse Thinking Makes LLMs Stronger Reasoners
NAACL 2025
Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks
ICLR 2024
SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data
NIPS 2024
Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences
ACL 2024
The Unreasonable Effectiveness of Easy Training Data for Hard Tasks
ACL 2024
ReConcile: Round-Table Conference Improves Reasoning via Consensus among Diverse LLMs
ACL 2024
Inducing Systematicity in Transformers by Attending to Structurally Quantized Embeddings
ACL 2024
REFINESUMM: Self-Refining MLLM for Generating a Multimodal Summarization Dataset
ACL 2024
Evaluating Very Long-Term Conversational Memory of LLM Agents
ACL 2024
Soft Self-Consistency Improves Language Models Agents
ACL 2024
The Power of Summary-Source Alignments
ACL 2024
ACUEval: Fine-grained Hallucination Evaluation and Correction for Abstractive Summarization
ACL 2024
Unified Embeddings for Multimodal Retrieval via Frozen LLMs
EACL 2024
Multimodal Representation Learning by Alternating Unimodal Adaptation
CVPR 2024
Rethinking Interactive Image Segmentation with Low Latency High Quality and Diverse Prompts
CVPR 2024
CoDi-2: In-Context Interleaved and Interactive Any-to-Any Generation
CVPR 2024
LACIE: Listener-Aware Finetuning for Calibration in Large Language Models
NIPS 2024
ADaPT: As-Needed Decomposition and Planning with Language Models
NAACL 2024
Prompting Vision-Language Models For Aspect-Controlled Generation of Referring Expressions
NAACL 2024
VLN-Video: Utilizing Driving Videos for Outdoor Vision-and-Language Navigation
AAAI 2024
Branch-Solve-Merge Improves Large Language Model Evaluation and Generation
NAACL 2024
ReGAL: Refactoring Programs to Discover Generalizable Abstractions
ICML 2024
Position: TrustLLM: Trustworthiness in Large Language Models
ICML 2024
MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models
ICML 2024
Analyzing and Mitigating Object Hallucination in Large Vision-Language Models
ICLR 2024
ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models
ICLR 2024
GTBench: Uncovering the Strategic Reasoning Capabilities of LLMs via Game-Theoretic Evaluations
NIPS 2024
Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
ICLR 2024
$\mathbb{D}^2$ Pruning: Message Passing for Balancing Diversity & Difficulty in Data Pruning
ICLR 2024
Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy
ICLR 2024
Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation
ICLR 2024
Knowledge-Aware Reasoning over Multimodal Semi-structured Tables
EMNLP 2024
LLM Self-Correction with DeCRIM: Decompose, Critique, and Refine for Enhanced Following of Instructions with Multiple Constraints
EMNLP 2024
A Simple LLM Framework for Long-Range Video Question-Answering
EMNLP 2024
Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM
EMNLP 2024
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training
ECCV 2024
Hierarchical and Dynamic Prompt Compression for Efficient Zero-shot API Usage
EACL 2024
On Conditional and Compositional Language Model Differentiable Prompting
IJCAI 2023
Visual Programming for Step-by-Step Text-to-Image Generation and Evaluation
NIPS 2023
TIES-Merging: Resolving Interference When Merging Models
NIPS 2023
Any-to-Any Generation via Composable Diffusion
NIPS 2023
Does Localization Inform Editing? Surprising Differences in Causality-Based Localization vs. Knowledge Editing in Language Models
NIPS 2023
Paxion: Patching Action Knowledge in Video-Language Foundation Models
NIPS 2023
PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation
NIPS 2023
Adaptive Contextual Perception: How To Generalize To New Backgrounds and Ambiguous Objects
NIPS 2023
Can Language Models Teach? Teacher Explanations Improve Student Performance via Personalization
NIPS 2023
Self-Chained Image-Language Model for Video Localization and Question Answering
NIPS 2023
Revealing Single Frame Bias for Video-and-Language Learning
ACL 2023
Extractive is not Faithful: An Investigation of Broad Unfaithfulness Problems in Extractive Summarization
ACL 2023
Non-Sequential Graph Script Induction via Multimedia Grounding
ACL 2023
MixCE: Training Autoregressive Language Models by Mixing Forward and Reverse Cross-Entropies
ACL 2023
MeetingQA: Extractive Question-Answering on Meeting Transcripts
ACL 2023
Exploring Continual Learning for Code Generation Models
ACL 2023
Exclusive Supermask Subnetwork Training for Continual Learning
ACL 2023
Evaluating the Factual Consistency of Large Language Models Through News Summarization
ACL 2023
MURMUR: Modular Multi-Step Reasoning for Semi-Structured Data-to-Text Generation
ACL 2023
Can Sequence-to-Sequence Transformers Naturally Understand Sequential Instructions?
ACL 2023
Hierarchical Video-Moment Retrieval and Step-Captioning
CVPR 2023
Unifying Vision, Text, and Layout for Universal Document Processing
CVPR 2023
Vision Transformers Are Parameter-Efficient Audio-Visual Learners
CVPR 2023
VindLU: A Recipe for Effective Video-and-Language Pretraining
CVPR 2023
Improving Vision-and-Language Navigation by Generating Future-View Image Semantics
CVPR 2023
Enhancing Multi-Document Summarization with Cross-Document Graph-based Information Extraction
EACL 2023
Methods for Measuring, Updating, and Visualizing Factual Beliefs in Language Models
EACL 2023
Faithfulness-Aware Decoding Strategies for Abstractive Summarization
EACL 2023
DeepMaven: Deep Question Answering on Long-Distance Movie/TV Show Videos with Multimedia Knowledge Extraction and Synthesis
EACL 2023
Social Commonsense for Explanation and Cultural Bias Discovery
EACL 2023
GrIPS: Gradient-free, Edit-based Instruction Search for Prompting Large Language Models
EACL 2023
HistAlign: Improving Context Dependency in Language Generation by Aligning with History
EMNLP 2023
ReCEval: Evaluating Reasoning Chains via Correctness and Informativeness
EMNLP 2023
Generating Summaries with Controllable Readability Levels
EMNLP 2023
Data Factors for Better Compositional Generalization
EMNLP 2023
An Empirical Study of Multimodal Model Merging
EMNLP 2023
Debiasing Multimodal Models via Causal Information Minimization
EMNLP 2023
Unified Coarse-to-Fine Alignment for Video-Text Retrieval
ICCV 2023
Scaling Data Generation in Vision-and-Language Navigation
ICCV 2023
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models
ICCV 2023
Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees
ICLR 2023
Perceiver-VL: Efficient Vision-and-Language Modeling With Iterative Latent Attention
WACV 2023
Evaluating and Improving Factuality in Multimodal Abstractive Summarization
EMNLP 2022
ALFRED-L: Investigating the Role of Language for Action Learning in Interactive Visual Environments
EMNLP 2022
Are Hard Examples also Harder to Explain? A Study with Human and Model-Generated Explanations
EMNLP 2022
GraDA: Graph Generative Data Augmentation for Commonsense Reasoning
COLING 2022
CAISE: Conversational Agent for Image Search and Editing
AAAI 2022
GRAVL-BERT: Graphical Visual-Linguistic Representations for Multimodal Coreference Resolution
COLING 2022
MuMuQA: Multimedia Multi-Hop News Question Answering via Cross-Media Knowledge Extraction and Grounding
AAAI 2022
When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data
ACL 2022
GraDA: Graph Generative Data Augmentation for Commonsense Reasoning
NAACL 2022
ECLIPSE: Efficient Long-Range Video Retrieval Using Sight and Sound
ECCV 2022
StoryDALL-E: Adapting Pretrained Text-to-Image Transformers for Story Continuation
ECCV 2022
On the Limits of Evaluating Embodied Agent Model Generalization Using Validation Sets
ACL 2022
Efficient Few-Shot Fine-Tuning for Opinion Summarization
NAACL 2022
EnvEdit: Environment Editing for Vision-and-Language Navigation
CVPR 2022
VL-Adapter: Parameter-Efficient Transfer Learning for Vision-and-Language Tasks
CVPR 2022
CLEAR: Improving Vision-Language Navigation with Cross-Lingual, Environment-Agnostic Representations
NAACL 2022
Fine-grained Image Captioning with CLIP Reward
NAACL 2022
Multimodal Intent Discovery from Livestream Videos
NAACL 2022
SETSum: Summarization and Visualization of Student Evaluations of Teaching
NAACL 2022
RESIN-11: Schema-guided Event Prediction for 11 Newsworthy Scenarios
NAACL 2022
FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations
NAACL 2022
Enhancing Knowledge Selection for Grounded Dialogues via Document Semantic Graphs
NAACL 2022
Interactive Query-Assisted Summarization via Deep Reinforcement Learning
NAACL 2022
Proposition-Level Clustering for Multi-Document Summarization
NAACL 2022
Masked Part-Of-Speech Model: Does Modeling Long Context Help Unsupervised POS-tagging?
NAACL 2022
FactPEGASUS: Factuality-Aware Pre-training and Fine-tuning for Abstractive Summarization
NAACL 2022
On Curriculum Learning for Commonsense Reasoning
NAACL 2022
CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination
NAACL 2022
How Much Can CLIP Benefit Vision-and-Language Tasks?
ICLR 2022
Few-Shot Parameter-Efficient Fine-Tuning is Better and Cheaper than In-Context Learning
NIPS 2022
Language Models with Image Descriptors are Strong Few-Shot Video-Language Learners
NIPS 2022
TVLT: Textless Vision-Language Transformer
NIPS 2022
Explanation Graph Generation via Pre-trained Language Models: An Empirical Study with Contrastive Learning
ACL 2022
How can NLP Help Revitalize Endangered Languages? A Case Study and Roadmap for the Cherokee Language
ACL 2022
Distributed NLI: Learning to Predict Human Opinion Distributions for Language Reasoning
ACL 2022
LST: Ladder Side-Tuning for Parameter and Memory Efficient Transfer Learning
NIPS 2022
VisFIS: Visual Feature Importance Supervision with Right-for-the-Right-Reason Objectives
NIPS 2022
WinoGAViL: Gamified Association Benchmark to Challenge Vision-and-Language Models
NIPS 2022
Analyzing the Limits of Self-Supervision in Handling Bias in Language
EMNLP 2022
Mutual Exclusivity Training and Primitive Augmentation to Induce Compositionality
EMNLP 2022
Data Augmentation for Abstractive Query-Focused Multi-Document Summarization
AAAI 2021
FIXMYPOSE: Pose Correctional Captioning and Retrieval
AAAI 2021
Detecting Moments and Highlights in Videos via Natural Language Queries
NIPS 2021
Improving and Simplifying Pattern Exploiting Training
EMNLP 2021
Continual Few-Shot Learning for Text Classification
EMNLP 2021
Inducing Transformer's Compositional Generalization Ability via Auxiliary Sequence Prediction Tasks
EMNLP 2021
NDH-Full: Learning and Evaluating Navigational Agents on Full-Length Dialogue
EMNLP 2021
Finding a Balanced Degree of Automation for Summary Evaluation
EMNLP 2021
Integrating Visuospatial, Linguistic, and Commonsense Structure into Story Visualization
EMNLP 2021
ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning
EMNLP 2021
FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging
EMNLP 2021
iFacetSum: Coreference-based Interactive Faceted Summarization for Multi-Document Exploration
EMNLP 2021
Learning and Analyzing Generation Order for Undirected Sequence Models
EMNLP 2021
To what extent do human explanations of model behavior align with actual model behavior?
EMNLP 2021
Summary-Source Proposition-level Alignment: Task, Datasets and Supervised Baseline
EMNLP 2021
Disentangling Online Chats with DAG-structured LSTMs
ACL 2021
An Overview of Uncertainty Calibration for Text Classification and the Role of Distillation
ACL 2021
Analysis of Tree-Structured Architectures for Code Generation
ACL 2021
ChrEnTranslate: Cherokee-English Machine Translation Demo with Quality Estimation and Corrective Feedback
ACL 2021
mTVR: Multilingual Moment Retrieval in Videos
ACL 2021
EmailSum: Abstractive Email Thread Summarization
ACL 2021
Continuous Language Generative Flow
ACL 2021
I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling
ACL 2021
InfoSurgeon: Cross-Media Fine-grained Information Consistency Checking for Fake News Detection
ACL 2021
VidLanKD: Improving Language Understanding via Video-Distilled Knowledge Transfer
NIPS 2021
Unifying Vision-and-Language Tasks via Text Generation
ICML 2021
The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations
NIPS 2021
InfoSurgeon: Cross-Media Fine-grained Information Consistency Checking for Fake News Detection
IJCNLP 2021
I like fish, especially dolphins: Addressing Contradictions in Dialogue Modeling
IJCNLP 2021
Continuous Language Generative Flow
IJCNLP 2021
EmailSum: Abstractive Email Thread Summarization
IJCNLP 2021
mTVR: Multilingual Moment Retrieval in Videos
IJCNLP 2021
ChrEnTranslate: Cherokee-English Machine Translation Demo with Quality Estimation and Corrective Feedback
IJCNLP 2021
Analysis of Tree-Structured Architectures for Code Generation
IJCNLP 2021
An Overview of Uncertainty Calibration for Text Classification and the Role of Distillation
IJCNLP 2021
Disentangling Online Chats with DAG-structured LSTMs
IJCNLP 2021
Extending Multi-Document Summarization Evaluation to the Interactive Setting
NAACL 2021
Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information
NAACL 2021
DeCEMBERT: Learning from Noisy Instructional Videos via Dense Captions and Entropy Minimization
NAACL 2021
Improving Generation and Evaluation of Visual Stories via Semantic Consistency
NAACL 2021
multiPRover: Generating Multiple Proofs for Improved Interpretability in Rule Reasoning
NAACL 2021
Dynabench: Rethinking Benchmarking in NLP
NAACL 2021
Efficiently Summarizing Text and Graph Encodings of Multi-Document Clusters
NAACL 2021
Enriching Transformers with Structured Tensor-Product Representations for Abstractive Summarization
NAACL 2021
Robustness Gym: Unifying the NLP Evaluation Landscape
NAACL 2021
ERNIE-NLI: Analyzing the Impact of Domain-Specific External Knowledge on Enhanced Representations for NLI
NAACL 2021
The Effect of Pretraining on Extractive Summarization for Scientific Documents
NAACL 2021
GENE: Global Event Network Embedding
NAACL 2021
Summary-Source Proposition-level Alignment: Task, Datasets and Supervised Baseline
CONLL 2021
Less Is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
CVPR 2021
Identify, Align, and Integrate: Matching Knowledge Graphs to Commonsense Reasoning Tasks
EACL 2021
Hidden Biases in Unreliable News Detection Datasets
EACL 2021
TVQA+: Spatio-Temporal Grounding for Video Question Answering
ACL 2020
What is More Likely to Happen Next? Video-and-Language Future Event Prediction
EMNLP 2020
What Can We Learn from Collective Human Opinions on Natural Language Inference Data?
EMNLP 2020
FENAS: Flexible and Expressive Neural Architecture Search
EMNLP 2020
Towards Robustifying NLI Models Against Lexical Dataset Biases
ACL 2020
AvgOut: A Simple Output-Probability Measure to Eliminate Dull Responses
AAAI 2020
HoVer: A Dataset for Many-Hop Fact Extraction And Claim Verification
EMNLP 2020
Adversarial Augmentation Policy Search for Domain and Cross-Lingual Generalization in Reading Comprehension
EMNLP 2020
ManyModalQA: Modality Disambiguation and QA over Diverse Inputs
AAAI 2020
TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
ECCV 2020
Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?
EMNLP 2020
ArraMon: A Joint Navigation-Assembly Instruction Interpretation Task in Dynamic Environments
EMNLP 2020
Simple Compounded-Label Training for Fact Extraction and Verification
ACL 2020
Modality-Balanced Models for Visual Dialogue
AAAI 2020
PRover: Proof Generation for Interpretable Reasoning over Rules
EMNLP 2020
ChrEn: Cherokee-English Machine Translation for Endangered Language Revitalization
EMNLP 2020
Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision
EMNLP 2020
DORB: Dynamically Optimizing Multiple Rewards with Bandits
EMNLP 2020
The Curse of Performance Instability in Analysis Datasets: Consequences, Source, and Suggestions
EMNLP 2020
Diagnosing the Environment Bias in Vision-and-Language Navigation
IJCAI 2020
ConjNLI: Natural Language Inference Over Conjunctive Sentences
EMNLP 2020
Multi-Source Domain Adaptation for Text Classification via DistanceNet-Bandits
AAAI 2020
MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
ACL 2020
Dense-Caption Matching and Frame-Selection Gating for Temporal Localization in VideoQA
ACL 2020
Adversarial NLI: A New Benchmark for Natural Language Understanding
ACL 2020
Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior?
ACL 2020
Improving Visual Question Answering by Referring to Generated Paragraph Captions
ACL 2019
Avoiding Reasoning Shortcuts: Adversarial Evaluation, Training, and Model Development for Multi-Hop QA
ACL 2019
Explore, Propose, and Assemble: An Interpretable Model for Multi-Hop Reading Comprehension
ACL 2019
PaperRobot: Incremental Draft Generation of Scientific Ideas
ACL 2019
Continual and Multi-Task Architecture Search
ACL 2019
Expressing Visual Relationships via Language
ACL 2019
Automatically Learning Data Augmentation Policies for Dialogue Tasks
IJCNLP 2019
Addressing Semantic Drift in Question Generation for Semi-Supervised Question Answering
IJCNLP 2019
Revealing the Importance of Semantic Retrieval for Machine Reading at Scale
IJCNLP 2019
Self-Assembling Modular Networks for Interpretable Multi-Hop Reasoning
IJCNLP 2019
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
IJCNLP 2019
Proceedings of the 23rd Conference on Computational Natural Language Learning (CoNLL)
CONLL 2019
Crowdsourcing Lightweight Pyramids for Manual Summary Evaluation
NAACL 2019
Learning to Navigate Unseen Environments: Back Translation with Environmental Dropout
NAACL 2019
AutoSeM: Automatic Task Selection and Mixing in Multi-Task Learning
NAACL 2019
Multi-Target Embodied Question Answering
CVPR 2019
Analyzing Compositionality-Sensitivity of NLI Models
AAAI 2019
Combining Fact Extraction and Verification with Neural Semantic Matching Networks
AAAI 2019
Automatically Learning Data Augmentation Policies for Dialogue Tasks
EMNLP 2019
LXMERT: Learning Cross-Modality Encoder Representations from Transformers
EMNLP 2019
Self-Assembling Modular Networks for Interpretable Multi-Hop Reasoning
EMNLP 2019
Revealing the Importance of Semantic Retrieval for Machine Reading at Scale
EMNLP 2019
Addressing Semantic Drift in Question Generation for Semi-Supervised Question Answering
EMNLP 2019
Commonsense for Generative Multi-Hop Question Answering Tasks
EMNLP 2018
Soft Layer-Specific Multi-Task Summarization with Entailment and Question Generation
ACL 2018
Punny Captions: Witty Wordplay in Image Descriptions
NAACL 2018
Fast Abstractive Summarization with Reinforce-Selected Sentence Rewriting
ACL 2018
Detecting Linguistic Characteristics of Alzheimer's Dementia by Interpreting Neural Models
NAACL 2018
Multi-Reward Reinforced Summarization with Saliency and Entailment
NAACL 2018
Dynamic Multi-Level Multi-Task Learning for Sentence Simplification
COLING 2018
Robust Machine Comprehension Models via Adversarial Training
NAACL 2018
Object Ordering with Bidirectional Matchings for Visual Reasoning
NAACL 2018
Parsing Speech: a Neural Approach to Integrating Lexical and Acoustic-Prosodic Information
NAACL 2018
MAttNet: Modular Attention Network for Referring Expression Comprehension
CVPR 2018
Game-Based Video-Context Dialogue
EMNLP 2018
TVQA: Localized, Compositional Video Question Answering
EMNLP 2018
Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Tutorial Abstracts
NAACL 2018
Adversarial Over-Sensitivity and Over-Stability Strategies for Dialogue Models
CONLL 2018
SafeCity: Understanding Diverse Forms of Sexual Harassment Personal Stories
EMNLP 2018
Incorporating Background Knowledge into Video Description Generation
EMNLP 2018
Closed-Book Training to Improve Summarization Encoder Memory
EMNLP 2018
Proceedings of ACL 2017, System Demonstrations
ACL 2017
A Joint Speaker-Listener-Reinforcer Model for Referring Expressions
CVPR 2017
Hierarchically-Attentive RNN for Album Summarization and Storytelling
EMNLP 2017
Video Highlight Prediction Using Audience Chat Reactions
EMNLP 2017
Reinforced Video Captioning with Entailment Rewards
EMNLP 2017
Multi-Task Video Captioning with Video and Entailment Generation
ACL 2017
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Tutorial Abstracts
NAACL 2016
Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
EMNLP 2016
Sort Story: Sorting Jumbled Images and Captions into Stories
EMNLP 2016
Interpreting Neural Networks to Improve Politeness Comprehension
EMNLP 2016
Charagram: Embedding Words and Sentences via Character n-grams
EMNLP 2016
We Are Humor Beings: Understanding and Predicting Visual Humor
CVPR 2016
Who did What: A Large-Scale Person-Centered Cloze Dataset
EMNLP 2016
End-to-End Relation Extraction using LSTMs on Sequences and Tree Structures
ACL 2016
What to talk about and how? Selective Generation using LSTMs with Coarse-to-Fine Alignment
NAACL 2016
The Role of Context Types and Dimensionality in Learning Word Embeddings
NAACL 2016
Deep Multilingual Correlation for Improved Word Embeddings
NAACL 2015
Machine Comprehension with Syntax, Frames, and Semantics
ACL 2015
Machine Comprehension with Syntax, Frames, and Semantics
IJCNLP 2015
What are You Talking About? Text-to-Image Coreference
CVPR 2014
Tailoring Continuous Word Representations for Dependency Parsing
ACL 2014
Structured Learning for Taxonomy Induction with Belief Propagation
ACL 2014
Weakly-Supervised Learning with Cost-Augmented Contrastive Estimation
EMNLP 2014
Unsupervised Translation Sense Clustering
NAACL 2012
Coreference Semantics from Web Features
ACL 2012
Web-Scale Features for Full-Scale Parsing
ACL 2011
Gappy Phrasal Alignment By Agreement
ACL 2011
The Surprising Variance in Shortest-Derivation Parsing
ACL 2011
Mention Detection: Heuristics for the OntoNotes annotations
CONLL 2011
Simple, Accurate Parsing with an All-Fragments Grammar
ACL 2010
Efficient Parsing for Transducer Grammars
NAACL 2009
The Power of Negative Thinking: Exploiting Label Disagreement in the Min-cut Classification Framework
COLING 2008