conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Adaptive-Attentive Geolocalization From Few Queries: A Hybrid Approach
WACV 2021
SubICap: Towards Subword-Informed Image Captioning
WACV 2021
DocVQA: A Dataset for VQA on Document Images
WACV 2021
Integrating Human Gaze Into Attention for Egocentric Activity Recognition
WACV 2021
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal Memes
NIPS 2020
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies
NIPS 2020
Video Object Segmentation with Adaptive Feature Bank and Uncertain-Region Refinement
NIPS 2020
Language Through a Prism: A Spectral Approach for Multiscale Language Representations
NIPS 2020
Multimodal Generative Learning Utilizing Jensen-Shannon-Divergence
NIPS 2020
Compositional Visual Generation with Energy Based Models
NIPS 2020
Exchangeable Neural ODE for Set Modeling
NIPS 2020
Language and Visual Entity Relationship Graph for Agent Navigation
NIPS 2020
Causal Discovery in Physical Systems from Videos
NIPS 2020
Non-reversible Gaussian processes for identifying latent dynamical structure in neural data
NIPS 2020
Self-Supervised Learning by Cross-Modal Audio-Video Clustering
NIPS 2020
UDH: Universal Deep Hiding for Steganography, Watermarking, and Light Field Messaging
NIPS 2020
Rel3D: A Minimally Contrastive Benchmark for Grounding Spatial Relations in 3D
NIPS 2020
Gibbs Sampling with People
NIPS 2020
Robust Optimal Transport with Applications in Generative Modeling and Domain Adaptation
NIPS 2020
Language-Conditioned Imitation Learning for Robot Manipulation Tasks
NIPS 2020
See, Hear, Explore: Curiosity via Audio-Visual Association
NIPS 2020
COBE: Contextualized Object Embeddings from Narrated Instructional Video
NIPS 2020
RATT: Recurrent Attention to Transient Tasks for Continual Image Captioning
NIPS 2020
Dense Correspondences between Human Bodies via Learning Transformation Synchronization on Graphs
NIPS 2020
Evolving Graphical Planner: Contextual Global Planning for Vision-and-Language Navigation
NIPS 2020
<
1
…
441
442
443
…
523
>