conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
VLGrammar: Grounded Grammar Induction of Vision and Language
ICCV 2021
Linguistically Routing Capsule Network for Out-of-Distribution Visual Question Answering
ICCV 2021
Motion Guided Attention Fusion To Recognize Interactions From Videos
ICCV 2021
Generic Attention-Model Explainability for Interpreting Bi-Modal and Encoder-Decoder Transformers
ICCV 2021
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
ICCV 2021
Video Question Answering Using Language-Guided Deep Compressed-Domain Video Feature
ICCV 2021
Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis
ICCV 2021
End-to-End Trainable Trident Person Search Network Using Adaptive Gradient Propagation
ICCV 2021
Visio-Temporal Attention for Multi-Camera Multi-Target Association
ICCV 2021
Wasserstein Coupled Graph Learning for Cross-Modal Retrieval
ICCV 2021
Learning To Cut by Watching Movies
ICCV 2021
SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation
ICCV 2021
Physics-Enhanced Machine Learning for Virtual Fluorescence Microscopy
ICCV 2021
Learning by Aligning: Visible-Infrared Person Re-Identification Using Cross-Modal Correspondences
ICCV 2021
Localize to Binauralize: Audio Spatialization From Visual Sound Source Localization
ICCV 2021
SPEC: Seeing People in the Wild With an Estimated Camera
ICCV 2021
Graph-Based 3D Multi-Person Pose Estimation Using Multi-View Images
ICCV 2021
COOKIE: Contrastive Cross-Modal Knowledge Sharing Pre-Training for Vision-Language Representation
ICCV 2021
Full-Velocity Radar Returns by Radar-Camera Fusion
ICCV 2021
Parsing Table Structures in the Wild
ICCV 2021
Joint Visual and Audio Learning for Video Highlight Detection
ICCV 2021
Weakly Supervised Text-Based Person Re-Identification
ICCV 2021
Attention Is Not Enough: Mitigating the Distribution Discrepancy in Asynchronous Multimodal Sequence Fusion
ICCV 2021
Class Semantics-Based Attention for Action Detection
ICCV 2021
Adversarial Attack on Deep Cross-Modal Hamming Retrieval
ICCV 2021
<
1
…
425
426
427
…
523
>