conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Positive Sample Propagation Along the Audio-Visual Event Line
CVPR 2021
StylePeople: A Generative Model of Fullbody Human Avatars
CVPR 2021
Intentonomy: A Dataset and Study Towards Human Intent Understanding
CVPR 2021
Hybrid Message Passing With Performance-Driven Structures for Facial Action Unit Detection
CVPR 2021
HOTR: End-to-End Human-Object Interaction Detection With Transformers
CVPR 2021
ChallenCap: Monocular 3D Capture of Challenging Human Performances Using Multi-Modal References
CVPR 2021
Zillow Indoor Dataset: Annotated Floor Plans With 360deg Panoramas and 3D Room Layouts
CVPR 2021
Learning Triadic Belief Dynamics in Nonverbal Communication From Videos
CVPR 2021
GANmut: Learning Interpretable Conditional Space for Gamut of Emotions
CVPR 2021
TediGAN: Text-Guided Diverse Face Image Generation and Manipulation
CVPR 2021
Towards Diverse Paragraph Captioning for Untrimmed Videos
CVPR 2021
Dual-GAN: Joint BVP and Noise Modeling for Remote Physiological Measurement
CVPR 2021
Multi-Modal Relational Graph for Cross-Modal Video Moment Retrieval
CVPR 2021
Roses Are Red, Violets Are Blue... but Should VQA Expect Them To?
CVPR 2021
Understanding Object Dynamics for Interactive Image-to-Video Synthesis
CVPR 2021
Semantic Audio-Visual Navigation
CVPR 2021
PAConv: Position Adaptive Convolution With Dynamic Kernel Assembling on Point Clouds
CVPR 2021
Bridge To Answer: Structure-Aware Graph Interaction Network for Video Question Answering
CVPR 2021
Structured Scene Memory for Vision-Language Navigation
CVPR 2021
Cross-Modal Collaborative Representation Learning and a Large-Scale RGBT Benchmark for Crowd Counting
CVPR 2021
Predicting Human Scanpaths in Visual Question Answering
CVPR 2021
StereoPIFu: Depth Aware Clothed Human Digitization via Stereo Vision
CVPR 2021
Right for the Right Concept: Revising Neuro-Symbolic Concepts by Interacting With Their Explanations
CVPR 2021
Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval
CVPR 2021
Learning Camera Localization via Dense Scene Matching
CVPR 2021
<
1
…
413
414
415
…
523
>