conftrace
_
Papers
Trends
Conferences
Explore
Authors
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Core AI
Artificial Intelligence
›
Core AI
›
Multimodal Learning
13,057 papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Pose-Controllable Talking Face Generation by Implicitly Modularized Audio-Visual Representation
CVPR 2021
Learning Better Visual Dialog Agents With Pretrained Visual-Linguistic Representation
CVPR 2021
Spoken Moments: Learning Joint Audio-Visual Representations From Video Descriptions
CVPR 2021
Shared Cross-Modal Trajectory Prediction for Autonomous Driving
CVPR 2021
Adversarial Laser Beam: Effective Physical-World Attack to DNNs in a Blink
CVPR 2021
Facial Action Unit Detection With Transformers
CVPR 2021
Modeling Multi-Label Action Dependencies for Temporal Action Localization
CVPR 2021
Causal Attention for Vision-Language Tasks
CVPR 2021
Revamping Cross-Modal Recipe Retrieval With Hierarchical Transformers and Self-Supervised Learning
CVPR 2021
Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality
CVPR 2021
Looking Into Your Speech: Learning Cross-Modal Affinity for Audio-Visual Speech Separation
CVPR 2021
MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments From a Single Moving Camera
CVPR 2021
Linguistic Structures As Weak Supervision for Visual Scene Graph Generation
CVPR 2021
On the (In)Effectiveness of Images for Text Classification
EACL 2021
CDˆ2CR: Co-reference resolution across documents and domains
EACL 2021
Bootstrapping Multilingual AMR with Contextual Word Alignments
EACL 2021
MONAH: Multi-Modal Narratives for Humans to analyze conversations
EACL 2021
Cross-lingual Entity Alignment with Incidental Supervision
EACL 2021
FakeFlow: Fake News Detection by Modeling the Flow of Affective Information
EACL 2021
Multiple Tasks Integration: Tagging, Syntactic and Semantic Parsing as a Single Task
EACL 2021
MIDAS: A Dialog Act Annotation Scheme for Open Domain HumanMachine Spoken Conversations
EACL 2021
Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation
EACL 2021
ECOL-R: Encouraging Copying in Novel Object Captioning with Reinforcement Learning
EACL 2021
On the Evaluation of Vision-and-Language Navigation Instructions
EACL 2021
Cross-lingual Visual Pre-training for Multimodal Machine Translation
EACL 2021
<
1
…
416
417
418
…
523
>