Research Explorer
Multimodal Learning
13057 directly classified papers
Papers per year
2003: 1
2006: 3
2007: 6
2008: 2
2009: 5
2010: 2
2011: 3
2012: 6
2013: 24
2014: 20
2015: 46
2016: 109
2017: 205
2018: 299
2019: 622
2020: 675
2021: 987
2022: 1084
2023: 1697
2024: 2500
2025: 3654
2026: 1107
Papers
Explicit Knowledge-based Reasoning for Visual Question Answering
IJCAI 2017
Hierarchical LSTM with Adjusted Temporal Attention for Video Captioning
IJCAI 2017
DeepStory: Video Story QA by Deep Embedded Memory Networks
IJCAI 2017
Auditory-Visual Integration of Talker Gender in Cantonese Tone Perception
INTERSPEECH 2017
Computing Multimodal Dyadic Behaviors During Spontaneous Diagnosis Interviews Toward Automatic Categorization of Autism Spectrum Disorder
INTERSPEECH 2017
Modal Consistency based Pre-Trained Multi-Model Reuse
IJCAI 2017
A Domain Knowledge-Assisted Nonlinear Model for Head-Related Transfer Functions Based on Bottleneck Deep Neural Network
INTERSPEECH 2017
Multimodal Markers of Persuasive Speech: Designing a Virtual Debate Coach
INTERSPEECH 2017
An Information Theoretic Analysis of the Temporal Synchrony Between Head Gestures and Prosodic Patterns in Spontaneous Speech
INTERSPEECH 2017
An Attention-based Regression Model for Grounding Textual Phrases in Images
IJCAI 2017
Predicting the Quality of Short Narratives from Social Media
IJCAI 2017
Image-embodied Knowledge Representation Learning
IJCAI 2017
Cross-modal Common Representation Learning by Hybrid Transfer Network
IJCAI 2017
Co-Production of Speech and Pointing Gestures in Clear and Perturbed Interactive Tasks: Multimodal Designation Strategies
INTERSPEECH 2017
Modelling the Informativeness of Non-Verbal Cues in Parent-Child Interaction
INTERSPEECH 2017
Attentive Convolutional Neural Network Based Speech Emotion Recognition: A Study on the Impact of Input Features, Signal Length, and Acted Speech
INTERSPEECH 2017
Opinion Dynamics Modeling for Movie Review Transcripts Classification with Hidden Conditional Random Fields
INTERSPEECH 2017
Multi-Task Learning for Prosodic Structure Generation Using BLSTM RNN with Structured Output Layer
INTERSPEECH 2017
Dual Track Multimodal Automatic Learning through Human-Robot Interaction
IJCAI 2017
Multimodal Storytelling via Generative Adversarial Imitation Learning
IJCAI 2017
End-to-end optimization of goal-driven and visually grounded dialogue systems
IJCAI 2017
Remote Articulation Test System Based on WebRTC
INTERSPEECH 2017
Constructing Acoustic Distances Between Subwords and States Obtained from a Deep Neural Network for Spoken Term Detection
INTERSPEECH 2017
Speech Emotion Recognition with Emotion-Pair Based Framework Considering Emotion Distribution Information in Dimensional Emotion Space
INTERSPEECH 2017
Semi Parametric Concatenative TTS with Instant Voice Modification Capabilities
INTERSPEECH 2017