conftrace
_
Papers
Trends
Conferences
Explore
More
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
← Keywords
multimodal learning
4645 papers
Explore in graph
Co-occurring keywords
large language model
(13587)
vision-language model
(2348)
visual question answering
(1017)
video understanding
(1658)
multi-modal learning
(1278)
contrastive learning
(4032)
representation learning
(6206)
transfer learning
(5449)
zero-shot learning
(3650)
vision language model
(767)
Papers
Quartet@LT-EDI 2024: A SVM-ResNet50 Approach For Multitask Meme Classification - Unraveling Misogynistic and Trolls in Online Memes
EACL 2024
WikiScenes with Descriptions: Aligning Paragraphs and Sentences with Images in Wikipedia Articles
NAACL 2024
SheffieldVeraAI at SemEval-2024 Task 4: Prompting and fine-tuning a Large Vision-Language Model for Binary Classification of Persuasion Techniques in Memes
NAACL 2024
MemeGuard: An LLM and VLM-based Framework for Advancing Content Moderation via Meme Intervention
ACL 2024
A Mapping on Current Classifying Categories of Emotions Used in Multimodal Models for Emotion Recognition
EACL 2024
Continual Multimodal Knowledge Graph Construction
IJCAI 2024
JMI at SemEval 2024 Task 3: Two-step approach for multimodal ECAC using in-context learning with GPT and instruction-tuned Llama models
NAACL 2024
Wit Hub@DravidianLangTech-2024:Multimodal Social Media Data Analysis in Dravidian Languages using Machine Learning Models
EACL 2024
SMILE: Multimodal Dataset for Understanding Laughter in Video with Language Models
NAACL 2024
Re-evaluating the Need for Visual Signals in Unsupervised Grammar Induction
NAACL 2024
Multimodal Contextualized Semantic Parsing from Speech
ACL 2024
Chart-based Reasoning: Transferring Capabilities from LLMs to VLMs
NAACL 2024
Functional Graph Convolutional Networks: A Unified Multi-task and Multi-modal Learning Framework to Facilitate Health and Social-Care Insights
IJCAI 2024
Which Modality should I use - Text, Motif, or Image? : Understanding Graphs with Large Language Models
NAACL 2024
Multimodal Fallacy Classification in Political Debates
EACL 2024
Multimodal Contextual Dialogue Breakdown Detection for Conversational AI Models
NAACL 2024
A Concise Report of the 7th Workshop on Challenges and Applications of Automated Extraction of Socio-political Events from Text
EACL 2024
Generating Signed Language Instructions in Large-Scale Dialogue Systems
NAACL 2024
DiffCLIP: Leveraging Stable Diffusion for Language Grounded 3D Classification
WACV 2024
MasonPerplexity at Multimodal Hate Speech Event Detection 2024: Hate Speech and Target Detection Using Transformer Ensembles
EACL 2024
M3T: A New Benchmark Dataset for Multi-Modal Document-Level Machine Translation
NAACL 2024
LLM Knows Body Language, Too: Translating Speech Voices into Human Gestures
ACL 2024
Efficient End-to-End Visual Document Understanding with Rationale Distillation
NAACL 2024
MaCSC: Towards Multimodal-augmented Pre-trained Language Models via Conceptual Prototypes and Self-balancing Calibration
NAACL 2024
CLTL@Multimodal Hate Speech Event Detection 2024: The Winning Approach to Detecting Multimodal Hate Speech and Its Targets
EACL 2024
<
1
…
64
65
66
…
186
>