Papers
JNLP at SemEval-2025 Task 1: Multimodal Idiomaticity Representation with Large Language Models
ACL 2025
Detecting Referring Expressions in Visually Grounded Dialogue with Autoregressive Language Models
ACL 2025
Tri-Ergon: Fine-Grained Video-to-Audio Generation with Multi-Modal Conditions and LUFS Control
AAAI 2025
CognitionCapturer: Decoding Visual Stimuli from Human EEG Signal with Multimodal Information
AAAI 2025
Evaluating LLM-Generated Diagrams as Graphs
EMNLP 2025