Jorma Laaksonen
16 papers · 2017–2025 · 10 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+8 more ↓ Show less ↑
π Interdisciplinary Bridge π Conference Polyglot (10) π Academic Marathon (8) π Renaissance Researcher (6) πΊοΈ Taxonomy Completionist (35)
π£
Hot Topic Early Bird
π
Conference Polyglot
(10)
π
Academic Marathon
(8)
π§¬
Topic Evolution
π₯
Mega-Team
(69)
π
Century Club
(16)
β
The Questioner
ποΈ
Keyword Collector
(68)
Conferences
CVPR (4)
COLING (2)
EMNLP (2)
ICCV (2)
AACL (1)
ACL (1)
ECCV (1)
IJCNLP (1)
NAACL (1)
WACV (1)
Top co-authors
Keywords
multimodal learning
(6)
image captioning
(2)
large language model
(2)
large multimodal model
(2)
transfer learning
(2)
benchmark evaluation
(2)
vision-language model
(2)
image difference captioning
(2)
multilingual nlp
(1)
attention mechanism
(1)
domain generalization
(1)
weakly supervised learning
(1)
semantic segmentation
(1)
conversational ai
(1)
medical imaging
(1)
video segmentation
(1)
temporal modeling
(1)
sentiment analysis
(1)
contextual attention
(1)
model evaluation
(1)
Papers
All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages
CVPR 2025
TSAM: Temporal SAM Augmented with Multimodal Prompts for Referring Audio-Visual Segmentation
CVPR 2025
Learning to Describe Implicit Changes: Noise-robust Pre-training for Image Difference Captioning
EMNLP 2025
CAMEL-Bench: A Comprehensive Arabic LMM Benchmark
NAACL 2025
XrayGPT: Chest Radiographs Summarization using Large Medical Vision-Language Models
ACL 2024
Text-to-Multimodal Retrieval with Bimodal Input Fusion in Shared Cross-Modal Transformer
COLING 2024
Person Image Synthesis via Denoising Diffusion Model
CVPR 2023
Learning by Hallucinating: Vision-Language Pre-Training With Weak Supervision
WACV 2023
DoodleFormer: Creative Sketch Drawing with Transformers
ECCV 2022
When to Laugh and How Hard? A Multimodal Approach to Detecting Humor and Its Intensity
COLING 2022
CLIP4IDC: CLIP for Image Difference Captioning
AACL 2022
CLIP4IDC: CLIP for Image Difference Captioning
IJCNLP 2022
Deep Contextual Attention for Human-Object Interaction Detection
ICCV 2019
The MeMAD Submission to the WMT18 Multimodal Translation Task
EMNLP 2018
Saliency Revisited: Analysis of Mouse Movements Versus Fixations
CVPR 2017
Paying Attention to Descriptions Generated by Image Captioning Models
ICCV 2017