Jorma Laaksonen

16 papers · 2017–2025 · 10 conferences · across top CS/AI conferences

Achievements

+8 more ↓

🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (10) 🏃 Academic Marathon (8) 🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (35)

🐣 Hot Topic Early Bird 🌍 Conference Polyglot (10) 🏃 Academic Marathon (8) 🧬 Topic Evolution 👥 Mega-Team (69) 💎 Century Club (16) ❓ The Questioner 🗃️ Keyword Collector (68)

Conferences

CVPR (4) COLING (2) EMNLP (2) ICCV (2) AACL (1) ACL (1) ECCV (1) IJCNLP (1) NAACL (1) WACV (1)

Top co-authors

Rao Muhammad Anwer (6) Fahad Shahbaz Khan (5) Salman Khan (5) Hisham Cholakkal (4) Zixin Guo (3) Abduljalil Radman (3) Ali Borji (2) Michael Felsberg (2) Abdelrahman M. Shaker (2) Tzu-Jui Julius Wang (2)

Keywords

multimodal learning (6) image captioning (2) large language model (2) large multimodal model (2) transfer learning (2) benchmark evaluation (2) vision-language model (2) image difference captioning (2) multilingual nlp (1) attention mechanism (1) domain generalization (1) weakly supervised learning (1) semantic segmentation (1) conversational ai (1) medical imaging (1) video segmentation (1) temporal modeling (1) sentiment analysis (1) contextual attention (1) model evaluation (1)

Papers

All Languages Matter: Evaluating LMMs on Culturally Diverse 100 Languages CVPR 2025 TSAM: Temporal SAM Augmented with Multimodal Prompts for Referring Audio-Visual Segmentation CVPR 2025 Learning to Describe Implicit Changes: Noise-robust Pre-training for Image Difference Captioning EMNLP 2025 CAMEL-Bench: A Comprehensive Arabic LMM Benchmark NAACL 2025 XrayGPT: Chest Radiographs Summarization using Large Medical Vision-Language Models ACL 2024 Text-to-Multimodal Retrieval with Bimodal Input Fusion in Shared Cross-Modal Transformer COLING 2024 Person Image Synthesis via Denoising Diffusion Model CVPR 2023 Learning by Hallucinating: Vision-Language Pre-Training With Weak Supervision WACV 2023 DoodleFormer: Creative Sketch Drawing with Transformers ECCV 2022 When to Laugh and How Hard? A Multimodal Approach to Detecting Humor and Its Intensity COLING 2022 CLIP4IDC: CLIP for Image Difference Captioning AACL 2022 CLIP4IDC: CLIP for Image Difference Captioning IJCNLP 2022 Deep Contextual Attention for Human-Object Interaction Detection ICCV 2019 The MeMAD Submission to the WMT18 Multimodal Translation Task EMNLP 2018 Saliency Revisited: Analysis of Mouse Movements Versus Fixations CVPR 2017 Paying Attention to Descriptions Generated by Image Captioning Models ICCV 2017