Dimosthenis Karatzas

21 papers · 2017–2025 · 8 conferences · across top CS/AI conferences

Achievements

🌉 Interdisciplinary Bridge 🏃 Academic Marathon (8) 🌍 Conference Polyglot (8) 🌈 Renaissance Researcher (6) 🗺️ Taxonomy Completionist (34)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🤝 Dynamic Duo (14) 🧬 Topic Evolution 🏆 Grand Slam 💎 Century Club (21) 🗃️ Keyword Collector (80) 🔥 Unstoppable (9) ❓ The Questioner 🚀 Conference Pioneer

Conferences

WACV (11) AAAI (2) CVPR (2) ECCV (2) ICCV (1) ICLR (1) ICML (1) NIPS (1)

Top co-authors

Lluis Gomez (14) Ali Furkan Biten (9) Andres Mafla (8) Marcal Rusinol (5) C.V. Jawahar (3) Minesh Mathew (3) Mohamed Ali Souibgui (3) Sounak Dey (3) Ernest Valveny (3) Josep Lladós (2)

Keywords

multimodal learning (6) image captioning (3) visual reasoning (3) text recognition (3) visual question answering (3) scene text (3) self-supervised learning (2) image classification (2) handwritten text recognition (2) multi-modal reasoning (2) image retrieval (2) named entity recognition (2) cross-modal retrieval (2) fine-grained classification (2) social media analysis (1) data augmentation (1) transfer learning (1) feature learning (1) embedding learning (1) document understanding (1)

Papers

DocMIA: Document-Level Membership Inference Attacks against DocVQA Models · ICLR 2025
DocVXQA: Context-Aware Visual Explanations for Document Question Answering · ICML 2025
CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding · NIPS 2024
STEP - Towards Structured Scene-Text Spotting · WACV 2024
Watching the News: Towards VideoQA Models That Can Read · WACV 2023
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoder for Text Recognition and Document Enhancement · AAAI 2023
Show, Interpret and Tell: Entity-Aware Contextualised Image Captioning in Wikipedia · AAAI 2023
InfographicVQA · WACV 2022
One-Shot Compositional Data Generation for Low Resource Handwritten Text Recognition · WACV 2022
Is an Image Worth Five Sentences? A New Look Into Semantics for Image-Text Matching · WACV 2022
Let There Be a Clock on the Beach: Reducing Object Hallucination in Image Captioning · WACV 2022
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval · WACV 2021
StacMR: Scene-Text Aware Cross-Modal Retrieval · WACV 2021
DocVQA: A Dataset for VQA on Document Images · WACV 2021
Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features · WACV 2020
Location Sensitive Image Retrieval and Tagging · ECCV 2020
Exploring Hate Speech Detection in Multimodal Publications · WACV 2020
Good News, Everyone! Context Driven Entity-Aware Captioning for News Images · CVPR 2019
Scene Text Visual Question Answering · ICCV 2019
Single Shot Scene Text Retrieval · ECCV 2018
Self-Supervised Learning of Visual Features Through Embedding Images Into Text Topic Spaces · CVPR 2017