Dimosthenis Karatzas
21 papers · 2017–2025 · 8 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+11 more ↓ Show less ↑
π Interdisciplinary Bridge π Academic Marathon (8) π Conference Polyglot (8) π Renaissance Researcher (6) πΊοΈ Taxonomy Completionist (34)
πΊοΈ
Taxonomy Completionist
(34)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π€
Dynamic Duo
(14)
π§¬
Topic Evolution
π
Grand Slam
π
Century Club
(21)
ποΈ
Keyword Collector
(80)
π₯
Unstoppable
(9)
β
The Questioner
π
Conference Pioneer
Conferences
WACV (11)
AAAI (2)
CVPR (2)
ECCV (2)
ICCV (1)
ICLR (1)
ICML (1)
NIPS (1)
Top co-authors
Keywords
multimodal learning
(6)
image captioning
(3)
visual reasoning
(3)
text recognition
(3)
visual question answering
(3)
scene text
(3)
self-supervised learning
(2)
image classification
(2)
handwritten text recognition
(2)
multi-modal reasoning
(2)
image retrieval
(2)
named entity recognition
(2)
cross-modal retrieval
(2)
fine-grained classification
(2)
social media analysis
(1)
data augmentation
(1)
transfer learning
(1)
feature learning
(1)
embedding learning
(1)
document understanding
(1)
Papers
DocMIA: Document-Level Membership Inference Attacks against DocVQA Models
ICLR 2025
DocVXQA: Context-Aware Visual Explanations for Document Question Answering
ICML 2025
CoMix: A Comprehensive Benchmark for Multi-Task Comic Understanding
NIPS 2024
STEP - Towards Structured Scene-Text Spotting
WACV 2024
Watching the News: Towards VideoQA Models That Can Read
WACV 2023
Text-DIAE: A Self-Supervised Degradation Invariant Autoencoder for Text Recognition and Document Enhancement
AAAI 2023
Show, Interpret and Tell: Entity-Aware Contextualised Image Captioning in Wikipedia
AAAI 2023
InfographicVQA
WACV 2022
One-Shot Compositional Data Generation for Low Resource Handwritten Text Recognition
WACV 2022
Is an Image Worth Five Sentences? A New Look Into Semantics for Image-Text Matching
WACV 2022
Let There Be a Clock on the Beach: Reducing Object Hallucination in Image Captioning
WACV 2022
Multi-Modal Reasoning Graph for Scene-Text Based Fine-Grained Image Classification and Retrieval
WACV 2021
StacMR: Scene-Text Aware Cross-Modal Retrieval
WACV 2021
DocVQA: A Dataset for VQA on Document Images
WACV 2021
Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features
WACV 2020
Location Sensitive Image Retrieval and Tagging
ECCV 2020
Exploring Hate Speech Detection in Multimodal Publications
WACV 2020
Good News, Everyone! Context Driven Entity-Aware Captioning for News Images
CVPR 2019
Scene Text Visual Question Answering
ICCV 2019
Single Shot Scene Text Retrieval
ECCV 2018
Self-Supervised Learning of Visual Features Through Embedding Images Into Text Topic Spaces
CVPR 2017