Owais Khan Mohammed
5 papers · 2022–2023 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+2 more ↓ Show less ↑
π Conference Polyglot (4) π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (12) π§ Keyword Pioneer
π£
Hot Topic Early Bird
π
Cross-Pollinator
(15)
Conferences
NIPS (2)
ACML (1)
CVPR (1)
EMNLP (1)
Top co-authors
Keywords
multimodal learning
(3)
visual question answering
(2)
zero-shot learning
(2)
vision-language model
(2)
in-context learning
(1)
image captioning
(1)
vision language model
(1)
foundation model
(1)
multimodal large language model
(1)
transformer network
(1)
image-text retrieval
(1)
multimodal reasoning
(1)
visual document understanding
(1)
multilingual learning
(1)
multimodal pretraining
(1)
masked image modeling
(1)
fusion encoder
(1)
language-image network
(1)
ocr-free processing
(1)
transformer architecture
(1)
Papers
Language Is Not All You Need: Aligning Perception with Language Models
NIPS 2023
Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks
CVPR 2023
DUBLIN: Visual Document Understanding By Language-Image Network
EMNLP 2023
VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts
NIPS 2022
Bootstrapping a high quality multilingual multimodal
dataset for Bletchley
ACML 2022