Subhojit Som
7 papers · 2020–2026 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (15) π Conference Polyglot (4) π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (14)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π₯
Mega-Team
(24)
Conferences
NIPS (3)
ACL (1)
ACML (1)
CVPR (1)
EMNLP (1)
Top co-authors
Keywords
multimodal learning
(3)
zero-shot learning
(2)
visual question answering
(2)
vision-language model
(2)
transfer learning
(1)
in-context learning
(1)
image captioning
(1)
deep learning
(1)
vision language model
(1)
foundation model
(1)
multimodal large language model
(1)
transformer network
(1)
image-text retrieval
(1)
multimodal reasoning
(1)
data synthesis
(1)
cloud computing
(1)
visual document understanding
(1)
reasoning augmentation
(1)
multilingual learning
(1)
multimodal pretraining
(1)
Papers
WebSTAR: Scalable Data Synthesis for Computer Use Agents with Step-Level Filtering
ACL 2026
Image as a Foreign Language: BEiT Pretraining for Vision and Vision-Language Tasks
CVPR 2023
DUBLIN: Visual Document Understanding By Language-Image Network
EMNLP 2023
Language Is Not All You Need: Aligning Perception with Language Models
NIPS 2023
Bootstrapping a high quality multilingual multimodal
dataset for Bletchley
ACML 2022
VLMo: Unified Vision-Language Pre-Training with Mixture-of-Modality-Experts
NIPS 2022
Pushing the Limits of Narrow Precision Inferencing at Cloud Scale with Microsoft Floating Point
NIPS 2020