Muhammad Maaz
9 papers · 2022–2025 · 7 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+3 more ↓ Show less ↑
π Cross-Pollinator (15) π Conference Polyglot (7) π Renaissance Researcher (5) π Interdisciplinary Bridge πΊοΈ Taxonomy Completionist (24)
π§
Keyword Pioneer
π£
Hot Topic Early Bird
π₯
Mega-Team
(29)
Conferences
CVPR (3)
ACL (1)
ECCV (1)
EMNLP (1)
ICCV (1)
NIPS (1)
WACV (1)
Top co-authors
Keywords
vision-language model
(3)
transfer learning
(3)
video understanding
(2)
large multimodal model
(2)
vision language model
(2)
visual grounding
(2)
multimodal learning
(2)
object detection
(1)
image segmentation
(1)
multi-modal learning
(1)
visual encoder
(1)
domain generalization
(1)
prompt learning
(1)
instruction tuning
(1)
machine translation
(1)
open-vocabulary detection
(1)
few-shot learning
(1)
model scaling
(1)
zero-shot learning
(1)
representation learning
(1)
Papers
PALO: A Polyglot Large Multimodal Model for 5B People
WACV 2025
A Culturally-diverse Multilingual Multimodal Video Benchmark & Model
EMNLP 2025
GLaMM: Pixel Grounding Large Multimodal Model
CVPR 2024
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
ACL 2024
Fine-Tuned CLIP Models Are Efficient Video Learners
CVPR 2023
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
ICCV 2023
MaPLe: Multi-Modal Prompt Learning
CVPR 2023
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection
NIPS 2022
Class-Agnostic Object Detection with Multi-modal Transformer
ECCV 2022