conftrace_

Muhammad Maaz

9 papers · 2022–2025 · 7 conferences · across top CS/AI conferences

Achievements

🐝 Cross-Pollinator (15) 🌍 Conference Polyglot (7) 🌈 Renaissance Researcher (5) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (24)

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 👥 Mega-Team (29)

Conferences

CVPR (3) ACL (1) ECCV (1) EMNLP (1) ICCV (1) NIPS (1) WACV (1)

Top co-authors

Salman Khan (8) Hanoona Rasheed (7) Fahad Shahbaz Khan (6) Muhammad Uzair Khattak (3) Hisham Cholakkal (3) Ming-Hsuan Yang (3) Abdelrahman Shaker (3) Michael Felsberg (2) Rao M. Anwer (2) Fahad S. Khan (2)

Keywords

vision-language model (3) transfer learning (3) video understanding (2) large multimodal model (2) vision language model (2) visual grounding (2) multimodal learning (2) object detection (1) image segmentation (1) multi-modal learning (1) visual encoder (1) domain generalization (1) prompt learning (1) instruction tuning (1) machine translation (1) open-vocabulary detection (1) few-shot learning (1) model scaling (1) zero-shot learning (1) representation learning (1)

Papers

PALO: A Polyglot Large Multimodal Model for 5B People · WACV 2025
A Culturally-diverse Multilingual Multimodal Video Benchmark & Model · EMNLP 2025
GLaMM: Pixel Grounding Large Multimodal Model · CVPR 2024
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models · ACL 2024
Fine-Tuned CLIP Models Are Efficient Video Learners · CVPR 2023
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications · ICCV 2023
MaPLe: Multi-Modal Prompt Learning · CVPR 2023
Bridging the Gap between Object and Image-level Representations for Open-Vocabulary Detection · NIPS 2022
Class-Agnostic Object Detection with Multi-modal Transformer · ECCV 2022