Hanoona Rasheed
7 papers · 2022–2025 · 5 conferences · across top CS/AI conferences
Achievements
Conference Polyglot (5)
Renaissance Researcher (5)
Interdisciplinary Bridge
Taxonomy Completionist (19)
Keyword Pioneer
Cross-Pollinator (15)
Conferences
CVPR (3)
ACL (1)
ECCV (1)
ICCV (1)
WACV (1)
Keywords
vision-language model (3)
vision language model (2)
transfer learning (2)
video understanding (2)
multimodal learning (2)
multi-modal learning (1)
visual grounding (1)
prompt learning (1)
instruction tuning (1)
large multimodal model (1)
model scaling (1)
visual encoder (1)
multimodal model (1)
model efficiency (1)
visual language model (1)
efficient attention (1)
large language model (1)
conversation system (1)
referring expression segmentation (1)
segmentation mask (1)
Papers
PALO: A Polyglot Large Multimodal Model for 5B People
WACV 2025
GLaMM: Pixel Grounding Large Multimodal Model
CVPR 2024
Video-ChatGPT: Towards Detailed Video Understanding via Large Vision and Language Models
ACL 2024
MaPLe: Multi-Modal Prompt Learning
CVPR 2023
SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications
ICCV 2023
Fine-Tuned CLIP Models Are Efficient Video Learners
CVPR 2023
Class-Agnostic Object Detection with Multi-modal Transformer
ECCV 2022