Hassan Akbari
6 papers · 2019–2024 · 4 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+4 more ↓ Show less ↑
π Conference Polyglot (4) π Academic Marathon (5) π£ Hot Topic Early Bird π Cross-Pollinator (15) π Interdisciplinary Bridge
πΊοΈ
Taxonomy Completionist
(16)
π§
Keyword Pioneer
π₯
Mega-Team
(31)
π
Trend Setter
Conferences
NIPS (3)
CVPR (1)
ICLR (1)
ICML (1)
Top co-authors
Keywords
self-supervised learning
(2)
multimodal learning
(2)
contrastive learning
(2)
curriculum learning
(1)
visual grounding
(1)
semantic space
(1)
convolutional neural network
(1)
visual feature
(1)
alternating gradient descent
(1)
cosine similarity
(1)
transformer encoder
(1)
multimodal attention
(1)
multimodal transformer
(1)
phrase grounding
(1)
video action recognition
(1)
cross-modality alignment
(1)
multimodal contrastive learning
(1)
text-to-video retrieval
(1)
multimodal pre-training
(1)
video-audio-text alignment
(1)
Papers
VideoPoet: A Large Language Model for Zero-Shot Video Generation
ICML 2024
Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception
NIPS 2023
PaLI: A Jointly-Scaled Multilingual Language-Image Model
ICLR 2023
Scaling Multimodal Pre-Training via Cross-Modality Gradient Harmonization
NIPS 2022
VATT: Transformers for Multimodal Self-Supervised Learning from Raw Video, Audio and Text
NIPS 2021
Multi-Level Multimodal Common Semantic Space for Image-Phrase Grounding
CVPR 2019