Alaaeldin El-Nouby
12 papers · 2019–2025 · 5 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+6 more ↓ Show less ↑
π Interdisciplinary Bridge π Academic Marathon (6) π Conference Polyglot (5) π Renaissance Researcher (5) πΊοΈ Taxonomy Completionist (25)
π£
Hot Topic Early Bird
π
Renaissance Researcher
(5)
π
Interdisciplinary Bridge
π₯
Mega-Team
(60)
π₯
Unstoppable
(5)
π
Century Club
(12)
Conferences
ICML (4)
CVPR (3)
ICCV (3)
ECCV (1)
NIPS (1)
Top co-authors
Keywords
vision transformer
(2)
foundation model
(2)
multimodal learning
(2)
multi-modal learning
(2)
self-supervised learning
(1)
attention mechanism
(1)
contrastive learning
(1)
scaling law
(1)
image reconstruction
(1)
audio-visual learning
(1)
video understanding
(1)
text-to-image generation
(1)
cross-modal retrieval
(1)
generative model
(1)
variational autoencoder
(1)
convolutional neural network
(1)
language model
(1)
zero-shot learning
(1)
image classification
(1)
representation learning
(1)
Papers
Scaling Laws for Native Multimodal Models
ICCV 2025
FlexTok: Resampling Images into 1D Token Sequences of Flexible Length
ICML 2025
Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models
ICML 2025
Multimodal Autoregressive Pre-training of Large Vision Encoders
CVPR 2025
DataComp-LM: In search of the next generation of training sets for language models
NIPS 2024
Scalable Pre-training of Large Autoregressive Image Models
ICML 2024
Improving Statistical Fidelity for Neural Image Compression with Implicit Local Likelihood Models
ICML 2023
OmniMAE: Single Model Masked Pretraining on Images and Videos
CVPR 2023
ImageBind: One Embedding Space To Bind Them All
CVPR 2023
Three Things Everyone Should Know about Vision Transformers
ECCV 2022
LeViT: A Vision Transformer in ConvNet's Clothing for Faster Inference
ICCV 2021
Tell, Draw, and Repeat: Generating and Modifying Images Based on Continual Linguistic Instruction
ICCV 2019