Samuel Albanie
41 papers · 2018–2025 · 10 conferences across top CS/AI venues
Achievements
Keyword Pioneer
Conference Polyglot (10)
Interdisciplinary Bridge
Taxonomy Completionist (14)
Academic Marathon (7)
Cross-Pollinator (10)
Renaissance Researcher (9)
Keyword Champion (2)
Topic Evolution
Conference Pioneer
Unstoppable (8)
Keyword Collector (176)
The Questioner (2)
Century Club (41)
Prolific Year (5)
Conferences
CVPR (9)
ICCV (9)
NIPS (8)
ACL (4)
ECCV (4)
ICLR (3)
AAAI (1)
INTERSPEECH (1)
JMLR (1)
WACV (1)
Keywords
benchmark evaluation (4)
representation learning (4)
multimodal learning (4)
zero-shot learning (3)
temporal localization (3)
large language model (3)
cross-modal retrieval (3)
foundation model (3)
video retrieval (3)
transformer architecture (2)
sign language (2)
text-based retrieval (2)
transfer learning (2)
stochastic gradient descent (2)
knowledge distillation (2)
self-supervised learning (2)
model evaluation (2)
large multimodal model (2)
zero-shot classification (2)
feature learning (1)
Papers
Inverse Constitutional AI: Compressing Preferences into Principles
ICLR 2025
GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models
ICCV 2025
How to Merge Your Multimodal Models Over Time?
CVPR 2025
GAMEBoT: Transparent Assessment of LLM Reasoning in Games
ACL 2025
ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities
ACL 2025
Active Data Curation Effectively Distills Large-Scale Multimodal Models
CVPR 2025
Needle Threading: Can LLMs Follow Threads Through Near-Million-Scale Haystacks?
ICLR 2025
DeepMIM: Deep Supervision for Masked Image Modeling
WACV 2025
HelloFresh: LLM Evaluations on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits
ACL 2024
SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation
NIPS 2024
No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance
NIPS 2024
Efficient Lifelong Model Evaluation in an Era of Rapid Progress
NIPS 2024
On scalable oversight with weak LLMs judging strong LLMs
NIPS 2024
A Practitioner's Guide to Real-World Continual Multimodal Pretraining
NIPS 2024
InstructVideo: Instructing Video Diffusion Models with Human Feedback
CVPR 2024
Visual Data-Type Understanding does not emerge from scaling Vision-Language Models
ICLR 2024
Iterate Averaging in the Quest for Best Test Error
JMLR 2024
SuS-X: Training-Free Name-Only Transfer of Vision-Language Models
ICCV 2023
Simple Baselines for Interactive Video Retrieval with Questions and Answers
ICCV 2023
Crosslingual Generalization through Multitask Finetuning
ACL 2023
RLIPv2: Fast Scaling of Relational Language-Image Pre-Training
ICCV 2023
Moment Detection in Long Tutorial Videos
ICCV 2023
ReCo: Retrieve and Co-segment for Zero-shot Transfer
NIPS 2022
Sign Language Video Retrieval With Free-Form Textual Queries
CVPR 2022
Cross Modal Retrieval With Querybank Normalisation
CVPR 2022
Automatic Dense Annotation of Large-Vocabulary Sign Language Videos
ECCV 2022
RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection
NIPS 2022
Read and Attend: Temporal Localisation in Sign Language Videos
CVPR 2021
Audio Retrieval with Natural Language Queries
INTERSPEECH 2021
TeachText: CrossModal Generalized Distillation for Text-Video Retrieval
ICCV 2021
Aligning Subtitles in Sign Language Videos
ICCV 2021
Mind-the-Gap! Unsupervised Domain Adaptation for Text-Video Retrieval
AAAI 2021
Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval
CVPR 2021
BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues
ECCV 2020
Small Steps and Giant Leaps: Minimal Newton Solvers for Deep Learning
ICCV 2019
Unsupervised Learning of Landmarks by Descriptor Vector Exchange
ICCV 2019
Self-Supervised Learning of Geometrically Stable Features Through Probabilistic Introspection
CVPR 2018
Gather-Excite: Exploiting Feature Context in Convolutional Neural Networks
NIPS 2018
Seeing Voices and Hearing Faces: Cross-Modal Biometric Matching
CVPR 2018
Semi-convolutional Operators for Instance Segmentation
ECCV 2018
Learnable PINs: Cross-Modal Embeddings for Person Identity
ECCV 2018