Samuel Albanie

41 papers · 2018–2025 · 10 conferences · across top CS/AI conferences

Achievements

+11 more ↓

🧭 Keyword Pioneer 🌍 Conference Polyglot (10) 🌉 Interdisciplinary Bridge 🗺️ Taxonomy Completionist (14) 🏃 Academic Marathon (7)

🏃 Academic Marathon (7) 🐝 Cross-Pollinator (10) 🌈 Renaissance Researcher (9) 🏆 Keyword Champion (2) 🧬 Topic Evolution 🚀 Conference Pioneer 🔥 Unstoppable (8) 🗃️ Keyword Collector (176) ❓ The Questioner (2) 💎 Century Club (41) ⚡ Prolific Year (5)

Conferences

CVPR (9) ICCV (9) NIPS (8) ACL (4) ECCV (4) ICLR (3) AAAI (1) INTERSPEECH (1) JMLR (1) WACV (1)

Top co-authors

Vishaal Udandarao (8) Andrew Zisserman (7) Matthias Bethge (6) Ameya Prabhu (5) Gül Varol (5) Yang Liu (5) Andrea Vedaldi (5) Kai Han (4) Jonathan Roberts (4) Liliane Momeni (4)

Keywords

benchmark evaluation (4) representation learning (4) multimodal learning (4) zero-shot learning (3) temporal localization (3) large language model (3) cross-modal retrieval (3) foundation model (3) video retrieval (3) transformer architecture (2) sign language (2) text-based retrieval (2) transfer learning (2) stochastic gradient descent (2) knowledge distillation (2) self-supervised learning (2) model evaluation (2) large multimodal model (2) zero-shot classification (2) feature learning (1)

Papers

Inverse Constitutional AI: Compressing Preferences into Principles ICLR 2025 GRAB: A Challenging GRaph Analysis Benchmark for Large Multimodal Models ICCV 2025 How to Merge Your Multimodal Models Over Time? CVPR 2025 GAMEBoT: Transparent Assessment of LLM Reasoning in Games ACL 2025 ONEBench to Test Them All: Sample-Level Benchmarking Over Open-Ended Capabilities ACL 2025 Active Data Curation Effectively Distills Large-Scale Multimodal Models CVPR 2025 Needle Threading: Can LLMs Follow Threads Through Near-Million-Scale Haystacks? ICLR 2025 DeepMIM: Deep Supervision for Masked Image Modeling WACV 2025 HelloFresh: LLM Evalutions on Streams of Real-World Human Editorial Actions across X Community Notes and Wikipedia edits ACL 2024 SciFIBench: Benchmarking Large Multimodal Models for Scientific Figure Interpretation NIPS 2024 No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance NIPS 2024 Efficient Lifelong Model Evaluation in an Era of Rapid Progress NIPS 2024 On scalable oversight with weak LLMs judging strong LLMs NIPS 2024 A Practitioner's Guide to Real-World Continual Multimodal Pretraining NIPS 2024 InstructVideo: Instructing Video Diffusion Models with Human Feedback CVPR 2024 Visual Data-Type Understanding does not emerge from scaling Vision-Language Models ICLR 2024 Iterate Averaging in the Quest for Best Test Error JMLR 2024 SuS-X: Training-Free Name-Only Transfer of Vision-Language Models ICCV 2023 Simple Baselines for Interactive Video Retrieval with Questions and Answers ICCV 2023 Crosslingual Generalization through Multitask Finetuning ACL 2023 RLIPv2: Fast Scaling of Relational Language-Image Pre-Training ICCV 2023 Moment Detection in Long Tutorial Videos ICCV 2023 ReCo: Retrieve and Co-segment for Zero-shot Transfer NIPS 2022 Sign Language Video Retrieval With Free-Form Textual Queries CVPR 2022 Cross Modal Retrieval With Querybank Normalisation CVPR 2022 Automatic Dense Annotation of Large-Vocabulary Sign Language Videos ECCV 2022 RLIP: Relational Language-Image Pre-training for Human-Object Interaction Detection NIPS 2022 Read and Attend: Temporal Localisation in Sign Language Videos CVPR 2021 Audio Retrieval with Natural Language Queries INTERSPEECH 2021 TeachText: CrossModal Generalized Distillation for Text-Video Retrieval ICCV 2021 Aligning Subtitles in Sign Language Videos ICCV 2021 Mind-the-Gap! Unsupervised Domain Adaptation for Text-Video Retrieval AAAI 2021 Adaptive Cross-Modal Prototypes for Cross-Domain Visual-Language Retrieval CVPR 2021 BSL-1K: Scaling up co-articulated sign language recognition using mouthing cues ECCV 2020 Small Steps and Giant Leaps: Minimal Newton Solvers for Deep Learning ICCV 2019 Unsupervised Learning of Landmarks by Descriptor Vector Exchange ICCV 2019 Self-Supervised Learning of Geometrically Stable Features Through Probabilistic Introspection CVPR 2018 Gather-Excite: Exploiting Feature Context in Convolutional Neural Networks NIPS 2018 Seeing Voices and Hearing Faces: Cross-Modal Biometric Matching CVPR 2018 Semi-convolutional Operators for Instance Segmentation ECCV 2018 Learnable PINs: Cross-Modal Embeddings for Person Identity ECCV 2018