Joao Carreira

37 papers · 2011–2025 · 7 top CS/AI conferences

Achievements

🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🗺️ Taxonomy Completionist (10) 🌉 Interdisciplinary Bridge 🌍 Conference Polyglot (7) 🤝 Dynamic Duo (17) 👑 Triple Crown 🏆 Keyword Champion 👥 Mega-Team (24) 📈 Trend Setter 🚀 Conference Pioneer 🔥 Unstoppable (13) 🗃️ Keyword Collector (160) ⚡ Prolific Year (5) ❓ The Questioner (2) 💎 Century Club (37)

Conferences

CVPR (15) ICCV (8) NIPS (6) ICML (3) ECCV (2) ICLR (2) OSDI (1)

Top co-authors

Andrew Zisserman (17) Carl Doersch (10) Viorica Patraucean (6) Jean-Baptiste Alayrac (6) Jitendra Malik (6) Yi Yang (6) Daniel Zoran (5) Mateusz Malinowski (5) Skanda Koppula (5) Joseph Heyward (4)

Keywords

video understanding (8) self-supervised learning (7) representation learning (5) depth estimation (4) object detection (4) 3d reconstruction (4) motion estimation (3) point tracking (3) transfer learning (3) video representation (3) object reconstruction (3) action recognition (2) multimodal learning (2) 3d vision (2) neural network optimization (2) transformer architecture (2) contrastive learning (2) image segmentation (2) semantic segmentation (2) scene understanding (2)

Papers

Direct Motion Models for Assessing Generated Videos · ICML 2025
Learning from Streaming Video with Orthogonal Gradients · CVPR 2025
LayerLock: Non-collapsing Representation Learning with Progressive Freezing · ICCV 2025
Learning from One Continuous Video Stream · CVPR 2024
TAPVid-3D: A Benchmark for Tracking Any Point in 3D · NIPS 2024
Moving Off-the-Grid: Scene-Grounded Video Representations · NIPS 2024
Is ImageNet worth 1 video? Learning strong image encoders from 1 long unlabelled video · ICLR 2024
Perception Test: A Diagnostic Benchmark for Multimodal Video Models · NIPS 2023
TAPIR: Tracking Any Point with Per-Frame Initialization and Temporal Refinement · ICCV 2023
Self-supervised video pretraining yields robust and more human-aligned visual representations · NIPS 2023
General-purpose, long-context autoregressive modeling with Perceiver AR · ICML 2022
Object Discovery and Representation Networks · ECCV 2022
TAP-Vid: A Benchmark for Tracking Any Point in a Video · NIPS 2022
Input-Level Inductive Biases for 3D Reconstruction · CVPR 2022
Perceiver IO: A General Architecture for Structured Inputs & Outputs · ICLR 2022
Efficient Visual Pretraining With Contrastive Detection · ICCV 2021
Perceiver: General Perception with Iterative Attention · ICML 2021
Gradient Forward-Propagation for Large-Scale Temporal Video Modelling · CVPR 2021
Visual Grounding in Video for Unsupervised Word Translation · CVPR 2020
Sideways: Depth-Parallel Training of Video Models · CVPR 2020
Controllable Attention for Structured Layered Video Decomposition · ICCV 2019
The Visual Centrifuge: Model-Free Layered Video Representations · CVPR 2019
Video Action Transformer Network · CVPR 2019
Massively Parallel Video Networks · ECCV 2018
Quo Vadis, Action Recognition? A New Model and the Kinetics Dataset · CVPR 2017
Network Requirements for Resource Disaggregation · OSDI 2016
Human Pose Estimation With Iterative Error Feedback · CVPR 2016
Amodal Completion and Size Constancy in Natural Scenes · ICCV 2015
Pose Induction for Novel Object Categories · ICCV 2015
Learning to See by Moving · ICCV 2015
Virtual View Networks for Object Reconstruction · CVPR 2015
Category-Specific Object Reconstruction From a Single Image · CVPR 2015
Iterated Second-Order Label Sensitive Pooling for 3D Human Pose Estimation · CVPR 2014
Reconstructing PASCAL VOC · CVPR 2014
Beyond Hard Negative Mining: Efficient Detector Learning via Block-Circulant Decomposition · ICCV 2013
Composite Statistical Inference for Semantic Segmentation · CVPR 2013
Probabilistic Joint Image Segmentation and Labeling · NIPS 2011