Ross Girshick

60 papers · 2013–2025 · 9 conferences · across top CS/AI conferences

Achievements

+18 more ↓

🌍 Conference Polyglot (9) 🐣 Hot Topic Early Bird 🧭 Keyword Pioneer 🌉 Interdisciplinary Bridge 🏃 Academic Marathon (12)

🏃 Academic Marathon (12) 🧭 Keyword Pioneer 🐣 Hot Topic Early Bird 🌟 Keyword Trendsetter Combo (4) 🏠 Conference Loyalist (34) 🔬 Deep Specialist (22) 🧬 Topic Evolution 🌱 Topic Pioneer 🏆 Keyword Champion (2) 👥 Mega-Team (50) 🤝 Dynamic Duo (24) 🗃️ Keyword Collector (200) ⚡ Prolific Year (7) 📈 Trend Setter 🔥 Unstoppable (13) 🚀 Conference Pioneer 💎 Century Club (60) ❓ The Questioner

Conferences

CVPR (34) ICCV (13) ECCV (3) ICML (3) NIPS (3) CORL (1) ICLR (1) INTERSPEECH (1) NAACL (1)

Top co-authors

Kaiming He (24) Piotr Dollár (22) Bharath Hariharan (8) Jitendra Malik (8) Trevor Darrell (7) Georgia Gkioxari (6) Alexander Kirillov (6) Laurens van der Maaten (6) Christoph Feichtenhofer (5) Saining Xie (5)

Keywords

object detection (20) convolutional neural network (14) semantic segmentation (9) transfer learning (8) instance segmentation (7) image classification (6) pose estimation (5) self-supervised learning (4) image segmentation (4) feature extraction (4) deformable part model (4) panoptic segmentation (3) unsupervised learning (3) action recognition (3) weakly supervised learning (3) visual recognition (3) representation learning (3) visual question answering (3) image captioning (2) model pretraining (2)

Papers

SAM 2: Segment Anything in Images and Videos ICLR 2025 Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models CVPR 2025 PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators CORL 2024 Segment Anything ICCV 2023 The Effectiveness of MAE Pre-Pretraining for Billion-Scale Pretraining ICCV 2023 Revisiting Weakly Supervised Pre-Training of Visual Perception Models CVPR 2022 Masked Autoencoders Are Scalable Vision Learners CVPR 2022 Exploring Plain Vision Transformer Backbones for Object Detection ECCV 2022 Boundary IoU: Improving Object-Centric Image Segmentation Evaluation CVPR 2021 A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning CVPR 2021 Fast and Accurate Model Scaling CVPR 2021 Are Labels Necessary for Neural Architecture Search? ECCV 2020 Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR INTERSPEECH 2020 A Multigrid Method for Efficiently Training Video Models CVPR 2020 PointRend: Image Segmentation As Rendering CVPR 2020 Momentum Contrast for Unsupervised Visual Representation Learning CVPR 2020 Designing Network Design Spaces CVPR 2020 TensorMask: A Foundation for Dense Object Segmentation ICCV 2019 Long-Term Feature Banks for Detailed Video Understanding CVPR 2019 LVIS: A Dataset for Large Vocabulary Instance Segmentation CVPR 2019 Panoptic Feature Pyramid Networks CVPR 2019 Exploring Randomly Wired Neural Networks for Image Recognition ICCV 2019 Panoptic Segmentation CVPR 2019 PHYRE: A New Benchmark for Physical Reasoning NIPS 2019 Rethinking ImageNet Pre-Training ICCV 2019 Learning by Asking Questions CVPR 2018 Data Distillation: Towards Omni-Supervised Learning CVPR 2018 Learning to Segment Every Thing CVPR 2018 Low-Shot Learning From Imaginary Data CVPR 2018 Non-Local Neural Networks CVPR 2018 Detecting and Recognizing Human-Object Interactions CVPR 2018 Exploring the Limits of Weakly Supervised Pretraining ECCV 2018 Mask R-CNN ICCV 2017 Aggregated Residual Transformations for Deep Neural Networks CVPR 2017 Feature Pyramid Networks for Object Detection CVPR 2017 Learning Features by Watching Objects Move CVPR 2017 CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning CVPR 2017 Focal Loss for Dense Object Detection ICCV 2017 Inferring and Executing Programs for Visual Reasoning ICCV 2017 Low-Shot Visual Recognition by Shrinking and Hallucinating Features ICCV 2017 Training Region-Based Object Detectors With Online Hard Example Mining CVPR 2016 Unsupervised Deep Embedding for Clustering Analysis ICML 2016 Visual Storytelling NAACL 2016 Seeing Through the Human Reporting Bias: Visual Classifiers From Noisy Human-Centric Labels CVPR 2016 Inside-Outside Net: Detecting Objects in Context With Skip Pooling and Recurrent Neural Networks CVPR 2016 You Only Look Once: Unified, Real-Time Object Detection CVPR 2016 Aligning 3D Models to RGB-D Images of Cluttered Scenes CVPR 2015 Hypercolumns for Object Segmentation and Fine-Grained Localization CVPR 2015 Actions and Attributes From Wholes and Parts ICCV 2015 Contextual Action Recognition With R*CNN ICCV 2015 Fast R-CNN ICCV 2015 Deformable Part Models are Convolutional Neural Networks CVPR 2015 Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks NIPS 2015 LSDA: Large Scale Detection through Adaptation NIPS 2014 On learning to localize objects with minimal supervision ICML 2014 Understanding Objects in Detail with Fine-Grained Attributes CVPR 2014 Using k-Poselets for Detecting People and Localizing Their Keypoints CVPR 2014 Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation CVPR 2014 Training Deformable Part Models with Decorrelated Features ICCV 2013 Discriminatively Activated Sparselets ICML 2013