Ross Girshick
60 papers · 2013–2025 · 9 conferences · across top CS/AI conferences
Achievements
Jump to papers ↓+18 more ↓ Show less ↑
๐ Conference Polyglot (9) ๐ฃ Hot Topic Early Bird ๐งญ Keyword Pioneer ๐ Interdisciplinary Bridge ๐ Academic Marathon (12)
๐
Academic Marathon
(12)
๐งญ
Keyword Pioneer
๐ฃ
Hot Topic Early Bird
๐
Keyword Trendsetter Combo
(4)
๐
Conference Loyalist
(34)
๐ฌ
Deep Specialist
(22)
๐งฌ
Topic Evolution
๐ฑ
Topic Pioneer
๐
Keyword Champion
(2)
๐ฅ
Mega-Team
(50)
๐ค
Dynamic Duo
(24)
๐๏ธ
Keyword Collector
(200)
โก
Prolific Year
(7)
๐
Trend Setter
๐ฅ
Unstoppable
(13)
๐
Conference Pioneer
๐
Century Club
(60)
โ
The Questioner
Conferences
CVPR (34)
ICCV (13)
ECCV (3)
ICML (3)
NIPS (3)
CORL (1)
ICLR (1)
INTERSPEECH (1)
NAACL (1)
Top co-authors
Keywords
object detection
(20)
convolutional neural network
(14)
semantic segmentation
(9)
transfer learning
(8)
instance segmentation
(7)
image classification
(6)
pose estimation
(5)
self-supervised learning
(4)
image segmentation
(4)
feature extraction
(4)
deformable part model
(4)
panoptic segmentation
(3)
unsupervised learning
(3)
action recognition
(3)
weakly supervised learning
(3)
visual recognition
(3)
representation learning
(3)
visual question answering
(3)
image captioning
(2)
model pretraining
(2)
Papers
SAM 2: Segment Anything in Images and Videos
ICLR 2025
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models
CVPR 2025
PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
CORL 2024
Segment Anything
ICCV 2023
The Effectiveness of MAE Pre-Pretraining for Billion-Scale Pretraining
ICCV 2023
Revisiting Weakly Supervised Pre-Training of Visual Perception Models
CVPR 2022
Masked Autoencoders Are Scalable Vision Learners
CVPR 2022
Exploring Plain Vision Transformer Backbones for Object Detection
ECCV 2022
Boundary IoU: Improving Object-Centric Image Segmentation Evaluation
CVPR 2021
A Large-Scale Study on Unsupervised Spatiotemporal Representation Learning
CVPR 2021
Fast and Accurate Model Scaling
CVPR 2021
Are Labels Necessary for Neural Architecture Search?
ECCV 2020
Large Scale Weakly and Semi-Supervised Learning for Low-Resource Video ASR
INTERSPEECH 2020
A Multigrid Method for Efficiently Training Video Models
CVPR 2020
PointRend: Image Segmentation As Rendering
CVPR 2020
Momentum Contrast for Unsupervised Visual Representation Learning
CVPR 2020
Designing Network Design Spaces
CVPR 2020
TensorMask: A Foundation for Dense Object Segmentation
ICCV 2019
Long-Term Feature Banks for Detailed Video Understanding
CVPR 2019
LVIS: A Dataset for Large Vocabulary Instance Segmentation
CVPR 2019
Panoptic Feature Pyramid Networks
CVPR 2019
Exploring Randomly Wired Neural Networks for Image Recognition
ICCV 2019
Panoptic Segmentation
CVPR 2019
PHYRE: A New Benchmark for Physical Reasoning
NIPS 2019
Rethinking ImageNet Pre-Training
ICCV 2019
Learning by Asking Questions
CVPR 2018
Data Distillation: Towards Omni-Supervised Learning
CVPR 2018
Learning to Segment Every Thing
CVPR 2018
Low-Shot Learning From Imaginary Data
CVPR 2018
Non-Local Neural Networks
CVPR 2018
Detecting and Recognizing Human-Object Interactions
CVPR 2018
Exploring the Limits of Weakly Supervised Pretraining
ECCV 2018
Mask R-CNN
ICCV 2017
Aggregated Residual Transformations for Deep Neural Networks
CVPR 2017
Feature Pyramid Networks for Object Detection
CVPR 2017
Learning Features by Watching Objects Move
CVPR 2017
CLEVR: A Diagnostic Dataset for Compositional Language and Elementary Visual Reasoning
CVPR 2017
Focal Loss for Dense Object Detection
ICCV 2017
Inferring and Executing Programs for Visual Reasoning
ICCV 2017
Low-Shot Visual Recognition by Shrinking and Hallucinating Features
ICCV 2017
Training Region-Based Object Detectors With Online Hard Example Mining
CVPR 2016
Unsupervised Deep Embedding for Clustering Analysis
ICML 2016
Visual Storytelling
NAACL 2016
Seeing Through the Human Reporting Bias: Visual Classifiers From Noisy Human-Centric Labels
CVPR 2016
Inside-Outside Net: Detecting Objects in Context With Skip Pooling and Recurrent Neural Networks
CVPR 2016
You Only Look Once: Unified, Real-Time Object Detection
CVPR 2016
Aligning 3D Models to RGB-D Images of Cluttered Scenes
CVPR 2015
Hypercolumns for Object Segmentation and Fine-Grained Localization
CVPR 2015
Actions and Attributes From Wholes and Parts
ICCV 2015
Contextual Action Recognition With R*CNN
ICCV 2015
Fast R-CNN
ICCV 2015
Deformable Part Models are Convolutional Neural Networks
CVPR 2015
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks
NIPS 2015
LSDA: Large Scale Detection through Adaptation
NIPS 2014
On learning to localize objects with minimal supervision
ICML 2014
Understanding Objects in Detail with Fine-Grained Attributes
CVPR 2014
Using k-Poselets for Detecting People and Localizing Their Keypoints
CVPR 2014
Rich Feature Hierarchies for Accurate Object Detection and Semantic Segmentation
CVPR 2014
Training Deformable Part Models with Decorrelated Features
ICCV 2013
Discriminatively Activated Sparselets
ICML 2013