Research Explorer
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Papers
Trends
Conferences
Explore
Authors
Topics
Keywords
Achievements
About
Methodology
← Analysis
Computer Vision
›
Analysis
›
Scene Understanding
1887 directly classified papers
Papers per year
2006: 14
2007: 12
2008: 12
2009: 20
2010: 14
2011: 13
2012: 13
2013: 108
2014: 43
2015: 83
2016: 42
2017: 61
2018: 58
2019: 138
2020: 128
2021: 197
2022: 132
2023: 222
2024: 243
2025: 287
2026: 47
Papers
CAESAR: An Embodied Simulator for Generating Multimodal Referring Expression Datasets
NIPS 2022
A Survey on Machine Learning Approaches for Modelling Intuitive Physics
IJCAI 2022
QLEVR: A Diagnostic Dataset for Quantificational Language and Elementary Visual Reasoning
NAACL 2022
SVTR: Scene Text Recognition with a Single Visual Model
IJCAI 2022
Towards Better Semantic Understanding of Mobile Interfaces
COLING 2022
CrossLocate: Cross-Modal Large-Scale Visual Geo-Localization in Natural Environments Using Rendered Modalities
WACV 2022
HL-Net: Heterophily Learning Network for Scene Graph Generation
CVPR 2022
ELSR: Efficient Line Segment Reconstruction With Planes and Points Guidance
CVPR 2022
Find Someone Who: Visual Commonsense Understanding in Human-Centric Grounding
EMNLP 2022
Text2Pos: Text-to-Point-Cloud Cross-Modal Localization
CVPR 2022
Stability-Driven Contact Reconstruction From Monocular Color Images
CVPR 2022
SGTR: End-to-End Scene Graph Generation With Transformer
CVPR 2022
Efficient Large-Scale Localization by Global Instance Recognition
CVPR 2022
Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing
CVPR 2022
Weakly but Deeply Supervised Occlusion-Reasoned Parametric Road Layouts
CVPR 2022
There’s a Time and Place for Reasoning Beyond the Image
ACL 2022
SCONE: Surface Coverage Optimization in Unknown Environments by Volumetric Integration
NIPS 2022
Flexible Visual Grounding
ACL 2022
Vision Transformers provably learn spatial structure
NIPS 2022
Bridging the Gap Between Learning in Discrete and Continuous Environments for Vision-and-Language Navigation
CVPR 2022
Breaking Bad: A Dataset for Geometric Fracture and Reassembly
NIPS 2022
Fine-Grained Predicates Learning for Scene Graph Generation
CVPR 2022
Visual Commonsense in Pretrained Unimodal and Multimodal Models
NAACL 2022
Amodal Panoptic Segmentation
CVPR 2022
RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition
CVPR 2022
<
1
…
33
34
35
…
76
>