Research Explorer
Scene Understanding
1887 directly classified papers
Papers per year
2006: 14
2007: 12
2008: 12
2009: 20
2010: 14
2011: 13
2012: 13
2013: 108
2014: 43
2015: 83
2016: 42
2017: 61
2018: 58
2019: 138
2020: 128
2021: 197
2022: 132
2023: 222
2024: 243
2025: 287
2026: 47
Papers
TopViewRS: Vision-Language Models as Top-View Spatial Reasoners
EMNLP 2024
Unveiling the mystery of visual attributes of concrete and abstract concepts: Variability, nearest neighbors, and challenging categories
EMNLP 2024
SphereCraft: A Dataset for Spherical Keypoint Detection, Matching and Camera Pose Estimation
WACV 2024
RSMPNet: Relationship Guided Semantic Map Prediction
WACV 2024
Analyzing the Domain Shift Immunity of Deep Homography Estimation
WACV 2024
Angle Robustness Unmanned Aerial Vehicle Navigation in GNSS-Denied Scenarios
AAAI 2024
Beyond RGB: A Real World Dataset for Multispectral Imaging in Mobile Devices
WACV 2024
Sparse Convolutional Networks for Surface Reconstruction From Noisy Point Clouds
WACV 2024
CommVQA: Situating Visual Question Answering in Communicative Contexts
EMNLP 2024
Understanding Bias in Large-Scale Visual Datasets
NIPS 2024
CPN: Complementary Proposal Network for Unconstrained Text Detection
AAAI 2024
BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping
WACV 2024
Sound3DVDet: 3D Sound Source Detection Using Multiview Microphone Array and RGB Images
WACV 2024
Semi-Supervised Scene Change Detection by Distillation From Feature-Metric Alignment
WACV 2024
“Image, Tell me your story!” Predicting the original meta-context of visual misinformation
EMNLP 2024
TSA2: Temporal Segment Adaptation and Aggregation for Video Harmonization
WACV 2024
NITEC: Versatile Hand-Annotated Eye Contact Dataset for Ego-Vision Interaction
WACV 2024
A Theory of Joint Light and Heat Transport for Lambertian Scenes
CVPR 2024
Rank2Tell: A Multimodal Driving Dataset for Joint Importance Ranking and Reasoning
WACV 2024
RobustCLEVR: A Benchmark and Framework for Evaluating Robustness in Object-Centric Learning
WACV 2024
Can CLIP Help Sound Source Localization?
WACV 2024
Self-Supervised Relation Alignment for Scene Graph Generation
WACV 2024
FocusTune: Tuning Visual Localization Through Focus-Guided Sampling
WACV 2024
No More Ambiguity in 360° Room Layout via Bi-Layout Estimation
CVPR 2024
SpatialPIN: Enhancing Spatial Reasoning Capabilities of Vision-Language Models through Prompting and Interacting 3D Priors
NIPS 2024