Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1126–1150 of 1723 papers

Title	Date	Tasks	Status
Learning-based Relational Object Matching Across Views	May 3, 2023	Graph Neural NetworkImage Retrieval	—Unverified
Learning Category- and Instance-Aware Pixel Embedding for Fast Panoptic Segmentation	Sep 28, 2020	Instance SegmentationPanoptic Segmentation	—Unverified
Learning Densities in Feature Space for Reliable Segmentation of Indoor Scenes	Aug 1, 2019	Scene UnderstandingSemantic Segmentation	—Unverified
Learning Depth from Single Images with Deep Neural Network Embedding Focal Length	Mar 27, 2018	Depth EstimationNetwork Embedding	—Unverified
Learning Direct Optimization for Scene Understanding	Dec 18, 2018	Scene Understanding	—Unverified
Learning from Maps: Visual Common Sense for Autonomous Driving	Nov 25, 2016	Autonomous DrivingAutonomous Vehicles	—Unverified
Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation	Jan 26, 2023	FairnessLIDAR Semantic Segmentation	—Unverified
Learning Geometric-Aware Properties in 2D Representation Using Lightweight CAD Models, or Zero Real 3D Pairs	Jan 1, 2023	Scene Understanding	—Unverified
Learning in Audio-visual Context: A Review, Analysis, and New Perspective	Aug 20, 2022	audio-visual learningScene Understanding	—Unverified
SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes	Jan 1, 2024	Instance SegmentationMotion Estimation	—Unverified
SceneGPT: A Language Model for 3D Scene Understanding	Aug 13, 2024	In-Context LearningLanguage Modeling	—Unverified
Scene Graph Generation: A Comprehensive Survey	Jan 3, 2022	Graph Generationobject-detection	—Unverified
A Comprehensive Survey of Scene Graphs: Generation and Application	Mar 17, 2021	Image CaptioningQuestion Answering	—Unverified
Scene Image is Non-Mutually Exclusive - A Fuzzy Qualitative Scene Understanding	Oct 14, 2014	Binary ClassificationDecision Making	—Unverified
Scene-Independent Group Profiling in Crowd	Jun 1, 2014	Scene Understanding	—Unverified
SceneNet RGB-D: Can 5M Synthetic Images Beat Generic ImageNet Pre-Training on Indoor Segmentation?	Oct 1, 2017	16kCamera Pose Estimation	—Unverified
Scene-R1: Video-Grounded Large Language Models for 3D Scene Reasoning without 3D Annotations	Jun 21, 2025	Question AnsweringScene Understanding	—Unverified
Scene recognition based on DNN and game theory with its applications in human-robot interaction	Dec 3, 2019	Image RegistrationScene Recognition	—Unverified
SceneSplat++: A Large Dataset and Comprehensive Benchmark for Language Gaussian Splatting	Jun 10, 2025	3DGSScene Understanding	—Unverified
Scene Summarization: Clustering Scene Videos into Spatially Diverse Frames	Nov 28, 2023	ClusteringDiversity	—Unverified
SceneTAP: Scene-Coherent Typographic Adversarial Planner against Vision-Language Models in Real-World Environments	Nov 28, 2024	Adversarial TextScene Understanding	—Unverified
Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate	Dec 26, 2020	Scene Text DetectionScene Understanding	—Unverified
SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text	Apr 25, 2022	Image RetrievalRetrieval	—Unverified
Scene Understanding Enabled Semantic Communication with Open Channel Coding	Jan 24, 2025	Question AnsweringScene Understanding	—Unverified
Scene Understanding for Autonomous Manipulation with Deep Learning	Mar 23, 2019	Action UnderstandingAffordance Detection	—Unverified

Show:10 25 50

← PrevPage 46 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified