Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1476–1500 of 1723 papers

Title	Date	Tasks	Status
Part-Whole Relational Fusion Towards Multi-Modal Scene Understanding	Oct 19, 2024	Autonomous Drivingobject-detection	CodeCode Available
Parsing Natural Scenes and Natural Language with Recursive Neural Networks	Jun 1, 2011	General ClassificationScene Classification	CodeCode Available
Parsing Geometry Using Structure-Aware Shape Templates	Aug 3, 2018	ObjectObject Recognition	CodeCode Available
Parallel Neural Computing for Scene Understanding from LiDAR Perception in Autonomous Racing	Dec 24, 2024	Autonomous DrivingAutonomous Racing	CodeCode Available
Sequential Cross Attention Based Multi-task Learning	Sep 6, 2022	Multi-Task LearningScene Understanding	CodeCode Available
PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video	Jan 1, 2024	3D Panoptic Segmentation3D Reconstruction	CodeCode Available
Panoramic Depth Estimation via Supervised and Unsupervised Learning in Indoor Scenes	Aug 18, 2021	Camera CalibrationDepth Estimation	CodeCode Available
P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation	Oct 23, 2023	Autonomous DrivingDecoder	CodeCode Available
SGDraw: Scene Graph Drawing Interface Using Object-Oriented Representation	Nov 30, 2022	Graph GenerationImage Generation	CodeCode Available
Pose-aware Multi-level Feature Network for Human Object Interaction Detection	Sep 18, 2019	Human-Object Interaction DetectionObject	CodeCode Available
OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies	Dec 31, 2024	3DGS3D Semantic Segmentation	CodeCode Available
Dilated Residual Networks	May 28, 2017	ClassificationGeneral Classification	CodeCode Available
Incorporating Luminance, Depth and Color Information by a Fusion-based Network for Semantic Segmentation	Sep 24, 2018	Autonomous DrivingReal-Time Semantic Segmentation	CodeCode Available
OVeNet: Offset Vector Network for Semantic Segmentation	Mar 25, 2023	Optical Character Recognition (OCR)Scene Understanding	CodeCode Available
Unsupervised Domain Adaptation using Generative Adversarial Networks for Semantic Segmentation of Aerial Images	May 8, 2019	Domain AdaptationManagement	CodeCode Available
Predicting Deeper into the Future of Semantic Segmentation	Mar 22, 2017	AttributeAutonomous Driving	CodeCode Available
Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data	Jul 14, 2024	3D Object Detection3D Semantic Segmentation	CodeCode Available
Shape Anchor Guided Holistic Indoor Scene Understanding	Sep 20, 2023	3D Object Detectionobject-detection	CodeCode Available
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion	Jun 10, 2022	Autonomous DrivingDomain Adaptation	CodeCode Available
Improving Object Detection for Time-Lapse Imagery Using Temporal Features in Wildlife Monitoring	Dec 20, 2024	Objectobject-detection	CodeCode Available
OST-Bench: Evaluating the Capabilities of MLLMs in Online Spatio-temporal Scene Understanding	Jul 10, 2025	Scene UnderstandingSpatial Reasoning	CodeCode Available
OpenOcc: Open Vocabulary 3D Scene Reconstruction via Occupancy Representation	Mar 18, 2024	3D Reconstruction3D Scene Reconstruction	CodeCode Available
Impact of Ground Truth Annotation Quality on Performance of Semantic Image Segmentation of Traffic Conditions	Dec 30, 2018	Autonomous DrivingImage Segmentation	CodeCode Available
On the Structures of Representation for the Robustness of Semantic Segmentation to Input Corruption	Sep 2, 2020	Scene UnderstandingSegmentation	CodeCode Available
Instance-Warp: Saliency Guided Image Warping for Unsupervised Domain Adaptation	Mar 19, 2024	Domain AdaptationObject	CodeCode Available

Show:10 25 50

← PrevPage 60 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified