Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.
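As a rough illustration of what "spatial relationships" between objects can mean in practice, the sketch below derives simple relation triples from 2D bounding boxes. Everything in it is an assumption made for illustration: the `Box` class, the `spatial_relations` helper, the relation names `left_of` and `above`, and the example coordinates are hypothetical and are not taken from any paper or benchmark listed here. Real systems typically work with learned features, 3D geometry, or scene graphs rather than hand-written rules like these.

```python
from dataclasses import dataclass
from itertools import permutations


@dataclass(frozen=True)
class Box:
    """Hypothetical axis-aligned 2D box in image coordinates (y grows downward)."""
    label: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def spatial_relations(boxes):
    """Return (subject, relation, object) triples for every ordered pair of boxes.

    Only two illustrative relations are checked:
      left_of: the subject ends before the object starts along x
      above:   the subject ends before the object starts along y
    """
    triples = []
    for a, b in permutations(boxes, 2):
        if a.x_max <= b.x_min:
            triples.append((a.label, "left_of", b.label))
        if a.y_max <= b.y_min:
            triples.append((a.label, "above", b.label))
    return triples


# Made-up detections for a small indoor scene.
scene = [
    Box("lamp", 10, 5, 30, 40),
    Box("table", 5, 50, 60, 90),
    Box("chair", 70, 45, 100, 95),
]

relations = spatial_relations(scene)
for triple in relations:
    print(triple)
```

The resulting triples form a very small scene graph: object recognition alone would yield the three labels, while the relations add the context that distinguishes one arrangement of the same objects from another.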

Papers

Showing 1051–1075 of 1723 papers

Title	Date	Tasks	Status
DAWN: Vehicle Detection in Adverse Weather Nature Dataset	Aug 12, 2020	Autonomous Driving, Scene Understanding	Unverified
Data-Driven Scene Understanding with Adaptively Retrieved Exemplars	Feb 3, 2015	Scene Understanding, Semantic Segmentation	Unverified
OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting	Jun 9, 2025	3DGS, 3D Instance Segmentation	Unverified
OpenSU3D: Open World 3D Scene Understanding using Foundation Models	Jul 19, 2024	Scene Understanding, Spatial Reasoning	Unverified
OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding	Feb 23, 2024	Scene Understanding	Unverified
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation	Jul 18, 2024	Knowledge Distillation, Representation Learning	Unverified
Open-Vocabulary Octree-Graph for 3D Scene Understanding	Nov 25, 2024	Object, Scene Understanding	Unverified
Open-Vocabulary SAM3D: Towards Training-free Open-Vocabulary 3D Scene Understanding	May 24, 2024	Scene Understanding, Zero Shot Segmentation	Unverified
Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments	Mar 29, 2025	Navigate, Open Vocabulary Semantic Segmentation	Unverified
OW-Rep: Open World Object Detection with Instance Representation Learning	Sep 24, 2024	Novel Class Discovery, Object	Unverified
Optical flow and scene flow estimation: A survey	Feb 1, 2021	Action Recognition, Autonomous Driving	Unverified
Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction	Sep 5, 2024	3DGS, 3D Reconstruction	Unverified
DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference	Jul 16, 2021	Scene Understanding, Segmentation	Unverified
DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird's Eye View Segmentation with Occlusion Reasoning	Apr 9, 2024	BEV Segmentation, Scene Understanding	Unverified
DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion	Sep 16, 2024	Autonomous Driving, Autonomous Navigation	Unverified
Using Image Priors to Improve Scene Understanding	Oct 2, 2019	Autonomous Driving, Autonomous Vehicles	Unverified
Out of the Room: Generalizing Event-Based Dynamic Motion Segmentation for Complex Scenes	Mar 7, 2024	Motion Segmentation, Optical Flow Estimation	Unverified
CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos	Jun 3, 2024	Graph Generation, Scene Graph Generation	Unverified
V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving	Apr 30, 2025	Autonomous Driving, Decision Making	Unverified
Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation	Apr 2, 2025	3D Semantic Segmentation, Adversarial Attack	Unverified
Accelerating deep neural networks for efficient scene understanding in automotive cyber-physical systems	Jul 19, 2021	Model Compression, object-detection	Unverified
Cross-modal Learning for Multi-modal Video Categorization	Mar 7, 2020	Activity Recognition, object-detection	Unverified
Panoptic Edge Detection	Jun 3, 2019	Edge Detection, object-detection	Unverified
Cross-Dataset Collaborative Learning for Semantic Segmentation in Autonomous Driving	Mar 21, 2021	3D Semantic Segmentation, Autonomous Driving	Unverified
COUNT Forest: CO-Voting Uncertain Number of Targets Using Random Forest for Crowd Density Estimation	Dec 1, 2015	Density Estimation, Scene Understanding	Unverified

Datasets: Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation); ADE20K val; Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified