Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers



Title	Date	Tasks	Status
Scene Understanding for Autonomous Driving	May 11, 2021	Autonomous Driving, Scene Understanding	Unverified
Scene Understanding in Pick-and-Place Tasks: Analyzing Transformations Between Initial and Final Scenes	Sep 26, 2024	object-detection, Object Detection	Unverified
Scene Understanding Networks for Autonomous Driving based on Around View Monitoring System	May 18, 2018	3D Object Detection, Autonomous Driving	Unverified
SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding	Jan 17, 2024	3D visual grounding, Scene Understanding	Unverified
SDNet: Semantically Guided Depth Estimation Network	Jul 24, 2019	Autonomous Vehicles, Depth Estimation	Unverified
SE(3) Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation	Nov 11, 2024	Data Augmentation, Decoder	Unverified
SeaDSC: A video-based unsupervised method for dynamic scene change detection in unmanned surface vehicles	Nov 20, 2023	Change Detection, Motion Planning	Unverified
SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany	Jul 19, 2022	Image Retrieval, Retrieval	Unverified
Second-order Democratic Aggregation	Aug 22, 2018	General Classification, Material Classification	Unverified
Neural Groundplans: Persistent Neural Scene Representations from a Single Image	Jul 22, 2022	Disentanglement, Instance Segmentation	Unverified
Seeing Beyond Classes: Zero-Shot Grounded Situation Recognition via Language Explainer	Apr 24, 2024	Grounded Situation Recognition, Scene Understanding	Unverified
Seeing Beyond the Scene: Enhancing Vision-Language Models with Interactional Reasoning	May 14, 2025	Relation Extraction, Scene Understanding	Unverified
Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis	Jul 15, 2025	Marketing, Optical Character Recognition	Unverified
Seeing with Humans: Gaze-Assisted Neural Image Captioning	Aug 18, 2016	Image Captioning, Object	Unverified
Seeing With Sound: Long-range Acoustic Beamforming for Multimodal Scene Understanding	Jan 1, 2023	Autonomous Vehicles, object-detection	Unverified
Segment Any 3D Gaussians	Dec 1, 2023	Interactive Segmentation, Scene Understanding	Unverified
Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation	Mar 16, 2024	Instance Segmentation, Object	Unverified
Segment Any RGB-Thermal Model with Language-aided Distillation	May 4, 2025	Instance Segmentation, Knowledge Distillation	Unverified
Segment Anything, Even Occluded	Mar 8, 2025	Amodal Instance Segmentation, Autonomous Driving	Unverified
Segmentation Guided Attention Networks for Visual Question Answering	Jul 1, 2017	Common Sense Reasoning, Question Answering	Unverified
Segmentation-guided Domain Adaptation for Efficient Depth Completion	Oct 14, 2022	Depth Completion, Domain Adaptation	Unverified
Segment-Fusion: Hierarchical Context Fusion for Robust 3D Semantic Segmentation	Jan 1, 2022	3D Semantic Segmentation, Autonomous Driving	Unverified
Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding	May 24, 2025	Domain Generalization, Representation Learning	Unverified
Self-supervised Learning of Occlusion Aware Flow Guided 3D Geometry Perception with Adaptive Cross Weighted Loss from Monocular Videos	Aug 9, 2021	3D geometry, 3D Geometry Perception	Unverified
Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness	Jul 7, 2024	Activity Recognition, Scene Understanding	Unverified



Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified