Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Title	Date	Tasks	Status
Scene Understanding for Autonomous Driving	May 11, 2021	Autonomous Driving, Scene Understanding	Unverified
Scene Understanding in Pick-and-Place Tasks: Analyzing Transformations Between Initial and Final Scenes	Sep 26, 2024	object-detection, Object Detection	Unverified
Scene Understanding Networks for Autonomous Driving based on Around View Monitoring System	May 18, 2018	3D Object Detection, Autonomous Driving	Unverified
SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding	Jan 17, 2024	3D visual grounding, Scene Understanding	Unverified
SDNet: Semantically Guided Depth Estimation Network	Jul 24, 2019	Autonomous Vehicles, Depth Estimation	Unverified
SE(3) Equivariant Ray Embeddings for Implicit Multi-View Depth Estimation	Nov 11, 2024	Data Augmentation, Decoder	Unverified
SeaDSC: A video-based unsupervised method for dynamic scene change detection in unmanned surface vehicles	Nov 20, 2023	Change Detection, Motion Planning	Unverified
SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany	Jul 19, 2022	Image Retrieval, Retrieval	Unverified
Second-order Democratic Aggregation	Aug 22, 2018	General Classification, Material Classification	Unverified
Neural Groundplans: Persistent Neural Scene Representations from a Single Image	Jul 22, 2022	Disentanglement, Instance Segmentation	Unverified
Seeing Beyond Classes: Zero-Shot Grounded Situation Recognition via Language Explainer	Apr 24, 2024	Grounded Situation Recognition, Scene Understanding	Unverified
Seeing Beyond the Scene: Enhancing Vision-Language Models with Interactional Reasoning	May 14, 2025	Relation Extraction, Scene Understanding	Unverified
Seeing the Signs: A Survey of Edge-Deployable OCR Models for Billboard Visibility Analysis	Jul 15, 2025	Marketing, Optical Character Recognition	Unverified
Seeing with Humans: Gaze-Assisted Neural Image Captioning	Aug 18, 2016	Image Captioning, Object	Unverified
Seeing With Sound: Long-range Acoustic Beamforming for Multimodal Scene Understanding	Jan 1, 2023	Autonomous Vehicles, object-detection	Unverified
Segment Any 3D Gaussians	Dec 1, 2023	Interactive Segmentation, Scene Understanding	Unverified
Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation	Mar 16, 2024	Instance Segmentation, Object	Unverified
Segment Any RGB-Thermal Model with Language-aided Distillation	May 4, 2025	Instance Segmentation, Knowledge Distillation	Unverified
Segment Anything, Even Occluded	Mar 8, 2025	Amodal Instance Segmentation, Autonomous Driving	Unverified
Segmentation Guided Attention Networks for Visual Question Answering	Jul 1, 2017	Common Sense Reasoning, Question Answering	Unverified
Segmentation-guided Domain Adaptation for Efficient Depth Completion	Oct 14, 2022	Depth Completion, Domain Adaptation	Unverified
Segment-Fusion: Hierarchical Context Fusion for Robust 3D Semantic Segmentation	Jan 1, 2022	3D Semantic Segmentation, Autonomous Driving	Unverified
Self-Supervised and Generalizable Tokenization for CLIP-Based 3D Understanding	May 24, 2025	Domain Generalization, Representation Learning	Unverified
Self-supervised Learning of Occlusion Aware Flow Guided 3D Geometry Perception with Adaptive Cross Weighted Loss from Monocular Videos	Aug 9, 2021	3D geometry, 3D Geometry Perception	Unverified
Self-supervised Learning via Cluster Distance Prediction for Operating Room Context Awareness	Jul 7, 2024	Activity Recognition, Scene Understanding	Unverified
Self-Supervised Object Detection from Egocentric Videos	Jan 1, 2023	Class-agnostic Object Detection, Object	Unverified
Self-supervised Pre-training with Masked Shape Prediction for 3D Scene Understanding	May 8, 2023	Prediction, Scene Understanding	Unverified
Self-Supervised Relative Depth Learning for Urban Scene Understanding	Dec 13, 2017	Depth Estimation, Monocular Depth Estimation	Unverified
SELMA: SEmantic Large-scale Multimodal Acquisitions in Variable Weather, Daytime and Viewpoints	Apr 20, 2022	Autonomous Driving, Scene Understanding	Unverified
Semantic Augmented Reality Environment with Material-Aware Physical Interactions	Aug 3, 2017	Scene Understanding	Unverified
Semantic-aware Transmission for Robust Point Cloud Classification	Jun 23, 2023	Classification, Decoder	Unverified
Semantic Dense Reconstruction with Consistent Scene Segments	Sep 30, 2021	3D Scene Reconstruction, Scene Understanding	Unverified
Semantic Detection of Potential Wind-borne Debris in Construction Jobsites: Digital Twining for Hurricane Preparedness and Jobsite Safety	Oct 22, 2021	Scene Understanding	Unverified
SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments	Mar 19, 2025	Autonomous Driving, Computational Efficiency	Unverified
Semantic Foggy Scene Understanding with Synthetic Data	Aug 25, 2017	Image Dehazing, object-detection	Unverified
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting	Mar 22, 2024	Instance Segmentation, Object Localization	Unverified
Semantic Instance Annotation of Street Scenes by 3D to 2D Label Transfer	Nov 10, 2015	Object Recognition, Scene Understanding	Unverified
Semantic Is Enough: Only Semantic Information For NeRF Reconstruction	Mar 24, 2024	NeRF, object-detection	Unverified
Semantic Motion Segmentation Using Dense CRF Formulation	Apr 24, 2015	Motion Detection, Motion Segmentation	Unverified
Semantic Pose using Deep Networks Trained on Synthetic RGB-D	Aug 4, 2015	GPU, Scene Understanding	Unverified
Semantic segmentation of surgical hyperspectral images under geometric domain shifts	Mar 20, 2023	Organ Segmentation, Scene Segmentation	Unverified
SemanticSplat: Feed-Forward 3D Scene Understanding with Language-Aware Gaussian Fields	Jun 11, 2025	3D Reconstruction, Scene Understanding	Unverified
Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification	Mar 25, 2022	Retrieval, Scene Understanding	Unverified
Semi-Supervised Learning of Multi-Object 3D Scene Representations	Sep 28, 2020	Decision Making, Object	Unverified
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors	Oct 8, 2020	Decision Making, Scene Understanding	Unverified
Semi-Supervised Semantic Depth Estimation using Symbiotic Transformer and NearFarMix Augmentation	Aug 28, 2023	Autonomous Vehicles, Depth Estimation	Unverified
Semi-Supervised Semantic Mapping through Label Propagation with Semantic Texture Meshes	Jun 17, 2019	Scene Understanding, Semantic Segmentation	Unverified
Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024	Jun 2, 2024	Scene Parsing, Scene Understanding	Unverified
Sensor Adaptation for Improved Semantic Segmentation of Overhead Imagery	Nov 20, 2018	Scene Understanding, Segmentation	Unverified
Separated Inter/Intra-Modal Fusion Prompts for Compositional Zero-Shot Learning	Jan 22, 2025	Attribute, Compositional Zero-Shot Learning	Unverified

Datasets: Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation); ADE20K val; Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified