Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1351–1400 of 1723 papers

Title	Date	Tasks	Status
Beyond Point Clouds: Scene Understanding by Reasoning Geometry and Physics	Jun 1, 2013	Scene UnderstandingSemantic Segmentation	—Unverified
Semantic Augmented Reality Environment with Material-Aware Physical Interactions	Aug 3, 2017	Scene Understanding	—Unverified
Semantic-aware Transmission for Robust Point Cloud Classification	Jun 23, 2023	ClassificationDecoder	—Unverified
Semantic Dense Reconstruction with Consistent Scene Segments	Sep 30, 2021	3D Scene ReconstructionScene Understanding	—Unverified
Semantic Detection of Potential Wind-borne Debris in Construction Jobsites: Digital Twining for Hurricane Preparedness and Jobsite Safety	Oct 22, 2021	Scene Understanding	—Unverified
SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments	Mar 19, 2025	Autonomous DrivingComputational Efficiency	—Unverified
Semantic Foggy Scene Understanding with Synthetic Data	Aug 25, 2017	Image Dehazingobject-detection	—Unverified
Classification of Single-View Object Point Clouds	Dec 18, 2020	3D Object Classification6D Pose Estimation using RGB	—Unverified
Semantic Gaussians: Open-Vocabulary Scene Understanding with 3D Gaussian Splatting	Mar 22, 2024	Instance SegmentationObject Localization	—Unverified
Semantic Instance Annotation of Street Scenes by 3D to 2D Label Transfer	Nov 10, 2015	Object RecognitionScene Understanding	—Unverified
Semantic Is Enough: Only Semantic Information For NeRF Reconstruction	Mar 24, 2024	NeRFobject-detection	—Unverified
Beyond Categories: The Visual Memex Model for Reasoning About Object Relationships	Dec 1, 2009	ObjectScene Understanding	—Unverified
Semantic Motion Segmentation Using Dense CRF Formulation	Apr 24, 2015	Motion DetectionMotion Segmentation	—Unverified
Semantic Pose using Deep Networks Trained on Synthetic RGB-D	Aug 4, 2015	GPUScene Understanding	—Unverified
Benchmarking Vision Language Models for Cultural Understanding	Jul 15, 2024	BenchmarkingQuestion Answering	—Unverified
BEHAVIOR Vision Suite: Customizable Dataset Generation via Simulation	May 15, 2024	Dataset GenerationScene Understanding	—Unverified
BBBD: Bounding Box Based Detector for Occlusion Detection and Order Recovery	Apr 27, 2022	object-detectionObject Detection	—Unverified
Semantic segmentation of surgical hyperspectral images under geometric domain shifts	Mar 20, 2023	Organ SegmentationScene Segmentation	—Unverified
Axiom Learning and Belief Tracing for Transparent Decision Making in Robotics	Oct 20, 2020	Decision MakingLogical Reasoning	—Unverified
VLocNet++: Deep Multitask Learning for Semantic Visual Localization and Odometry	Apr 23, 2018	Outdoor LocalizationScene Understanding	—Unverified
VLP: Vision Language Planning for Autonomous Driving	Jan 10, 2024	Autonomous DrivingMotion Planning	—Unverified
SemanticSplat: Feed-Forward 3D Scene Understanding with Language-Aware Gaussian Fields	Jun 11, 2025	3D ReconstructionScene Understanding	—Unverified
3D Object Aided Self-Supervised Monocular Depth Estimation	Dec 4, 2022	3D Object DetectionAutonomous Driving	—Unverified
Semi-supervised and Deep learning Frameworks for Video Classification and Key-frame Identification	Mar 25, 2022	RetrievalScene Understanding	—Unverified
VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding	Dec 14, 2023	Scene UnderstandingTransfer Learning	—Unverified
Semi-Supervised Learning of Multi-Object 3D Scene Representations	Sep 28, 2020	Decision MakingObject	—Unverified
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors	Oct 8, 2020	Decision MakingScene Understanding	—Unverified
Semi-Supervised Semantic Depth Estimation using Symbiotic Transformer and NearFarMix Augmentation	Aug 28, 2023	Autonomous VehiclesDepth Estimation	—Unverified
Semi-Supervised Semantic Mapping through Label Propagation with Semantic Texture Meshes	Jun 17, 2019	Scene UnderstandingSemantic Segmentation	—Unverified
Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024	Jun 2, 2024	Scene ParsingScene Understanding	—Unverified
A Weakly-Supervised Depth Estimation Network Using Attention Mechanism	Jul 10, 2021	Depth EstimationMonocular Depth Estimation	—Unverified
A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features	Jan 17, 2025	Language ModelingLanguage Modelling	—Unverified
Sensor Adaptation for Improved Semantic Segmentation of Overhead Imagery	Nov 20, 2018	Scene UnderstandingSegmentation	—Unverified
Separated Inter/Intra-Modal Fusion Prompts for Compositional Zero-Shot Learning	Jan 22, 2025	AttributeCompositional Zero-Shot Learning	—Unverified
SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving	May 18, 2025	Autonomous DrivingAutonomous Vehicles	—Unverified
3D-MVP: 3D Multiview Pretraining for Robotic Manipulation	Jun 26, 2024	DecoderRobot Manipulation	—Unverified
3D-MVP: 3D Multiview Pretraining for Manipulation	Jan 1, 2025	DecoderRobot Manipulation	—Unverified
SGRec3D: Self-Supervised 3D Scene Graph Learning via Object-Level Scene Reconstruction	Sep 27, 2023	Graph LearningPrediction	—Unverified
AVD2: Accident Video Diffusion for Accident Video Description	Feb 20, 2025	Autonomous DrivingScene Understanding	—Unverified
Shallow2Deep: Indoor Scene Modeling by Single Image Understanding	Feb 22, 2020	3D geometryglobal-optimization	—Unverified
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding	Jun 28, 2025	3DGSInstance Segmentation	—Unverified
vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding	Mar 3, 2025	Scene UnderstandingSimultaneous Localization and Mapping	—Unverified
Shifted-Windows Transformers for the Detection of Cerebral Aneurysms in Microsurgery	Mar 16, 2023	Scene Understanding	—Unverified
A Variational Observation Model of 3D Object for Probabilistic Semantic SLAM	Sep 14, 2018	Bayesian InferenceObject	—Unverified
AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents	Jan 23, 2024	Instruction FollowingScene Understanding	—Unverified
SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation	Nov 29, 2024	Motion PlanningRAG	—Unverified
Automatic Ground Truths: Projected Image Annotations for Omnidirectional Vision	Sep 12, 2017	object-detectionObject Detection	—Unverified
Simulation-to-Real domain adaptation with teacher-student learning for endoscopic instrument segmentation	Mar 2, 2021	Domain AdaptationScene Understanding	—Unverified
Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition	Sep 18, 2019	Activity RecognitionCaption Generation	—Unverified
3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer	Jan 2, 2025	Scene Understanding	—Unverified

Show:10 25 50

← PrevPage 28 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified