Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1376–1400 of 1723 papers

Title	Date	Tasks	Status
Semi-Supervised Learning of Multi-Object 3D Scene Representations	Sep 28, 2020	Decision Making, Object	Unverified
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors	Oct 8, 2020	Decision Making, Scene Understanding	Unverified
Semi-Supervised Semantic Depth Estimation using Symbiotic Transformer and NearFarMix Augmentation	Aug 28, 2023	Autonomous Vehicles, Depth Estimation	Unverified
Semi-Supervised Semantic Mapping through Label Propagation with Semantic Texture Meshes	Jun 17, 2019	Scene Understanding, Semantic Segmentation	Unverified
Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024	Jun 2, 2024	Scene Parsing, Scene Understanding	Unverified
A Weakly-Supervised Depth Estimation Network Using Attention Mechanism	Jul 10, 2021	Depth Estimation, Monocular Depth Estimation	Unverified
A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features	Jan 17, 2025	Language Modeling, Language Modelling	Unverified
Sensor Adaptation for Improved Semantic Segmentation of Overhead Imagery	Nov 20, 2018	Scene Understanding, Segmentation	Unverified
Separated Inter/Intra-Modal Fusion Prompts for Compositional Zero-Shot Learning	Jan 22, 2025	Attribute, Compositional Zero-Shot Learning	Unverified
SEPT: Standard-Definition Map Enhanced Scene Perception and Topology Reasoning for Autonomous Driving	May 18, 2025	Autonomous Driving, Autonomous Vehicles	Unverified
3D-MVP: 3D Multiview Pretraining for Robotic Manipulation	Jun 26, 2024	Decoder, Robot Manipulation	Unverified
3D-MVP: 3D Multiview Pretraining for Manipulation	Jan 1, 2025	Decoder, Robot Manipulation	Unverified
SGRec3D: Self-Supervised 3D Scene Graph Learning via Object-Level Scene Reconstruction	Sep 27, 2023	Graph Learning, Prediction	Unverified
AVD2: Accident Video Diffusion for Accident Video Description	Feb 20, 2025	Autonomous Driving, Scene Understanding	Unverified
Shallow2Deep: Indoor Scene Modeling by Single Image Understanding	Feb 22, 2020	3D geometry, global-optimization	Unverified
VoteSplat: Hough Voting Gaussian Splatting for 3D Scene Understanding	Jun 28, 2025	3DGS, Instance Segmentation	Unverified
vS-Graphs: Integrating Visual SLAM and Situational Graphs through Multi-level Scene Understanding	Mar 3, 2025	Scene Understanding, Simultaneous Localization and Mapping	Unverified
Shifted-Windows Transformers for the Detection of Cerebral Aneurysms in Microsurgery	Mar 16, 2023	Scene Understanding	Unverified
A Variational Observation Model of 3D Object for Probabilistic Semantic SLAM	Sep 14, 2018	Bayesian Inference, Object	Unverified
AutoRT: Embodied Foundation Models for Large Scale Orchestration of Robotic Agents	Jan 23, 2024	Instruction Following, Scene Understanding	Unverified
SIMS: Simulating Stylized Human-Scene Interactions with Retrieval-Augmented Script Generation	Nov 29, 2024	Motion Planning, RAG	Unverified
Automatic Ground Truths: Projected Image Annotations for Omnidirectional Vision	Sep 12, 2017	object-detection, Object Detection	Unverified
Simulation-to-Real domain adaptation with teacher-student learning for endoscopic instrument segmentation	Mar 2, 2021	Domain Adaptation, Scene Understanding	Unverified
Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition	Sep 18, 2019	Activity Recognition, Caption Generation	Unverified
3D-LLaVA: Towards Generalist 3D LMMs with Omni Superpoint Transformer	Jan 2, 2025	Scene Understanding	Unverified

Datasets: Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation); ADE20K val; Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)
#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val
#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)
#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified