Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1401–1425 of 1723 papers

Title	Date	Tasks	Status
Single Image 3D Without a Single 3D Image	Dec 1, 2015	Scene Understanding	—Unverified
Single Image Depth Estimation: An Overview	Apr 13, 2021	Deep LearningDepth Estimation	—Unverified
Single-Input Multi-Output Model Merging: Leveraging Foundation Models for Dense Multi-Task Learning	Apr 15, 2025	Multi-Task LearningScene Understanding	—Unverified
3D-Grounded Vision-Language Framework for Robotic Task Planning: Automated Prompt Synthesis and Supervised Reasoning	Feb 13, 2025	Code GenerationScene Understanding	—Unverified
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy	Aug 3, 2022	Anatomymotion prediction	—Unverified
You Only Scan Once: A Dynamic Scene Reconstruction Pipeline for 6-DoF Robotic Grasping of Novel Objects	Apr 4, 2024	ObjectPose Tracking	—Unverified
Single-view 3D Scene Reconstruction with High-fidelity Shape and Texture	Nov 1, 2023	3D Object Reconstruction3D Reconstruction	—Unverified
Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation through Hybrid Vision	Jul 23, 2024	2D Semantic Segmentation3D Semantic Segmentation	—Unverified
Waymo Open Dataset: Panoramic Video Panoptic Segmentation	Jun 15, 2022	3D Multi-Object TrackingAutonomous Driving	—Unverified
SkyScenes: A Synthetic Dataset for Aerial Scene Understanding	Dec 11, 2023	DiversityScene Understanding	—Unverified
SLGaussian: Fast Language Gaussian Splatting in Sparse Views	Dec 11, 2024	3DGSAutonomous Navigation	—Unverified
Weakly Supervised 3D Instance Segmentation without Instance-level Annotations	Aug 3, 2023	3D Instance SegmentationInstance Segmentation	—Unverified
Small Drone Field Experiment: Data Collection & Processing	Nov 29, 2017	3D ReconstructionScene Understanding	—Unverified
Small-Variance Nonparametric Clustering on the Hypersphere	Jul 21, 2016	ClusteringNonparametric Clustering	—Unverified
Smart Infrastructure: A Research Junction	Jul 12, 2023	Scene UnderstandingSynthetic Data Generation	—Unverified
Audiovisual Highlight Detection in Videos	Feb 11, 2021	Highlight DetectionObject Recognition	—Unverified
SNeL: A Structured Neuro-Symbolic Language for Entity-Based Multimodal Scene Understanding	Jun 9, 2023	Scene Understanding	—Unverified
Audio-visual Event Localization on Portrait Mode Short Videos	Apr 9, 2025	audio-visual event localizationScene Understanding	—Unverified
Social Scene Understanding: End-to-End Multi-Person Action Localization and Collective Activity Recognition	Nov 28, 2016	Action LocalizationAction Recognition	—Unverified
3D Gated Recurrent Fusion for Semantic Scene Completion	Feb 17, 2020	3D Semantic Scene CompletionScene Understanding	—Unverified
Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications	Feb 8, 2019	Deep LearningHigh-Level Synthesis	—Unverified
SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from Monocular images	Jan 19, 2021	Depth EstimationMonocular Depth Estimation	—Unverified
So you think you can track?	Sep 13, 2023	BenchmarkingObject	—Unverified
SparseLGS: Sparse View Language Embedded Gaussian Splatting	Dec 3, 2024	Scene Understanding	—Unverified
SPARTUN3D: Situated Spatial Understanding of 3D World in Large Language Models	Oct 4, 2024	Scene UnderstandingSpatial Reasoning	—Unverified

Show:10 25 50

← PrevPage 57 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified