Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 221–230 of 1723 papers

Title	Date	Tasks	Status	Hype
sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views	Feb 6, 2025	3D Reconstruction3D Scene Reconstruction	—Unverified	0
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation	Feb 4, 2025	Contrastive LearningDecoder	—Unverified	0
Event-aided Semantic Scene Completion	Feb 4, 2025	Autonomous DrivingScene Understanding	CodeCode Available	1
AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis	Feb 3, 2025	Object CountingScene Understanding	—Unverified	0
Integrating LMM Planners and 3D Skill Policies for Generalizable Manipulation	Jan 30, 2025	MemorizationScene Understanding	—Unverified	0
Efficient Interactive 3D Multi-Object Removal	Jan 29, 2025	ObjectScene Understanding	—Unverified	0
Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding	Jan 28, 2025	object-detectionObject Detection	—Unverified	0
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding	Jan 27, 2025	BenchmarkingCommon Sense Reasoning	—Unverified	0
Unveiling the Potential of iMarkers: Invisible Fiducial Markers for Advanced Robotics	Jan 26, 2025	Object RecognitionScene Understanding	—Unverified	0
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation	Jan 24, 2025	Autonomous DrivingLanguage Modeling	CodeCode Available	3

Show:10 25 50

← PrevPage 23 of 173Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified