Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 976–1000 of 1723 papers

Title	Date	Tasks	Status	Hype
SCIM: Simultaneous Clustering, Inference, and Mapping for Open-World Semantic Scene Understanding	Jun 21, 2022	ClusteringObject Discovery	CodeCode Available	0
A Dynamic Data Driven Approach for Explainable Scene Understanding	Jun 18, 2022	Autonomous DrivingScene Understanding	—Unverified	0
On Efficient Real-Time Semantic Segmentation: A Survey	Jun 17, 2022	GPUobject-detection	—Unverified	0
Waymo Open Dataset: Panoramic Video Panoptic Segmentation	Jun 15, 2022	3D Multi-Object TrackingAutonomous Driving	—Unverified	0
A Multi-purpose Realistic Haze Benchmark with Quantifiable Haze Levels and Ground Truth	Jun 13, 2022	Objectobject-detection	—Unverified	0
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion	Jun 10, 2022	Autonomous DrivingDomain Adaptation	CodeCode Available	0
Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields	Jun 9, 2022	Data AugmentationEdge Detection	—Unverified	0
Extracting Zero-shot Common Sense from Large Language Models for Robot 3D Scene Understanding	Jun 9, 2022	Common Sense ReasoningScene Understanding	—Unverified	0
Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D Scans	Jun 6, 2022	Scene Understanding	—Unverified	0
A Memory System of a Robot Cognitive Architecture and its Implementation in ArmarX	Jun 5, 2022	Scene Understanding	—Unverified	0
Towards Improving the Generation Quality of Autoregressive Slot VAEs	Jun 3, 2022	Image GenerationObject	CodeCode Available	0
SAMPLE-HD: Simultaneous Action and Motion Planning Learning Environment	Jun 1, 2022	Motion PlanningQuestion Answering	—Unverified	0
Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning	May 31, 2022	Common Sense ReasoningGraph Generation	CodeCode Available	1
Facing the Void: Overcoming Missing Data in Multi-View Imagery	May 21, 2022	Classificationimage-classification	CodeCode Available	0
Review on Panoramic Imaging and Its Applications in Scene Understanding	May 11, 2022	Autonomous DrivingDepth Estimation	—Unverified	0
Unsupervised Discovery and Composition of Object Light Fields	May 8, 2022	Novel View SynthesisObject	—Unverified	0
Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects	May 5, 2022	Data AugmentationNeural Rendering	—Unverified	0
RangeSeg: Range-Aware Real Time Segmentation of 3D LiDAR Point Clouds	May 2, 2022	Autonomous DrivingDecoder	—Unverified	0
BBBD: Bounding Box Based Detector for Occlusion Detection and Order Recovery	Apr 27, 2022	object-detectionObject Detection	—Unverified	0
SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text	Apr 25, 2022	Image RetrievalRetrieval	—Unverified	0
Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection	Apr 25, 2022	3D Object DetectionGraph structure learning	—Unverified	0
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?	Apr 23, 2022	Robot ManipulationScene Understanding	—Unverified	0
Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds	Apr 22, 2022	3D dense captioning3D Object Detection	CodeCode Available	1
SELMA: SEmantic Large-scale Multimodal Acquisitions in Variable Weather, Daytime and Viewpoints	Apr 20, 2022	Autonomous DrivingScene Understanding	—Unverified	0
Attention Mechanism based Cognition-level Scene Understanding	Apr 17, 2022	Question AnsweringScene Understanding	—Unverified	0

Show:10 25 50

← PrevPage 40 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified