Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 951–1000 of 1723 papers

Title	Date	Tasks	Status	Hype
Semantic Segmentation-Assisted Instance Feature Fusion for Multi-Level 3D Part Instance Segmentation	Aug 9, 2022	3D Instance Segmentation3D Part Segmentation	CodeCode Available	1
TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation	Aug 3, 2022	Answer GenerationQuestion-Answer-Generation	CodeCode Available	1
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy	Aug 3, 2022	Anatomymotion prediction	—Unverified	0
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer	Jul 28, 2022	Autonomous DrivingAutonomous Vehicles	CodeCode Available	2
MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud	Jul 28, 2022	Scene Understanding	CodeCode Available	1
CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous Driving	Jul 26, 2022	3D Semantic SegmentationAutonomous Driving	CodeCode Available	1
CompNVS: Novel View Synthesis with Scene Completion	Jul 23, 2022	Novel View SynthesisScene Understanding	—Unverified	0
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models	Jul 23, 2022	Scene Understanding	CodeCode Available	1
Panoptic Scene Graph Generation	Jul 22, 2022	BenchmarkingPanoptic Scene Graph Generation	CodeCode Available	2
Divide and Conquer: 3D Point Cloud Instance Segmentation With Point-Wise Binarization	Jul 22, 2022	3D Instance Segmentation3D Object Detection	CodeCode Available	1
Neural Groundplans: Persistent Neural Scene Representations from a Single Image	Jul 22, 2022	DisentanglementInstance Segmentation	—Unverified	0
SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany	Jul 19, 2022	Image RetrievalRetrieval	—Unverified	0
Egocentric Scene Understanding via Multimodal Spatial Rectifier	Jul 14, 2022	Scene UnderstandingSurface Normal Estimation	CodeCode Available	1
Adversarial Attacks on Monocular Pose Estimation	Jul 14, 2022	Depth EstimationMonocular Depth Estimation	CodeCode Available	0
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments	Jul 10, 2022	Instance SegmentationPanoptic Segmentation	CodeCode Available	1
BlindSpotNet: Seeing Where We Cannot See	Jul 8, 2022	Depth EstimationMonocular Depth Estimation	—Unverified	0
MCTS with Refinement for Proposals Selection Games in Scene Understanding	Jul 7, 2022	Scene Understanding	CodeCode Available	1
Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases	Jul 5, 2022	ObjectRepresentation Learning	—Unverified	0
Distance Matters in Human-Object Interaction Detection	Jul 5, 2022	Human-Object Interaction DetectionObject	CodeCode Available	0
Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation	Jul 5, 2022	Dialogue GenerationDialogue Understanding	—Unverified	0
Uncertainty-aware Panoptic Segmentation	Jun 29, 2022	Panoptic SegmentationScene Understanding	CodeCode Available	1
MGNet: Monocular Geometric Scene Understanding for Autonomous Driving	Jun 27, 2022	Autonomous DrivingDepth Estimation	CodeCode Available	1
IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments	Jun 27, 2022	Autonomous VehiclesScene Segmentation	CodeCode Available	1
Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings	Jun 24, 2022	Scene UnderstandingSemantic Segmentation	CodeCode Available	0
Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning	Jun 21, 2022	Contrastive LearningDomain Generalization	CodeCode Available	1
SCIM: Simultaneous Clustering, Inference, and Mapping for Open-World Semantic Scene Understanding	Jun 21, 2022	ClusteringObject Discovery	CodeCode Available	0
A Dynamic Data Driven Approach for Explainable Scene Understanding	Jun 18, 2022	Autonomous DrivingScene Understanding	—Unverified	0
On Efficient Real-Time Semantic Segmentation: A Survey	Jun 17, 2022	GPUobject-detection	—Unverified	0
Waymo Open Dataset: Panoramic Video Panoptic Segmentation	Jun 15, 2022	3D Multi-Object TrackingAutonomous Driving	—Unverified	0
A Multi-purpose Realistic Haze Benchmark with Quantifiable Haze Levels and Ground Truth	Jun 13, 2022	Objectobject-detection	—Unverified	0
Unsupervised Foggy Scene Understanding via Self Spatial-Temporal Label Diffusion	Jun 10, 2022	Autonomous DrivingDomain Adaptation	CodeCode Available	0
Beyond RGB: Scene-Property Synthesis with Neural Radiance Fields	Jun 9, 2022	Data AugmentationEdge Detection	—Unverified	0
Extracting Zero-shot Common Sense from Large Language Models for Robot 3D Scene Understanding	Jun 9, 2022	Common Sense ReasoningScene Understanding	—Unverified	0
Scan2Part: Fine-grained and Hierarchical Part-level Understanding of Real-World 3D Scans	Jun 6, 2022	Scene Understanding	—Unverified	0
A Memory System of a Robot Cognitive Architecture and its Implementation in ArmarX	Jun 5, 2022	Scene Understanding	—Unverified	0
Towards Improving the Generation Quality of Autoregressive Slot VAEs	Jun 3, 2022	Image GenerationObject	CodeCode Available	0
SAMPLE-HD: Simultaneous Action and Motion Planning Learning Environment	Jun 1, 2022	Motion PlanningQuestion Answering	—Unverified	0
Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning	May 31, 2022	Common Sense ReasoningGraph Generation	CodeCode Available	1
Facing the Void: Overcoming Missing Data in Multi-View Imagery	May 21, 2022	Classificationimage-classification	CodeCode Available	0
Review on Panoramic Imaging and Its Applications in Scene Understanding	May 11, 2022	Autonomous DrivingDepth Estimation	—Unverified	0
Unsupervised Discovery and Composition of Object Light Fields	May 8, 2022	Novel View SynthesisObject	—Unverified	0
Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects	May 5, 2022	Data AugmentationNeural Rendering	—Unverified	0
RangeSeg: Range-Aware Real Time Segmentation of 3D LiDAR Point Clouds	May 2, 2022	Autonomous DrivingDecoder	—Unverified	0
BBBD: Bounding Box Based Detector for Occlusion Detection and Order Recovery	Apr 27, 2022	object-detectionObject Detection	—Unverified	0
SceneTrilogy: On Human Scene-Sketch and its Complementarity with Photo and Text	Apr 25, 2022	Image RetrievalRetrieval	—Unverified	0
Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection	Apr 25, 2022	3D Object DetectionGraph structure learning	—Unverified	0
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?	Apr 23, 2022	Robot ManipulationScene Understanding	—Unverified	0
Spatiality-guided Transformer for 3D Dense Captioning on Point Clouds	Apr 22, 2022	3D dense captioning3D Object Detection	CodeCode Available	1
SELMA: SEmantic Large-scale Multimodal Acquisitions in Variable Weather, Daytime and Viewpoints	Apr 20, 2022	Autonomous DrivingScene Understanding	—Unverified	0
Attention Mechanism based Cognition-level Scene Understanding	Apr 17, 2022	Question AnsweringScene Understanding	—Unverified	0

Show:10 25 50

← PrevPage 20 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified