Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 951–975 of 1723 papers

Title	Date	Tasks	Status	Hype
Semantic Segmentation-Assisted Instance Feature Fusion for Multi-Level 3D Part Instance Segmentation	Aug 9, 2022	3D Instance Segmentation3D Part Segmentation	CodeCode Available	1
TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation	Aug 3, 2022	Answer GenerationQuestion-Answer-Generation	CodeCode Available	1
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy	Aug 3, 2022	Anatomymotion prediction	—Unverified	0
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer	Jul 28, 2022	Autonomous DrivingAutonomous Vehicles	CodeCode Available	2
MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud	Jul 28, 2022	Scene Understanding	CodeCode Available	1
CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous Driving	Jul 26, 2022	3D Semantic SegmentationAutonomous Driving	CodeCode Available	1
CompNVS: Novel View Synthesis with Scene Completion	Jul 23, 2022	Novel View SynthesisScene Understanding	—Unverified	0
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models	Jul 23, 2022	Scene Understanding	CodeCode Available	1
Panoptic Scene Graph Generation	Jul 22, 2022	BenchmarkingPanoptic Scene Graph Generation	CodeCode Available	2
Divide and Conquer: 3D Point Cloud Instance Segmentation With Point-Wise Binarization	Jul 22, 2022	3D Instance Segmentation3D Object Detection	CodeCode Available	1
Neural Groundplans: Persistent Neural Scene Representations from a Single Image	Jul 22, 2022	DisentanglementInstance Segmentation	—Unverified	0
SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany	Jul 19, 2022	Image RetrievalRetrieval	—Unverified	0
Egocentric Scene Understanding via Multimodal Spatial Rectifier	Jul 14, 2022	Scene UnderstandingSurface Normal Estimation	CodeCode Available	1
Adversarial Attacks on Monocular Pose Estimation	Jul 14, 2022	Depth EstimationMonocular Depth Estimation	CodeCode Available	0
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments	Jul 10, 2022	Instance SegmentationPanoptic Segmentation	CodeCode Available	1
BlindSpotNet: Seeing Where We Cannot See	Jul 8, 2022	Depth EstimationMonocular Depth Estimation	—Unverified	0
MCTS with Refinement for Proposals Selection Games in Scene Understanding	Jul 7, 2022	Scene Understanding	CodeCode Available	1
Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases	Jul 5, 2022	ObjectRepresentation Learning	—Unverified	0
Distance Matters in Human-Object Interaction Detection	Jul 5, 2022	Human-Object Interaction DetectionObject	CodeCode Available	0
Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation	Jul 5, 2022	Dialogue GenerationDialogue Understanding	—Unverified	0
Uncertainty-aware Panoptic Segmentation	Jun 29, 2022	Panoptic SegmentationScene Understanding	CodeCode Available	1
MGNet: Monocular Geometric Scene Understanding for Autonomous Driving	Jun 27, 2022	Autonomous DrivingDepth Estimation	CodeCode Available	1
IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments	Jun 27, 2022	Autonomous VehiclesScene Segmentation	CodeCode Available	1
Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings	Jun 24, 2022	Scene UnderstandingSemantic Segmentation	CodeCode Available	0
Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning	Jun 21, 2022	Contrastive LearningDomain Generalization	CodeCode Available	1

Show:10 25 50

← PrevPage 39 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified