Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 426–450 of 1723 papers

Title	Date	Tasks	Status	Hype
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation	Jul 9, 2020	Scene Understanding	CodeCode Available	1
Learning and Reasoning with the Graph Structure Representation in Robotic Surgery	Jul 7, 2020	Edge ClassificationGraph Generation	CodeCode Available	1
A Survey on Deep Learning for Localization and Mapping: Towards the Age of Spatial Machine Intelligence	Jun 22, 2020	Deep LearningScene Understanding	CodeCode Available	1
Learning Visual Commonsense for Robust Scene Graph Generation	Jun 17, 2020	Graph GenerationScene Graph Generation	CodeCode Available	1
Benchmarking Unsupervised Object Representations for Video Sequences	Jun 12, 2020	BenchmarkingClustering	CodeCode Available	1
0-MMS: Zero-Shot Multi-Motion Segmentation With A Monocular Event Camera	Jun 11, 2020	Motion CompensationMotion Segmentation	CodeCode Available	1
IDA-3D: Instance-Depth-Aware 3D Object Detection From Stereo Vision for Autonomous Driving	Jun 1, 2020	3D Object DetectionAutonomous Driving	CodeCode Available	1
VTGNet: A Vision-based Trajectory Generation Network for Autonomous Vehicles in Urban Environments	Apr 27, 2020	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1
Cityscapes-Panoptic-Parts and PASCAL-Panoptic-Parts datasets for Scene Understanding	Apr 16, 2020	Human Part SegmentationPanoptic Segmentation	CodeCode Available	1
Self-Supervised Scene De-occlusion	Apr 6, 2020	Image ManipulationScene Understanding	CodeCode Available	1
Context Prior for Scene Segmentation	Apr 3, 2020	Scene SegmentationScene Understanding	CodeCode Available	1
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation	Apr 3, 2020	3D Instance SegmentationClustering	CodeCode Available	1
Semantic Segmentation of Underwater Imagery: Dataset and Benchmark	Apr 2, 2020	Computational EfficiencyDecoder	CodeCode Available	1
Occlusion-Aware Depth Estimation with Adaptive Normal Constraints	Apr 2, 2020	3D ReconstructionDepth Estimation	CodeCode Available	1
Distilled Semantics for Comprehensive Scene Understanding from Videos	Mar 31, 2020	Depth EstimationKnowledge Distillation	CodeCode Available	1
Learning Human-Object Interaction Detection using Interaction Points	Mar 31, 2020	Human-Object Interaction DetectionKeypoint Detection	CodeCode Available	1
LayoutMP3D: Layout Annotation of Matterport3D	Mar 30, 2020	Scene Understanding	CodeCode Available	1
Multi-Path Region Mining For Weakly Supervised 3D Semantic Segmentation on Point Clouds	Mar 29, 2020	3D Semantic SegmentationPoint Cloud Segmentation	CodeCode Available	1
CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks	Mar 28, 2020	3D Medical Imaging SegmentationAction Recognition	CodeCode Available	1
SaccadeNet: A Fast and Accurate Object Detector	Mar 26, 2020	Objectobject-detection	CodeCode Available	1
Who2com: Collaborative Perception via Learnable Handshake Communication	Mar 21, 2020	Multi-agent Reinforcement LearningReinforcement Learning	CodeCode Available	1
Explainable Object-induced Action Decision for Autonomous Vehicles	Mar 20, 2020	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1
Toronto-3D: A Large-scale Mobile LiDAR Dataset for Semantic Segmentation of Urban Roadways	Mar 18, 2020	3D Semantic SegmentationAutonomous Driving	CodeCode Available	1
Scene Completeness-Aware Lidar Depth Completion for Driving Scenario	Mar 15, 2020	Depth CompletionRGBD Semantic Segmentation	CodeCode Available	1
Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image	Feb 27, 2020	3D Object Detection3D Shape Reconstruction	CodeCode Available	1

Show:10 25 50

← PrevPage 18 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified