Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 401–450 of 1723 papers

Title	Date	Tasks	Status	Hype
Robust Neural Routing Through Space Partitions for Camera Relocalization in Dynamic Indoor Environments	Dec 8, 2020	Camera RelocalizationRobot Navigation	CodeCode Available	1
Understanding Bird's-Eye View of Road Semantics using an Onboard Camera	Dec 5, 2020	Autonomous NavigationAutonomous Vehicles	CodeCode Available	1
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding	Dec 5, 2020	image-classificationImage Classification	CodeCode Available	1
Towards Part-Based Understanding of RGB-D Scans	Dec 3, 2020	3D Instance SegmentationInstance Segmentation	CodeCode Available	1
Group Contextual Encoding for 3D Point Clouds	Dec 1, 2020	Scene Understanding	CodeCode Available	1
RfD-Net: Point Scene Understanding by Semantic Instance Reconstruction	Nov 30, 2020	3D geometryObject	CodeCode Available	1
Visual place recognition: A survey from deep learning perspective	Nov 28, 2020	Deep LearningLoop Closure Detection	CodeCode Available	1
RELLIS-3D Dataset: Data, Benchmarks and Analysis	Nov 17, 2020	3D Semantic SegmentationAutonomous Navigation	CodeCode Available	1
SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchmark under Multiple Environments	Nov 9, 2020	Autonomous DrivingDepth Estimation	CodeCode Available	1
Towards Efficient Scene Understanding via Squeeze Reasoning	Nov 6, 2020	Instance Segmentationobject-detection	CodeCode Available	1
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic Segmentation	Oct 30, 2020	Instance SegmentationPanoptic Segmentation	CodeCode Available	1
Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce Model	Oct 25, 2020	Depth EstimationDepth Prediction	CodeCode Available	1
RADIATE: A Radar Dataset for Automotive Perception in Bad Weather	Oct 18, 2020	Autonomous DrivingBenchmarking	CodeCode Available	1
ALFWorld: Aligning Text and Embodied Environments for Interactive Learning	Oct 8, 2020	Natural Language Visual GroundingScene Understanding	CodeCode Available	1
MLRSNet: A Multi-label High Spatial Resolution Remote Sensing Dataset for Semantic Scene Understanding	Oct 1, 2020	Deep Learningimage-classification	CodeCode Available	1
BoMuDANet: Unsupervised Adaptation for Visual Scene Understanding in Unstructured Driving Environments	Sep 22, 2020	Domain AdaptationScene Understanding	CodeCode Available	1
Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and Challenges	Sep 7, 2020	Scene UnderstandingSemantic Segmentation	CodeCode Available	1
Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor Scene	Aug 11, 2020	Instance SegmentationPoint Cloud Segmentation	CodeCode Available	1
Polysemy Deciphering Network for Robust Human-Object Interaction Detection	Aug 7, 2020	Human-Object Interaction DetectionObject	CodeCode Available	1
Pose-based Modular Network for Human-Object Interaction Detection	Aug 5, 2020	Human-Object Interaction DetectionObject	CodeCode Available	1
Polysemy Deciphering Network for Human-Object Interaction Detection	Aug 1, 2020	Human-Object Interaction DetectionObject	CodeCode Available	1
Weakly Supervised 3D Object Detection from Point Clouds	Jul 28, 2020	3D Object DetectionKnowledge Distillation	CodeCode Available	1
Virtual Multi-view Fusion for 3D Semantic Segmentation	Jul 26, 2020	2D Semantic Segmentation3D Semantic Segmentation	CodeCode Available	1
Few-Shot Object Detection and Viewpoint Estimation for Objects in the Wild	Jul 23, 2020	Few-Shot Object DetectionMeta-Learning	CodeCode Available	1
PointContrast: Unsupervised Pre-training for 3D Point Cloud Understanding	Jul 21, 2020	Point Cloud Pre-trainingRepresentation Learning	CodeCode Available	1
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation	Jul 9, 2020	Scene Understanding	CodeCode Available	1
Learning and Reasoning with the Graph Structure Representation in Robotic Surgery	Jul 7, 2020	Edge ClassificationGraph Generation	CodeCode Available	1
A Survey on Deep Learning for Localization and Mapping: Towards the Age of Spatial Machine Intelligence	Jun 22, 2020	Deep LearningScene Understanding	CodeCode Available	1
Learning Visual Commonsense for Robust Scene Graph Generation	Jun 17, 2020	Graph GenerationScene Graph Generation	CodeCode Available	1
Benchmarking Unsupervised Object Representations for Video Sequences	Jun 12, 2020	BenchmarkingClustering	CodeCode Available	1
0-MMS: Zero-Shot Multi-Motion Segmentation With A Monocular Event Camera	Jun 11, 2020	Motion CompensationMotion Segmentation	CodeCode Available	1
IDA-3D: Instance-Depth-Aware 3D Object Detection From Stereo Vision for Autonomous Driving	Jun 1, 2020	3D Object DetectionAutonomous Driving	CodeCode Available	1
VTGNet: A Vision-based Trajectory Generation Network for Autonomous Vehicles in Urban Environments	Apr 27, 2020	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1
Cityscapes-Panoptic-Parts and PASCAL-Panoptic-Parts datasets for Scene Understanding	Apr 16, 2020	Human Part SegmentationPanoptic Segmentation	CodeCode Available	1
Self-Supervised Scene De-occlusion	Apr 6, 2020	Image ManipulationScene Understanding	CodeCode Available	1
Context Prior for Scene Segmentation	Apr 3, 2020	Scene SegmentationScene Understanding	CodeCode Available	1
PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation	Apr 3, 2020	3D Instance SegmentationClustering	CodeCode Available	1
Semantic Segmentation of Underwater Imagery: Dataset and Benchmark	Apr 2, 2020	Computational EfficiencyDecoder	CodeCode Available	1
Occlusion-Aware Depth Estimation with Adaptive Normal Constraints	Apr 2, 2020	3D ReconstructionDepth Estimation	CodeCode Available	1
Distilled Semantics for Comprehensive Scene Understanding from Videos	Mar 31, 2020	Depth EstimationKnowledge Distillation	CodeCode Available	1
Learning Human-Object Interaction Detection using Interaction Points	Mar 31, 2020	Human-Object Interaction DetectionKeypoint Detection	CodeCode Available	1
LayoutMP3D: Layout Annotation of Matterport3D	Mar 30, 2020	Scene Understanding	CodeCode Available	1
Multi-Path Region Mining For Weakly Supervised 3D Semantic Segmentation on Point Clouds	Mar 29, 2020	3D Semantic SegmentationPoint Cloud Segmentation	CodeCode Available	1
CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D Networks	Mar 28, 2020	3D Medical Imaging SegmentationAction Recognition	CodeCode Available	1
SaccadeNet: A Fast and Accurate Object Detector	Mar 26, 2020	Objectobject-detection	CodeCode Available	1
Who2com: Collaborative Perception via Learnable Handshake Communication	Mar 21, 2020	Multi-agent Reinforcement LearningReinforcement Learning	CodeCode Available	1
Explainable Object-induced Action Decision for Autonomous Vehicles	Mar 20, 2020	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1
Toronto-3D: A Large-scale Mobile LiDAR Dataset for Semantic Segmentation of Urban Roadways	Mar 18, 2020	3D Semantic SegmentationAutonomous Driving	CodeCode Available	1
Scene Completeness-Aware Lidar Depth Completion for Driving Scenario	Mar 15, 2020	Depth CompletionRGBD Semantic Segmentation	CodeCode Available	1
Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single Image	Feb 27, 2020	3D Object Detection3D Shape Reconstruction	CodeCode Available	1

Show:10 25 50

← PrevPage 9 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified