Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 351–375 of 1723 papers

Title	Date	Tasks	Status	Hype
ODAM: Object Detection, Association, and Mapping using Posed RGB Video	Aug 23, 2021	3D Object Detection, Graph Neural Network	Code Available	1
Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks	Aug 17, 2021	3D Instance Segmentation, Instance Segmentation	Code Available	1
A Hybrid Sparse-Dense Monocular SLAM System for Autonomous Driving	Aug 17, 2021	Autonomous Driving, Depth Estimation	Code Available	1
Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition	Aug 10, 2021	Action Classification, Action Recognition	Code Available	1
One-Shot Object Affordance Detection in the Wild	Aug 8, 2021	Action Recognition, Affordance Detection	Code Available	1
Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images	Aug 6, 2021	Depth Estimation, Panoptic Segmentation	Code Available	1
From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection	Jul 30, 2021	3D Object Detection, object-detection	Code Available	1
Arabic Scene Text Recognition in the Deep Learning Era: Analysis on A Novel Dataset	Jul 27, 2021	Scene Text Recognition, Scene Understanding	Code Available	1
ReDAL: Region-based and Diversity-aware Active Learning for Point Cloud Semantic Segmentation	Jul 25, 2021	Active Learning, Deep Learning	Code Available	1
Photon-Starved Scene Inference using Single Photon Cameras	Jul 23, 2021	Depth Estimation, image-classification	Code Available	1
Class-Incremental Domain Adaptation with Smoothing and Calibration for Surgical Report Generation	Jul 23, 2021	Domain Adaptation, Few-Shot Learning	Code Available	1
SynPick: A Dataset for Dynamic Bin Picking Scene Understanding	Jul 10, 2021	ARC, Dataset Generation	Code Available	1
A Survey on Deep Learning Technique for Video Segmentation	Jul 2, 2021	Autonomous Driving, Deep Learning	Code Available	1
P2T: Pyramid Pooling Transformer for Scene Understanding	Jun 22, 2021	image-classification, Image Classification	Code Available	1
EPMF: Efficient Perception-aware Multi-sensor Fusion for 3D Semantic Segmentation	Jun 21, 2021	3D Semantic Segmentation, Autonomous Driving	Code Available	1
Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-View Transformation	Jun 19, 2021	Autonomous Driving, GPU	Code Available	1
Part-aware Panoptic Segmentation	Jun 11, 2021	Image Segmentation, Panoptic Segmentation	Code Available	1
Vision Transformers with Hierarchical Attention	Jun 6, 2021	image-classification, Image Classification	Code Available	1
Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering	Jun 4, 2021	Meta-Learning, Scene Understanding	Code Available	1
Lane Graph Estimation for Scene Understanding in Urban Driving	May 1, 2021	Autonomous Driving, Autonomous Vehicles	Code Available	1
RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition	Apr 24, 2021	Image Captioning, Object Recognition	Code Available	1
SSPC-Net: Semi-supervised Semantic 3D Point Cloud Segmentation Network	Apr 16, 2021	Point Cloud Segmentation, Scene Understanding	Code Available	1
Visiting the Invisible: Layer-by-Layer Completed Scene Decomposition	Apr 12, 2021	Instance Segmentation, Scene Understanding	Code Available	1
Semantic Scene Completion via Integrating Instances and Scene in-the-Loop	Apr 8, 2021	3D Semantic Scene Completion, Scene Understanding	Code Available	1
Learning Triadic Belief Dynamics in Nonverbal Communication from Videos	Apr 7, 2021	Scene Understanding	Code Available	1

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified