Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–400 of 1723 papers

Title	Date	Tasks	Status	Hype
ODAM: Object Detection, Association, and Mapping using Posed RGB Video	Aug 23, 2021	3D Object DetectionGraph Neural Network	CodeCode Available	1
Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks	Aug 17, 2021	3D Instance SegmentationInstance Segmentation	CodeCode Available	1
A Hybrid Sparse-Dense Monocular SLAM System for Autonomous Driving	Aug 17, 2021	Autonomous DrivingDepth Estimation	CodeCode Available	1
Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition	Aug 10, 2021	Action ClassificationAction Recognition	CodeCode Available	1
One-Shot Object Affordance Detection in the Wild	Aug 8, 2021	Action RecognitionAffordance Detection	CodeCode Available	1
Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images	Aug 6, 2021	Depth EstimationPanoptic Segmentation	CodeCode Available	1
From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection	Jul 30, 2021	3D Object Detectionobject-detection	CodeCode Available	1
Arabic Scene Text Recognition in the Deep Learning Era: Analysis on A Novel Dataset	Jul 27, 2021	Scene Text RecognitionScene Understanding	CodeCode Available	1
ReDAL: Region-based and Diversity-aware Active Learning for Point Cloud Semantic Segmentation	Jul 25, 2021	Active LearningDeep Learning	CodeCode Available	1
Photon-Starved Scene Inference using Single Photon Cameras	Jul 23, 2021	Depth Estimationimage-classification	CodeCode Available	1
Class-Incremental Domain Adaptation with Smoothing and Calibration for Surgical Report Generation	Jul 23, 2021	Domain AdaptationFew-Shot Learning	CodeCode Available	1
SynPick: A Dataset for Dynamic Bin Picking Scene Understanding	Jul 10, 2021	ARCDataset Generation	CodeCode Available	1
A Survey on Deep Learning Technique for Video Segmentation	Jul 2, 2021	Autonomous DrivingDeep Learning	CodeCode Available	1
P2T: Pyramid Pooling Transformer for Scene Understanding	Jun 22, 2021	image-classificationImage Classification	CodeCode Available	1
EPMF: Efficient Perception-aware Multi-sensor Fusion for 3D Semantic Segmentation	Jun 21, 2021	3D Semantic SegmentationAutonomous Driving	CodeCode Available	1
Projecting Your View Attentively: Monocular Road Scene Layout Estimation via Cross-View Transformation	Jun 19, 2021	Autonomous DrivingGPU	CodeCode Available	1
Part-aware Panoptic Segmentation	Jun 11, 2021	Image SegmentationPanoptic Segmentation	CodeCode Available	1
Vision Transformers with Hierarchical Attention	Jun 6, 2021	image-classificationImage Classification	CodeCode Available	1
Light Field Networks: Neural Scene Representations with Single-Evaluation Rendering	Jun 4, 2021	Meta-LearningScene Understanding	CodeCode Available	1
Lane Graph Estimation for Scene Understanding in Urban Driving	May 1, 2021	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1
RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition	Apr 24, 2021	Image CaptioningObject Recognition	CodeCode Available	1
SSPC-Net: Semi-supervised Semantic 3D Point Cloud Segmentation Network	Apr 16, 2021	Point Cloud SegmentationScene Understanding	CodeCode Available	1
Visiting the Invisible: Layer-by-Layer Completed Scene Decomposition	Apr 12, 2021	Instance SegmentationScene Understanding	CodeCode Available	1
Semantic Scene Completion via Integrating Instances and Scene in-the-Loop	Apr 8, 2021	3D Semantic Scene CompletionScene Understanding	CodeCode Available	1
Affordance Transfer Learning for Human-Object Interaction Detection	Apr 7, 2021	Affordance DetectionAffordance Recognition	CodeCode Available	1
Learning Triadic Belief Dynamics in Nonverbal Communication from Videos	Apr 7, 2021	Scene Understanding	CodeCode Available	1
Multi-View Radar Semantic Segmentation	Mar 30, 2021	Autonomous Drivingobject-detection	CodeCode Available	1
SceneGraphFusion: Incremental 3D Scene Graph Prediction from RGB-D Sequences	Mar 27, 2021	3D Object Classification3d scene graph generation	CodeCode Available	1
Bidirectional Projection Network for Cross Dimension Scene Understanding	Mar 26, 2021	2D Semantic Segmentation3D Semantic Segmentation	CodeCode Available	1
Tracking Pedestrian Heads in Dense Crowd	Mar 24, 2021	Head DetectionMulti-Object Tracking	CodeCode Available	1
Relation-aware Instance Refinement for Weakly Supervised Visual Grounding	Mar 24, 2021	ObjectRelation	CodeCode Available	1
OFFSEG: A Semantic Segmentation Framework For Off-Road Driving	Mar 23, 2021	Scene UnderstandingSegmentation	CodeCode Available	1
Detecting Human-Object Interaction via Fabricated Compositional Learning	Mar 15, 2021	Affordance RecognitionHuman-Object Interaction Detection	CodeCode Available	1
Monte Carlo Scene Search for 3D Scene Understanding	Mar 14, 2021	Scene Understanding	CodeCode Available	1
Holistic 3D Scene Understanding from a Single Image with Implicit Representation	Mar 11, 2021	3D Object Detection3D Shape Reconstruction	CodeCode Available	1
Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality	Mar 11, 2021	Scene UnderstandingTime Series	CodeCode Available	1
Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph Analysis	Mar 9, 2021	3d scene graph generationgraph construction	CodeCode Available	1
Panoramic Panoptic Segmentation: Towards Complete Surrounding Understanding via Unsupervised Contrastive Learning	Mar 1, 2021	Contrastive LearningPanoptic Segmentation	CodeCode Available	1
FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point Cloud Segmentation	Mar 1, 2021	3D Semantic SegmentationDecoder	CodeCode Available	1
Boundary-induced and scene-aggregated network for monocular depth prediction	Feb 26, 2021	Depth EstimationDepth Prediction	CodeCode Available	1
4D Panoptic LiDAR Segmentation	Feb 24, 2021	4D Panoptic SegmentationBenchmarking	CodeCode Available	1
RGB-D Railway Platform Monitoring and Scene Understanding for Enhanced Passenger Safety	Feb 23, 2021	Multi-Object TrackingMultiview Detection	CodeCode Available	1
Weakly Supervised Learning of Rigid 3D Scene Flow	Feb 17, 2021	Autonomous DrivingScene Flow Estimation	CodeCode Available	1
A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed Images	Feb 16, 2021	Decision MakingScene Understanding	CodeCode Available	1
Single-Shot Cuboids: Geodesics-based End-to-end Manhattan Aligned Layout Estimation from Spherical Panoramas	Feb 7, 2021	Keypoint EstimationScene Understanding	CodeCode Available	1
OpenGF: An Ultra-Large-Scale Ground Filtering Dataset Built Upon Open ALS Point Clouds Around the World	Jan 24, 2021	3D Semantic SegmentationDeep Learning	CodeCode Available	1
Automatic Extrinsic Calibration Method for LiDAR and Camera Sensor Setups	Jan 12, 2021	Scene Understanding	CodeCode Available	1
Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship Detection	Jan 1, 2021	Common Sense ReasoningGraph Generation	CodeCode Available	1
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene Contexts	Dec 16, 2020	3D Semantic SegmentationInstance Segmentation	CodeCode Available	1
Event-based Motion Segmentation with Spatio-Temporal Graph Cuts	Dec 16, 2020	Motion SegmentationScene Understanding	CodeCode Available	1

Show:10 25 50

← PrevPage 8 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified