Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Title	Date	Tasks	Status	Hype
Indoor Semantic Scene Understanding using Multi-modality Fusion	Aug 17, 2021	Scene Understanding	Unverified	0
UniNet: A Unified Scene Understanding Network and Exploring Multi-Task Relationships through the Lens of Adversarial Attacks	Aug 10, 2021	Depth Estimation, Depth Prediction	Code Available	0
Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition	Aug 10, 2021	Action Classification, Action Recognition	Code Available	1
Self-supervised Learning of Occlusion Aware Flow Guided 3D Geometry Perception with Adaptive Cross Weighted Loss from Monocular Videos	Aug 9, 2021	3D geometry, 3D Geometry Perception	Unverified	0
One-Shot Object Affordance Detection in the Wild	Aug 8, 2021	Action Recognition, Affordance Detection	Code Available	1
Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images	Aug 6, 2021	Depth Estimation, Panoptic Segmentation	Code Available	1
Interpretable Visual Understanding with Cognitive Attention Network	Aug 6, 2021	Scene Understanding, Visual Commonsense Reasoning	Code Available	0
From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection	Jul 30, 2021	3D Object Detection, object-detection	Code Available	1
CI-Net: Contextual Information for Joint Semantic Segmentation and Depth Estimation	Jul 29, 2021	Depth Estimation, Monocular Depth Estimation	Unverified	0
Arabic Scene Text Recognition in the Deep Learning Era: Analysis on A Novel Dataset	Jul 27, 2021	Scene Text Recognition, Scene Understanding	Code Available	1
ReDAL: Region-based and Diversity-aware Active Learning for Point Cloud Semantic Segmentation	Jul 25, 2021	Active Learning, Deep Learning	Code Available	1
Class-Incremental Domain Adaptation with Smoothing and Calibration for Surgical Report Generation	Jul 23, 2021	Domain Adaptation, Few-Shot Learning	Code Available	1
Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds	Jul 23, 2021	Point Cloud Segmentation, Scene Understanding	Unverified	0
Photon-Starved Scene Inference using Single Photon Cameras	Jul 23, 2021	Depth Estimation, image-classification	Code Available	1
Weighted Intersection over Union (wIoU) for Evaluating Image Segmentation	Jul 21, 2021	Image Segmentation, object-detection	Code Available	0
Generative Video Transformer: Can Objects be the Words?	Jul 20, 2021	GPU, Scene Understanding	Unverified	0
Accelerating deep neural networks for efficient scene understanding in automotive cyber-physical systems	Jul 19, 2021	Model Compression, object-detection	Unverified	0
CodeMapping: Real-Time Dense Mapping for Sparse SLAM using Compact Scene Representations	Jul 19, 2021	3D Reconstruction, Depth Estimation	Unverified	0
DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference	Jul 16, 2021	Scene Understanding, Segmentation	Unverified	0
SynPick: A Dataset for Dynamic Bin Picking Scene Understanding	Jul 10, 2021	ARC, Dataset Generation	Code Available	1
A Weakly-Supervised Depth Estimation Network Using Attention Mechanism	Jul 10, 2021	Depth Estimation, Monocular Depth Estimation	Unverified	0
Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting	Jul 6, 2021	3D Object Detection, Autonomous Driving	Code Available	0
Empowering cyberphysical systems of systems with intelligence	Jul 5, 2021	Decision Making, Management	Unverified	0
Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface	Jul 4, 2021	Scene Understanding, Time Series	Unverified	0
Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory	Jul 4, 2021	Question Answering, Scene Understanding	Code Available	0

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified