Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1101–1150 of 1723 papers

Title	Date	Tasks	Status	Hype
Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds	Sep 23, 2021	3D Semantic Scene Completion, 3D Semantic Segmentation	Code Available	1
Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation	Sep 20, 2021	Decoder, Prediction	Code Available	1
Audio-Visual Collaborative Representation Learning for Dynamic Saliency Prediction	Sep 17, 2021	Representation Learning, Saliency Prediction	Unverified	0
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning	Sep 16, 2021	Decoder, Image Captioning	Code Available	0
Navigation-Oriented Scene Understanding for Robotic Autonomy: Learning to Segment Driveability in Egocentric Images	Sep 15, 2021	Autonomous Navigation, Decision Making	Unverified	0
On the Sins of Image Synthesis Loss for Self-supervised Depth Estimation	Sep 13, 2021	Attribute, Depth Estimation	Unverified	0
PQ-Transformer: Jointly Parsing 3D Objects and Layouts from Point Clouds	Sep 12, 2021	object-detection, Object Detection	Code Available	1
Residual 3D Scene Flow Learning with Context-Aware Feature Extraction	Sep 10, 2021	Autonomous Driving, Scene Flow Estimation	Unverified	0
Single Image 3D Object Estimation with Primitive Graph Networks	Sep 9, 2021	Graph Neural Network, Object	Code Available	0
RefineCap: Concept-Aware Refinement for Image Captioning	Sep 8, 2021	Decoder, Descriptive	Unverified	0
Panoptic nuScenes: A Large-Scale Benchmark for LiDAR Panoptic Segmentation and Tracking	Sep 8, 2021	Benchmarking, Diversity	Code Available	2
Improving Building Segmentation for Off-Nadir Satellite Imagery	Sep 8, 2021	Scene Understanding, Segmentation	Unverified	0
Binaural SoundNet: Predicting Semantics, Depth and Motion with Binaural Sounds	Sep 6, 2021	Scene Understanding, Super-Resolution	Unverified	0
Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds	Sep 1, 2021	3D Object Detection, 3D Point Cloud Classification	Code Available	1
From General to Specific: Informative Scene Graph Generation via Balance Adjustment	Aug 30, 2021	Blocking, Graph Generation	Code Available	1
Multi-task learning from fixed-wing UAV images for 2D/3D city modeling	Aug 25, 2021	Change Detection, Depth Estimation	Unverified	0
DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization	Aug 24, 2021	Diversity, Graph Neural Network	Code Available	1
Deep Bayesian Image Set Classification: A Defence Approach against Adversarial Attacks	Aug 23, 2021	Face Recognition, Object Recognition	Unverified	0
ODAM: Object Detection, Association, and Mapping using Posed RGB Video	Aug 23, 2021	3D Object Detection, Graph Neural Network	Code Available	1
A Multiple-View Geometric Model for Specularity Prediction on General Curved Surfaces	Aug 20, 2021	3D Reconstruction, Prediction	Unverified	0
Patch2CAD: Patchwise Embedding Learning for In-the-Wild Shape Retrieval from a Single Image	Aug 20, 2021	Retrieval, Scene Understanding	Unverified	0
Panoramic Depth Estimation via Supervised and Unsupervised Learning in Indoor Scenes	Aug 18, 2021	Camera Calibration, Depth Estimation	Code Available	0
Deployment of Deep Neural Networks for Object Detection on Edge AI Devices with Runtime Optimization	Aug 18, 2021	2D Object Detection, 3D Object Detection	Unverified	0
Instance Segmentation in 3D Scenes using Semantic Superpoint Tree Networks	Aug 17, 2021	3D Instance Segmentation, Instance Segmentation	Code Available	1
A Hybrid Sparse-Dense Monocular SLAM System for Autonomous Driving	Aug 17, 2021	Autonomous Driving, Depth Estimation	Code Available	1
Indoor Semantic Scene Understanding using Multi-modality Fusion	Aug 17, 2021	Scene Understanding	Unverified	0
UniNet: A Unified Scene Understanding Network and Exploring Multi-Task Relationships through the Lens of Adversarial Attacks	Aug 10, 2021	Depth Estimation, Depth Prediction	Code Available	0
Learning Multi-Granular Spatio-Temporal Graph Network for Skeleton-based Action Recognition	Aug 10, 2021	Action Classification, Action Recognition	Code Available	1
Self-supervised Learning of Occlusion Aware Flow Guided 3D Geometry Perception with Adaptive Cross Weighted Loss from Monocular Videos	Aug 9, 2021	3D geometry, 3D Geometry Perception	Unverified	0
One-Shot Object Affordance Detection in the Wild	Aug 8, 2021	Action Recognition, Affordance Detection	Code Available	1
Bird's-Eye-View Panoptic Segmentation Using Monocular Frontal View Images	Aug 6, 2021	Depth Estimation, Panoptic Segmentation	Code Available	1
Interpretable Visual Understanding with Cognitive Attention Network	Aug 6, 2021	Scene Understanding, Visual Commonsense Reasoning	Code Available	0
From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection	Jul 30, 2021	3D Object Detection, object-detection	Code Available	1
CI-Net: Contextual Information for Joint Semantic Segmentation and Depth Estimation	Jul 29, 2021	Depth Estimation, Monocular Depth Estimation	Unverified	0
Arabic Scene Text Recognition in the Deep Learning Era: Analysis on A Novel Dataset	Jul 27, 2021	Scene Text Recognition, Scene Understanding	Code Available	1
ReDAL: Region-based and Diversity-aware Active Learning for Point Cloud Semantic Segmentation	Jul 25, 2021	Active Learning, Deep Learning	Code Available	1
Class-Incremental Domain Adaptation with Smoothing and Calibration for Surgical Report Generation	Jul 23, 2021	Domain Adaptation, Few-Shot Learning	Code Available	1
Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds	Jul 23, 2021	Point Cloud Segmentation, Scene Understanding	Unverified	0
Photon-Starved Scene Inference using Single Photon Cameras	Jul 23, 2021	Depth Estimation, image-classification	Code Available	1
Weighted Intersection over Union (wIoU) for Evaluating Image Segmentation	Jul 21, 2021	Image Segmentation, object-detection	Code Available	0
Generative Video Transformer: Can Objects be the Words?	Jul 20, 2021	GPU, Scene Understanding	Unverified	0
Accelerating deep neural networks for efficient scene understanding in automotive cyber-physical systems	Jul 19, 2021	Model Compression, object-detection	Unverified	0
CodeMapping: Real-Time Dense Mapping for Sparse SLAM using Compact Scene Representations	Jul 19, 2021	3D Reconstruction, Depth Estimation	Unverified	0
DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference	Jul 16, 2021	Scene Understanding, Segmentation	Unverified	0
SynPick: A Dataset for Dynamic Bin Picking Scene Understanding	Jul 10, 2021	ARC, Dataset Generation	Code Available	1
A Weakly-Supervised Depth Estimation Network Using Attention Mechanism	Jul 10, 2021	Depth Estimation, Monocular Depth Estimation	Unverified	0
Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting	Jul 6, 2021	3D Object Detection, Autonomous Driving	Code Available	0
Empowering cyberphysical systems of systems with intelligence	Jul 5, 2021	Decision Making, Management	Unverified	0
Hybrid Memoised Wake-Sleep: Approximate Inference at the Discrete-Continuous Interface	Jul 4, 2021	Scene Understanding, Time Series	Unverified	0
Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory	Jul 4, 2021	Question Answering, Scene Understanding	Code Available	0

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified