Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 326–350 of 1723 papers

Title	Date	Tasks	Status	Hype
MSeg: A Composite Dataset for Multi-domain Semantic Segmentation	Dec 27, 2021	Computational EfficiencyInstance Segmentation	CodeCode Available	1
Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth Estimation	Dec 24, 2021	Depth EstimationDepth Prediction	CodeCode Available	1
Comprehensive Visual Question Answering on Point Clouds through Compositional Scene Manipulation	Dec 22, 2021	Common Sense ReasoningQuestion Answering	CodeCode Available	1
ScanQA: 3D Question Answering for Spatial Scene Understanding	Dec 20, 2021	3D Question Answering (3D-QA)Object	CodeCode Available	1
Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic Segmentation	Dec 16, 2021	Feature ImportanceScene Understanding	CodeCode Available	1
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic Segmentation	Dec 5, 2021	Depth-aware Video Panoptic SegmentationDepth Estimation	CodeCode Available	1
Behind the Curtain: Learning Occluded Shapes for 3D Object Detection	Dec 4, 2021	3D Object DetectionObject	CodeCode Available	1
AirObject: A Temporally Evolving Graph Embedding for Object Identification	Nov 30, 2021	Graph AttentionGraph Embedding	CodeCode Available	1
Instance-wise Occlusion and Depth Orders in Natural Scenes	Nov 29, 2021	Depth EstimationDepth Prediction	CodeCode Available	1
Cerberus Transformer: Joint Semantic, Affordance and Attribute Parsing	Nov 24, 2021	AttributeScene Understanding	CodeCode Available	1
Grounded Situation Recognition with Transformers	Nov 19, 2021	DecoderGrounded Situation Recognition	CodeCode Available	1
ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D Data	Nov 17, 2021	3D Object Detectionobject-detection	CodeCode Available	1
Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views	Nov 13, 2021	ObjectScene Understanding	CodeCode Available	1
Panoptic 3D Scene Reconstruction From a Single RGB Image	Nov 3, 2021	2D Panoptic Segmentation3D Instance Segmentation	CodeCode Available	1
3DP3: 3D Scene Perception via Probabilistic Programming	Oct 30, 2021	ObjectPose Estimation	CodeCode Available	1
A Versatile and Efficient Reinforcement Learning Framework for Autonomous Driving	Oct 22, 2021	Autonomous Drivingreinforcement-learning	CodeCode Available	1
PlaneRecNet: Multi-Task Learning with Cross-Task Consistency for Piece-Wise Plane Detection and Reconstruction from a Single RGB Image	Oct 21, 2021	DecoderDepth Estimation	CodeCode Available	1
Structured Bird's-Eye-View Traffic Scene Understanding from Onboard Images	Oct 5, 2021	Autonomous NavigationLane Detection	CodeCode Available	1
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3D	Sep 28, 2021	Multiple Object TrackingNovel View Synthesis	CodeCode Available	1
Semantic Segmentation-assisted Scene Completion for LiDAR Point Clouds	Sep 23, 2021	3D Semantic Scene Completion3D Semantic Segmentation	CodeCode Available	1
Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal Estimation	Sep 20, 2021	DecoderPrediction	CodeCode Available	1
PQ-Transformer: Jointly Parsing 3D Objects and Layouts from Point Clouds	Sep 12, 2021	object-detectionObject Detection	CodeCode Available	1
Spatio-temporal Self-Supervised Representation Learning for 3D Point Clouds	Sep 1, 2021	3D Object Detection3D Point Cloud Classification	CodeCode Available	1
From General to Specific: Informative Scene Graph Generation via Balance Adjustment	Aug 30, 2021	BlockingGraph Generation	CodeCode Available	1
DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based Optimization	Aug 24, 2021	DiversityGraph Neural Network	CodeCode Available	1

Show:10 25 50

← PrevPage 14 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified