Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1501–1525 of 1723 papers

Title	Date	Tasks	Status
On the iterative refinement of densely connected representation levels for semantic segmentation	Apr 30, 2018	Image SegmentationScene Understanding	CodeCode Available
One model to use them all: Training a segmentation model with complementary datasets	Feb 29, 2024	AllAnatomy	CodeCode Available
Image interpretation by iterative bottom-up top-down processing	May 12, 2021	Scene Understanding	CodeCode Available
Depth-Induced Multi-Scale Recurrent Attention Network for Saliency Detection	Oct 1, 2019	RGB-D Salient Object DetectionSaliency Detection	CodeCode Available
PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds	Mar 18, 2025	3D Object Detection3D Semantic Segmentation	CodeCode Available
Adapting Deep Network Features to Capture Psychological Representations	Aug 6, 2016	Object RecognitionScene Understanding	CodeCode Available
Single Image 3D Object Estimation with Primitive Graph Networks	Sep 9, 2021	Graph Neural NetworkObject	CodeCode Available
Towards Ambiguity-Free Spatial Foundation Model: Rethinking and Decoupling Depth Ambiguity	Mar 8, 2025	Depth EstimationScene Understanding	CodeCode Available
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields	Mar 17, 2024	3D ReconstructionNeRF	CodeCode Available
Unsupervised Single-shot Depth Estimation using Perceptual Reconstruction	Jan 28, 2022	3D ReconstructionDepth Estimation	CodeCode Available
Towards CLIP-driven Language-free 3D Visual Grounding via 2D-3D Relational Enhancement and Consistency	Jan 1, 2024	3D visual groundingRelation	CodeCode Available
Quantitative Depth Quality Assessment of RGBD Cameras At Close Range Using 3D Printed Fixtures	Mar 21, 2019	Scene Understanding	CodeCode Available
Single Network Panoptic Segmentation for Street Scene Understanding	Feb 7, 2019	Instance SegmentationPanoptic Segmentation	CodeCode Available
Image-based Navigation in Real-World Environments via Multiple Mid-level Representations: Fusion Models, Benchmark and Efficient Evaluation	Feb 2, 2022	PointGoal NavigationScene Understanding	CodeCode Available
Object-aware Sound Source Localization via Audio-Visual Scene Understanding	Jan 1, 2025	Scene UnderstandingSound Source Localization	CodeCode Available
Single Shot Scene Text Retrieval	Aug 27, 2018	Image RetrievalRetrieval	CodeCode Available
Beyond Human Perception: Understanding Multi-Object World from Monocular View	Jan 1, 2025	3D visual groundingDenoising	CodeCode Available
The Ikshana Hypothesis of Human Scene Understanding	Jan 21, 2021	Representation LearningScene Understanding	CodeCode Available
Skip-GANomaly: Skip Connected and Adversarially Trained Encoder-Decoder Anomaly Detection	Jan 25, 2019	Anomaly DetectionDecoder	CodeCode Available
DenseASPP for Semantic Segmentation in Street Scenes	Jun 1, 2018	Autonomous DrivingImage Segmentation	CodeCode Available
Weakly Supervised Segmentation on Outdoor 4D Point Clouds With Temporal Matching and Spatial Graph Propagation	Jan 1, 2022	Point Cloud SegmentationScene Understanding	CodeCode Available
Towards Improving the Generation Quality of Autoregressive Slot VAEs	Jun 3, 2022	Image GenerationObject	CodeCode Available
SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences	Apr 2, 2019	3D Semantic SegmentationScene Understanding	CodeCode Available
Deep Video Deblurring for Hand-Held Cameras	Jul 1, 2017	DeblurringImage Deblurring	CodeCode Available
RCNet: Deep Recurrent Collaborative Network for Multi-View Low-Light Image Enhancement	Sep 6, 2024	Image EnhancementLow-Light Image Enhancement	CodeCode Available

Show:10 25 50

← PrevPage 61 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified