Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 751–775 of 1723 papers

Title	Date	Tasks	Status
Improving Human-Object Interaction Detection via Phrase Learning and Label Composition	Dec 14, 2021	Human-Object Interaction DetectionScene Understanding	—Unverified
Improving Building Segmentation for Off-Nadir Satellite Imagery	Sep 8, 2021	Scene UnderstandingSegmentation	—Unverified
Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds	Jul 23, 2021	Point Cloud SegmentationScene Understanding	—Unverified
Label-Driven Reconstruction for Domain Adaptation in Semantic Segmentation	Mar 10, 2020	Domain AdaptationScene Understanding	—Unverified
Improving 6D Object Pose Estimation of metallic Household and Industry Objects	Mar 5, 2025	6D Pose Estimation using RGBPose Estimation	—Unverified
Deep ensembles based on Stochastic Activation Selection for Polyp Segmentation	Apr 2, 2021	Autonomous DrivingDecoder	—Unverified
LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding	Dec 23, 2024	3D Semantic SegmentationScene Understanding	—Unverified
A Multiple-View Geometric Model for Specularity Prediction on General Curved Surfaces	Aug 20, 2021	3D ReconstructionPrediction	—Unverified
Looking Beyond the Visible Scene	Jun 1, 2014	Scene Understanding	—Unverified
IMENet: Joint 3D Semantic Scene Completion and 2D Semantic Segmentation through Iterative Mutual Enhancement	Jun 29, 2021	2D Semantic Segmentation3D Semantic Scene Completion	—Unverified
Image-to-Height Domain Translation for Synthetic Aperture Sonar	Dec 12, 2021	Generative Adversarial NetworkScene Understanding	—Unverified
Deep cross-domain building extraction for selective depth estimation from oblique aerial imagery	Apr 23, 2018	3D ReconstructionDepth Estimation	—Unverified
Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding	Sep 26, 2023	Scene UnderstandingSimultaneous Localization and Mapping	—Unverified
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving	Jan 7, 2025	Autonomous DrivingContrastive Learning	—Unverified
Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Experiments, and Challenges	Oct 20, 2024	Autonomous DrivingDecision Making	—Unverified
Large Language Models (LLMs) as Traffic Control Systems at Urban Intersections: A New Paradigm	Nov 16, 2024	Autonomous VehiclesDecision Making	—Unverified
Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems	Jun 17, 2025	Autonomous DrivingImage Segmentation	—Unverified
A Comprehensive Review of Modern Object Segmentation Approaches	Jan 13, 2023	Image SegmentationObject	—Unverified
LCrowdV: Generating Labeled Videos for Simulation-based Crowd Behavior Learning	Jun 29, 2016	General ClassificationPedestrian Detection	—Unverified
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment	Jun 17, 2025	Autonomous DrivingInstance Segmentation	—Unverified
Image Parsing with Stochastic Scene Grammar	Dec 1, 2011	ClusteringScene Labeling	—Unverified
Learning 3D Robotics Perception using Inductive Priors	May 30, 2024	3D ReconstructionImage Generation	—Unverified
Deep Contextual Attention for Human-Object Interaction Detection	Oct 17, 2019	Human-Object Interaction DetectionObject	—Unverified
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions	Apr 8, 2020	3d scene graph generation3D Semantic Segmentation	—Unverified
DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding	Mar 16, 2016	ObjectScene Understanding	—Unverified

Show:10 25 50

← PrevPage 31 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified