Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 601–625 of 1723 papers

Title	Date	Tasks	Status
Boundary Seeking GANs	Jan 1, 2018	Scene UnderstandingText Generation	—Unverified
InLUT3D: Challenging real indoor dataset for point cloud analysis	Jul 22, 2024	BenchmarkingScene Understanding	—Unverified
DriveGenVLM: Real-world Video Generation for Vision Language Model based Autonomous Driving	Aug 29, 2024	Autonomous DrivingDenoising	—Unverified
DreamAnywhere: Object-Centric Panoramic 3D Scene Generation	Jun 25, 2025	Novel View SynthesisObject	—Unverified
Bottom-up Instance Segmentation using Deep Higher-Order CRFs	Sep 8, 2016	Instance SegmentationObject	—Unverified
3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map	Dec 10, 2021	Autonomous VehiclesNavigate	—Unverified
In pixels we trust: From Pixel Labeling to Object Localization and Scene Categorization	Jul 19, 2018	object-detectionObject Detection	—Unverified
DORSal: Diffusion for Object-centric Representations of Scenes et al	Jun 13, 2023	Neural RenderingObject	—Unverified
DORAEMON: Decentralized Ontology-aware Reliable Agent with Enhanced Memory Oriented Navigation	May 28, 2025	Autonomous NavigationRAG	—Unverified
Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding	Dec 1, 2021	DisentanglementDomain Adaptation	—Unverified
Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs	Jun 5, 2025	cross-modal alignmentDense Captioning	—Unverified
Does CLIP perceive art the same way we do?	May 8, 2025	Image GenerationScene Understanding	—Unverified
Both Style and Distortion Matter: Dual-Path Unsupervised Domain Adaptation for Panoramic Semantic Segmentation	Mar 25, 2023	Domain AdaptationERP	—Unverified
Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions	Sep 11, 2018	Question AnsweringScene Understanding	—Unverified
Do Deep Neural Networks Model Nonlinear Compositionality in the Neural Representation of Human-Object Interactions?	Mar 31, 2019	Human-Object Interaction DetectionObject	—Unverified
Answerability Fields: Answerable Location Estimation via Diffusion Models	Jul 26, 2024	Question AnsweringScene Understanding	—Unverified
Indoor Semantic Scene Understanding using Multi-modality Fusion	Aug 17, 2021	Scene Understanding	—Unverified
Inferring Shared Attention in Social Scene Videos	Jun 1, 2018	Scene Understanding	—Unverified
In-Place Panoptic Radiance Field Segmentation with Perceptual Prior for 3D Scene Understanding	Oct 6, 2024	2D Panoptic SegmentationAutonomous Driving	—Unverified
Interactive Occlusion Boundary Estimation through Exploitation of Synthetic Data	Aug 27, 2024	Domain AdaptationScene Understanding	—Unverified
DIV-FF: Dynamic Image-Video Feature Fields For Environment Understanding in Egocentric Videos	Mar 11, 2025	Scene Understanding	—Unverified
Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation	May 11, 2025	Autonomous DrivingDomain Adaptation	—Unverified
Distraction-Aware Shadow Detection	Jun 1, 2019	Scene UnderstandingShadow Detection	—Unverified
DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features	Jun 17, 2024	3D geometry3D Semantic Occupancy Prediction	—Unverified
An Intelligent Safety System for Human-Centered Semi-Autonomous Vehicles	Dec 10, 2018	Autonomous DrivingAutonomous Vehicles	—Unverified

Show:10 25 50

← PrevPage 25 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified