Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1301–1325 of 1723 papers

Title	Date	Tasks	Status
Towards Scene Understanding: Unsupervised Monocular Depth Estimation With Semantic-Aware Representation	Jun 1, 2019	Depth EstimationMonocular Depth Estimation	—Unverified
Towards Scene Understanding with Detailed 3D Object Representations	Nov 18, 2014	3D Pose EstimationObject	—Unverified
Towards seamless multi-view scene analysis from satellite to street-level	May 23, 2017	Change DetectionEarth Observation	—Unverified
Towards Trustworthy Automated Driving through Qualitative Scene Understanding and Explanations	Mar 25, 2024	Scene Understanding	—Unverified
Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation	Jun 23, 2023	Graph GenerationScene Graph Generation	—Unverified
Towards urban scenes understanding through polarization cues	Jun 3, 2021	Depth EstimationScene Understanding	—Unverified
Training-Free Model Merging for Multi-target Domain Adaptation	Jul 18, 2024	Domain AdaptationMulti-target Domain Adaptation	—Unverified
TransforMerger: Transformer-based Voice-Gesture Fusion for Robust Human-Robot Communication	Apr 2, 2025	Language ModelingLanguage Modelling	—Unverified
Transformers for Image-Goal Navigation	May 23, 2024	NavigateScene Understanding	—Unverified
TraveLLaMA: Facilitating Multi-modal Large Language Models to Understand Urban Scenes and Provide Travel Assistance	Apr 23, 2025	Question AnsweringScene Understanding	—Unverified
TS40K: a 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission System	May 22, 2024	3D Object Detection3D Semantic Segmentation	—Unverified
Two Stream Scene Understanding on Graph Embedding	Nov 12, 2023	Graph AttentionGraph Embedding	—Unverified
U4D: Unsupervised 4D Dynamic Scene Understanding	Jul 23, 2019	3D Pose EstimationInstance Segmentation	—Unverified
RemoteNet: Remote Sensing Image Segmentation Network based on Global-Local Information	Feb 25, 2023	DecoderImage Segmentation	—Unverified
Understanding and Evaluating Hallucinations in 3D Visual Language Models	Feb 18, 2025	DiversityScene Understanding	—Unverified
Understanding Bayesian Rooms Using Composite 3D Object Models	Jun 1, 2013	ObjectObject Recognition	—Unverified
Understanding Indoor Scenes Using 3D Geometric Phrases	Jun 1, 2013	General ClassificationObject	—Unverified
Understanding Pedestrian Behaviors From Stationary Crowd Groups	Jun 1, 2015	Event DetectionScene Understanding	—Unverified
Understanding Real World Indoor Scenes With Synthetic Data	Jun 1, 2016	Scene Understanding	—Unverified
Understand Scene Categories by Objects: A Semantic Regularized Scene Classifier Using Convolutional Neural Networks	Sep 22, 2015	ClassificationDiversity	—Unverified
Underwater Diffusion Attention Network with Contrastive Language-Image Joint Learning for Underwater Image Enhancement	May 26, 2025	Image Enhancementobject-detection	—Unverified
Unified Perception: Efficient Depth-Aware Video Panoptic Segmentation with Minimal Annotation Costs	Mar 3, 2023	Depth-aware Video Panoptic SegmentationPanoptic Segmentation	—Unverified
Unified Representation Space for 3D Visual Grounding	Jun 17, 2025	3D visual groundingContrastive Learning	—Unverified
Unified Scene Representation and Reconstruction for 3D Large Language Models	Apr 19, 2024	3D ReconstructionScene Understanding	—Unverified
Uni-Fusion: Universal Continuous Mapping	Mar 22, 2023	Scene Understanding	—Unverified

Show:10 25 50

← PrevPage 53 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified