Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.
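One common way to make "objects and their spatial relationships" concrete is a scene graph, stored as (subject, relation, object) triples. The sketch below is only a toy illustration and is not taken from any paper listed here. The names `SceneObject`, `spatial_relation`, and `build_scene_graph` are hypothetical, and the rule of comparing 2D box centres is an assumed simplification.

```python
from dataclasses import dataclass
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x_min, y_min, x_max, y_max), y grows downward


@dataclass
class SceneObject:
    """Hypothetical container for one detected object."""
    name: str
    box: Box


def centre(box: Box) -> Tuple[float, float]:
    """Centre point of an axis-aligned box."""
    x_min, y_min, x_max, y_max = box
    return ((x_min + x_max) / 2.0, (y_min + y_max) / 2.0)


def spatial_relation(a: SceneObject, b: SceneObject) -> str:
    """Coarse relation of a with respect to b, using box centres only (assumed rule)."""
    ax, ay = centre(a.box)
    bx, by = centre(b.box)
    dx, dy = ax - bx, ay - by
    # Pick the axis with the larger centre offset.
    if abs(dx) >= abs(dy):
        return "left of" if dx < 0 else "right of"
    return "above" if dy < 0 else "below"


def build_scene_graph(objects: List[SceneObject]) -> List[Tuple[str, str, str]]:
    """One (subject, relation, object) triple per ordered pair of distinct objects."""
    return [
        (a.name, spatial_relation(a, b), b.name)
        for a in objects
        for b in objects
        if a is not b
    ]


# Made-up detections, for illustration only.
lamp = SceneObject("lamp", (40, 10, 60, 50))
table = SceneObject("table", (20, 60, 100, 120))
chair = SceneObject("chair", (110, 70, 150, 130))

graph = build_scene_graph([lamp, table, chair])
for triple in graph:
    print(triple)
```

Real systems typically predict relations from learned features and 3D geometry rather than from a centre-offset rule. The triple structure is the part that carries over.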

Papers

Showing 951–975 of 1723 papers

Title	Date	Tasks	Status
ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation	Sep 7, 2020	Autonomous Driving, Domain Adaptation	Unverified
ESGNN: Towards Equivariant Scene Graph Neural Network for 3D Scene Understanding	Jun 30, 2024	Graph Generation, Graph Neural Network	Unverified
Estimating Depth from Monocular Images as Classification Using Deep Fully Convolutional Residual Networks	May 8, 2016	Depth Estimation, General Classification	Unverified
Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users	Mar 28, 2025	Object Recognition, Reading Comprehension	Unverified
Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy	Oct 9, 2024	Colorization, Point Cloud Segmentation	Unverified
Evaluation of Multimodal Semantic Segmentation using RGB-D Data	Mar 31, 2021	Scene Understanding, Semantic Segmentation	Unverified
Event fields: Capturing light fields at high speed, resolution, and dynamic range	Dec 9, 2024	Depth Estimation, Scene Understanding	Unverified
Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond	Mar 3, 2025	Infrared And Visible Image Fusion, Scene Understanding	Unverified
EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images	Mar 6, 2025	Depth Estimation, Depth Prediction	Unverified
EvSegSNN: Neuromorphic Semantic Segmentation for Event Data	Jun 20, 2024	Autonomous Vehicles, Decoder	Unverified
ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail	Mar 21, 2025	Object, Scene Understanding	Unverified
Exosense: A Vision-Based Scene Understanding System For Exoskeletons	Mar 21, 2024	Language Modelling, Motion Planning	Unverified
Expanding Frozen Vision-Language Models without Retraining: Towards Improved Robot Perception	Aug 31, 2023	Activity Recognition, Human Activity Recognition	Unverified
Explainable Scene Understanding with Qualitative Representations and Graph Neural Networks	Apr 17, 2025	Autonomous Driving, Scene Understanding	Unverified
Explicit3D: Graph Network with Spatial Inference for Single Image 3D Object Detection	Feb 13, 2023	3D Object Detection, Graph Generation	Unverified
Exploiting High Level Scene Cues in Stereo Reconstruction	Dec 1, 2015	3D Reconstruction, Scene Understanding	Unverified
Exploiting Temporal Coherence for Multi-modal Video Categorization	Feb 7, 2020	Object Detection	Unverified
Exploiting the ConvLSTM: Human Action Recognition using Raw Depth Video-Based Recurrent Neural Networks	Jun 13, 2020	Action Recognition, Object Recognition	Unverified
Explore and Tell: Embodied Visual Captioning in 3D Environments	Aug 21, 2023	Image Captioning, Navigate	Unverified
Exploring Deep 3D Spatial Encodings for Large-Scale 3D Scene Understanding	Nov 29, 2020	Scene Understanding, Semantic Segmentation	Unverified
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection	Jan 11, 2024	Human-Object Interaction Detection, Knowledge Distillation	Unverified
Extracting Zero-shot Common Sense from Large Language Models for Robot 3D Scene Understanding	Jun 9, 2022	Common Sense Reasoning, Scene Understanding	Unverified
Fabric Surface Characterization: Assessment of Deep Learning-based Texture Representations Using a Challenging Dataset	Mar 16, 2020	Material Recognition, Object Recognition	Unverified
Generalized 3D Self-supervised Learning Framework via Prompted Foreground-Aware Feature Contrast	Mar 11, 2023	3D Semantic Segmentation, Contrastive Learning	Unverified
Factored Neural Representation for Scene Understanding	Apr 21, 2023	Novel View Synthesis, Object	Unverified

Datasets: Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation), ADE20K val, Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified