Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 951–1000 of 1723 papers

Title	Date	Tasks	Status
ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation	Sep 7, 2020	Autonomous Driving, Domain Adaptation	Unverified
ESGNN: Towards Equivariant Scene Graph Neural Network for 3D Scene Understanding	Jun 30, 2024	Graph Generation, Graph Neural Network	Unverified
Estimating Depth from Monocular Images as Classification Using Deep Fully Convolutional Residual Networks	May 8, 2016	Depth Estimation, General Classification	Unverified
Evaluating Multimodal Language Models as Visual Assistants for Visually Impaired Users	Mar 28, 2025	Object Recognition, Reading Comprehension	Unverified
Evaluating the Impact of Point Cloud Colorization on Semantic Segmentation Accuracy	Oct 9, 2024	Colorization, Point Cloud Segmentation	Unverified
Evaluation of Multimodal Semantic Segmentation using RGB-D Data	Mar 31, 2021	Scene Understanding, Semantic Segmentation	Unverified
Event fields: Capturing light fields at high speed, resolution, and dynamic range	Dec 9, 2024	Depth Estimation, Scene Understanding	Unverified
Every SAM Drop Counts: Embracing Semantic Priors for Multi-Modality Image Fusion and Beyond	Mar 3, 2025	Infrared And Visible Image Fusion, Scene Understanding	Unverified
EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images	Mar 6, 2025	Depth Estimation, Depth Prediction	Unverified
EvSegSNN: Neuromorphic Semantic Segmentation for Event Data	Jun 20, 2024	Autonomous Vehicles, Decoder	Unverified
ExCap3D: Expressive 3D Scene Understanding via Object Captioning with Varying Detail	Mar 21, 2025	Object, Scene Understanding	Unverified
Exosense: A Vision-Based Scene Understanding System For Exoskeletons	Mar 21, 2024	Language Modelling, Motion Planning	Unverified
Expanding Frozen Vision-Language Models without Retraining: Towards Improved Robot Perception	Aug 31, 2023	Activity Recognition, Human Activity Recognition	Unverified
Explainable Scene Understanding with Qualitative Representations and Graph Neural Networks	Apr 17, 2025	Autonomous Driving, Scene Understanding	Unverified
Explicit3D: Graph Network with Spatial Inference for Single Image 3D Object Detection	Feb 13, 2023	3D Object Detection, Graph Generation	Unverified
Exploiting High Level Scene Cues in Stereo Reconstruction	Dec 1, 2015	3D Reconstruction, Scene Understanding	Unverified
Exploiting Temporal Coherence for Multi-modal Video Categorization	Feb 7, 2020	object-detection, Object Detection	Unverified
Exploiting the ConvLSTM: Human Action Recognition using Raw Depth Video-Based Recurrent Neural Networks	Jun 13, 2020	Action Recognition, Object Recognition	Unverified
Explore and Tell: Embodied Visual Captioning in 3D Environments	Aug 21, 2023	Image Captioning, Navigate	Unverified
Exploring Deep 3D Spatial Encodings for Large-Scale 3D Scene Understanding	Nov 29, 2020	Scene Understanding, Semantic Segmentation	Unverified
Exploring Self- and Cross-Triplet Correlations for Human-Object Interaction Detection	Jan 11, 2024	Human-Object Interaction Detection, Knowledge Distillation	Unverified
Extracting Zero-shot Common Sense from Large Language Models for Robot 3D Scene Understanding	Jun 9, 2022	Common Sense Reasoning, Scene Understanding	Unverified
Fabric Surface Characterization: Assessment of Deep Learning-based Texture Representations Using a Challenging Dataset	Mar 16, 2020	Material Recognition, Object Recognition	Unverified
Generalized 3D Self-supervised Learning Framework via Prompted Foreground-Aware Feature Contrast	Mar 11, 2023	3D Semantic Segmentation, Contrastive Learning	Unverified
Factored Neural Representation for Scene Understanding	Apr 21, 2023	Novel View Synthesis, Object	Unverified
Factor Graph based 3D Multi-Object Tracking in Point Clouds	Aug 12, 2020	3D Multi-Object Tracking, Multi-Object Tracking	Unverified
Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments	May 25, 2023	Continual Learning, Continual Semantic Segmentation	Unverified
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding	Nov 27, 2023	Continual Learning, Continual Semantic Segmentation	Unverified
FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping	Jun 4, 2024	3DGS, Scene Understanding	Unverified
Fast Neural Architecture Search for Lightweight Dense Prediction Networks	Mar 3, 2022	Depth Estimation, Image Super-Resolution	Unverified
Fast Object Detection with a Machine Learning Edge Device	Oct 5, 2024	Autonomous Navigation, CPU	Unverified
Feature discovery and visualization of robot mission data using convolutional autoencoders and Bayesian nonparametric topic models	Nov 30, 2017	Scene Understanding, Topic Models	Unverified
Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction	Mar 8, 2025	3DGS, image-classification	Unverified
Feature-Level Collaboration: Joint Unsupervised Learning of Optical Flow, Stereo Depth and Camera Motion	Jun 19, 2021	Camera Pose Estimation, Decoder	Unverified
Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding	Oct 6, 2022	Scene Understanding	Unverified
FHGS: Feature-Homogenized Gaussian Splatting	May 25, 2025	3DGS, Scene Understanding	Unverified
FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment	Apr 11, 2025	3D geometry, Natural Language Queries	Unverified
Fine-Grained Off-Road Semantic Segmentation and Mapping via Contrastive Learning	Mar 5, 2021	Binary Classification, Contrastive Learning	Unverified
FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation	Feb 13, 2025	Autonomous Driving, LIDAR Semantic Segmentation	Unverified
Floorplan-SLAM: A Real-Time, High-Accuracy, and Long-Term Multi-Session Point-Plane SLAM for Efficient Floorplan Reconstruction	Mar 1, 2025	GPU, Pose Estimation	Unverified
FlowCaps: Optical Flow Estimation with Capsule Networks For Action Recognition	Nov 8, 2020	Action Recognition, Optical Flow Estimation	Unverified
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding	Jan 3, 2024	object-detection, Object Detection	Unverified
FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents	Apr 11, 2025	3DGS, Navigate	Unverified
Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization	Apr 14, 2025	Benchmarking, Earth Observation	Unverified
Framework for 2D Ad placements in LinearTV	Dec 5, 2022	Occlusion Handling, Scene Understanding	Unverified
FreeQ-Graph: Free-form Querying with Semantic Consistent Scene Graph for 3D Scene Understanding	Jun 16, 2025	Form, Graph Generation	Unverified
Friction from Reflectance: Deep Reflectance Codes for Predicting Physical Surface Properties from One-Shot In-Field Reflectance	Mar 25, 2016	Friction, Scene Understanding	Unverified
FroDO: From Detections to 3D Objects	May 11, 2020	3D Reconstruction, Object	Unverified
FroDO: From Detections to 3D Objects	Jun 1, 2020	3D Reconstruction, Object	Unverified
From Flight to Insight: Semantic 3D Reconstruction for Aerial Inspection via Gaussian Splatting and Language-Guided Segmentation	May 23, 2025	3DGS, 3D Reconstruction	Unverified

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified