Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers


Showing 1251–1300 of 1723 papers

Title	Date	Tasks	Status
SurroundSDF: Implicit 3D Scene Understanding Based on Signed Distance Field	Mar 21, 2024	3D Scene Reconstruction, Autonomous Driving	Unverified
Survey of Action Recognition, Spotting and Spatio-Temporal Localization in Soccer -- Current Trends and Research Perspectives	Sep 21, 2023	Action Localization, Action Recognition	Unverified
Symbolic Graph Inference for Compound Scene Understanding	Oct 30, 2024	Question Answering, Scene Understanding	Unverified
Synergizing Contrastive Learning and Optimal Transport for 3D Point Cloud Domain Adaptation	Aug 27, 2023	Contrastive Learning, Domain Adaptation	Unverified
Syn-Mediverse: A Multimodal Synthetic Dataset for Intelligent Scene Understanding of Healthcare Facilities	Aug 6, 2023	Depth Estimation, Instance Segmentation	Unverified
SynthCam3D: Semantic Understanding With Synthetic Indoor Scenes	May 1, 2015	Scene Understanding, Segmentation	Unverified
Synthetic and Real Inputs for Tool Segmentation in Robotic Surgery	Jul 17, 2020	Deep Learning, Scene Understanding	Unverified
Tactical Decision for Multi-UGV Confrontation with a Vision-Language Model-Based Commander	Jul 15, 2025	Language Modeling, Language Modelling	Unverified
Tactile MNIST: Benchmarking Active Tactile Perception	Jun 3, 2025	Benchmarking, Scene Understanding	Unverified
TADFormer : Task-Adaptive Dynamic Transformer for Efficient Multi-Task Learning	Jan 8, 2025	Multi-Task Learning, parameter-efficient fine-tuning	Unverified
TADFormer: Task-Adaptive Dynamic TransFormer for Efficient Multi-Task Learning	Jan 1, 2025	Multi-Task Learning, parameter-efficient fine-tuning	Unverified
Talk-to-Resolve: Combining scene understanding and spatial dialogue to resolve granular task ambiguity for a collocated robot	Nov 22, 2021	Scene Understanding	Unverified
TanDepth: Leveraging Global DEMs for Metric Monocular Depth Estimation in UAVs	Sep 8, 2024	Depth Estimation, Monocular Depth Estimation	Unverified
TARS: Traffic-Aware Radar Scene Flow Estimation	Mar 13, 2025	Autonomous Driving, object-detection	Unverified
Taskology: Utilizing Task Relations at Scale	May 14, 2020	Depth Estimation, Motion Estimation	Unverified
TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances	Dec 7, 2024	Multi-Task Learning, Object	Unverified
Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding	May 11, 2025	2D Semantic Segmentation, Denoising	Unverified
Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction	Aug 8, 2023	Activity Recognition, Autonomous Driving	Unverified
Temporal Propagation of Asymmetric Feature Pyramid for Surgical Scene Segmentation	Apr 18, 2025	Scene Segmentation, Scene Understanding	Unverified
Test-Time Adaptation for Nighttime Color-Thermal Semantic Segmentation	Jul 10, 2023	Scene Understanding, Semantic Segmentation	Unverified
Test-Time Intensity Consistency Adaptation for Shadow Detection	Oct 10, 2024	Decoder, Diversity	Unverified
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions	Oct 10, 2023	Graph Generation, Panoptic Scene Graph Generation	Unverified
Text-to-Image GAN with Pretrained Representations	Dec 30, 2024	Domain Generalization, Image Generation	Unverified
Texture Underfitting for Domain Adaptation	Aug 29, 2019	Autonomous Driving, Domain Adaptation	Unverified
TGOSPA Metric Parameters Selection and Evaluation for Visual Multi-object Tracking	Dec 11, 2024	Multi-Object Tracking, Object Tracking	Unverified
TGP: Two-modal occupancy prediction with 3D Gaussian and sparse points for 3D Environment Awareness	Mar 13, 2025	Autonomous Driving, Prediction	Unverified
The Devil is in the Boundary: Exploiting Boundary Representation for Basis-based Instance Segmentation	Nov 26, 2020	Instance Segmentation, Scene Understanding	Unverified
The H3D Dataset for Full-Surround 3D Multi-Object Detection and Tracking in Crowded Urban Scenes	Mar 4, 2019	3D Object Detection, Object	Unverified
The Mapillary Vistas Dataset for Semantic Understanding of Street Scenes	Oct 1, 2017	Diversity, Image Segmentation	Unverified
The Right (Angled) Perspective: Improving the Understanding of Road Scenes Using Boosted Inverse Perspective Mapping	Dec 3, 2018	Autonomous Vehicles, Object Tracking	Unverified
These Magic Moments: Differentiable Uncertainty Quantification of Radiance Field Models	Mar 18, 2025	Decision Making, Scene Understanding	Unverified
θ-MRF: Capturing Spatial and Semantic Structure in the Parameters for Scene Understanding	Dec 1, 2011	Depth Estimation, object-detection	Unverified
The toulouse vanishing points dataset	Mar 11, 2015	Form, Scene Understanding	Unverified
TinyRS-R1: Compact Multimodal Language Model for Remote Sensing	May 17, 2025	Language Modeling, Language Modelling	Unverified
To complete or to estimate, that is the question: A Multi-Task Approach to Depth Completion and Monocular Depth Estimation	Aug 15, 2019	Autonomous Driving, Depth Completion	Unverified
TopoMask: Instance-Mask-Based Formulation for the Road Topology Problem via Transformer-Based Architecture	Jun 8, 2023	3D Lane Detection, Graph Neural Network	Unverified
TORNADO-Net: mulTiview tOtal vaRiatioN semAntic segmentation with Diamond inceptiOn module	Aug 24, 2020	3D Semantic Segmentation, Autonomous Driving	Unverified
Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning	Nov 6, 2018	Scene Understanding	Unverified
Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases	Jul 5, 2022	Object, Representation Learning	Unverified
Towards 3D Scene Understanding by Referring Synthetic Models	Mar 20, 2022	Scene Understanding, Transfer Learning	Unverified
Towards Adapting ImageNet to Reality: Scalable Domain Adaptation with Implicit Low-rank Transformations	Aug 20, 2013	Domain Adaptation, Scene Understanding	Unverified
Towards A Unified Agent with Foundation Models	Jul 18, 2023	Efficient Exploration, Reinforcement Learning (RL)	Unverified
Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation	Dec 13, 2022	3D Semantic Segmentation, Scene Understanding	Unverified
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering	Mar 24, 2022	Optical Character Recognition, Optical Character Recognition (OCR)	Unverified
Towards General Purpose Geometry-Preserving Single-View Depth Estimation	Sep 25, 2020	Depth Estimation, Diversity	Unverified
Towards Holistic Scene Understanding: Feedback Enabled Cascaded Classification Models	Dec 1, 2010	Classification, Depth Estimation	Unverified
Towards holistic scene understanding: Semantic segmentation and beyond	Jan 16, 2022	object-detection, Object Detection	Unverified
Towards Localizing Structural Elements: Merging Geometrical Detection with Semantic Verification in RGB-D Data	Sep 10, 2024	3D Plane Detection, 3d scene graph generation	Unverified
Towards Multimodal Multitask Scene Understanding Models for Indoor Mobile Agents	Sep 27, 2022	3D Object Detection, Autonomous Driving	Unverified
Towards Robust Algorithms for Surgical Phase Recognition via Digital Twin-based Scene Representation	Oct 26, 2024	Informativeness, Scene Understanding	Unverified


Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified