Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1501–1525 of 1723 papers

Title	Date	Tasks	Status
Taskology: Utilizing Task Relations at Scale	May 14, 2020	Depth EstimationMotion Estimation	—Unverified
Application of Vision-Language Model to Pedestrians Behavior and Scene Understanding in Autonomous Driving	Jan 12, 2025	Autonomous DrivingDecision Making	—Unverified
TB-HSU: Hierarchical 3D Scene Understanding with Contextual Affordances	Dec 7, 2024	Multi-Task LearningObject	—Unverified
Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding	May 11, 2025	2D Semantic SegmentationDenoising	—Unverified
Temporal DINO: A Self-supervised Video Strategy to Enhance Action Prediction	Aug 8, 2023	Activity RecognitionAutonomous Driving	—Unverified
3DArticCyclists: Generating Synthetic Articulated 8D Pose-Controllable Cyclist Data for Computer Vision Applications	Oct 14, 2024	3DGS3D Reconstruction	—Unverified
Temporal Propagation of Asymmetric Feature Pyramid for Surgical Scene Segmentation	Apr 18, 2025	Scene SegmentationScene Understanding	—Unverified
Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models	Oct 10, 2023	ObjectObject Tracking	—Unverified
Application of Multimodal Large Language Models in Autonomous Driving	Dec 21, 2024	Autonomous DrivingDecision Making	—Unverified
Test-Time Adaptation for Nighttime Color-Thermal Semantic Segmentation	Jul 10, 2023	Scene UnderstandingSemantic Segmentation	—Unverified
Test-Time Intensity Consistency Adaptation for Shadow Detection	Oct 10, 2024	DecoderDiversity	—Unverified
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions	Oct 10, 2023	Graph GenerationPanoptic Scene Graph Generation	—Unverified
A pooling based scene text proposal technique for scene text reading in the wild	Nov 25, 2018	Scene UnderstandingText Spotting	—Unverified
APARATE: Adaptive Adversarial Patch for CNN-based Monocular Depth Estimation for Autonomous Navigation	Mar 2, 2023	Autonomous DrivingAutonomous Navigation	—Unverified
Text-to-Image GAN with Pretrained Representations	Dec 30, 2024	Domain GeneralizationImage Generation	—Unverified
Anticipating Object State Changes in Long Procedural Videos	May 21, 2024	ObjectObject State Change Classification	—Unverified
Texture Underfitting for Domain Adaptation	Aug 29, 2019	Autonomous DrivingDomain Adaptation	—Unverified
TGOSPA Metric Parameters Selection and Evaluation for Visual Multi-object Tracking	Dec 11, 2024	Multi-Object TrackingObject Tracking	—Unverified
TGP: Two-modal occupancy prediction with 3D Gaussian and sparse points for 3D Environment Awareness	Mar 13, 2025	Autonomous DrivingPrediction	—Unverified
What Can I Do Around Here? Deep Functional Scene Understanding for Cognitive Robots	Jan 29, 2016	image-classificationImage Classification	—Unverified
Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions	Sep 11, 2018	Question AnsweringScene Understanding	—Unverified
Answerability Fields: Answerable Location Estimation via Diffusion Models	Jul 26, 2024	Question AnsweringScene Understanding	—Unverified
The Devil is in the Boundary: Exploiting Boundary Representation for Basis-based Instance Segmentation	Nov 26, 2020	Instance SegmentationScene Understanding	—Unverified
The H3D Dataset for Full-Surround 3D Multi-Object Detection and Tracking in Crowded Urban Scenes	Mar 4, 2019	3D Object DetectionObject	—Unverified
An Intelligent Safety System for Human-Centered Semi-Autonomous Vehicles	Dec 10, 2018	Autonomous DrivingAutonomous Vehicles	—Unverified

Show:10 25 50

← PrevPage 61 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified