Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.
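As a rough illustration of what "objects and their spatial relationships" can mean in practice, the sketch below derives simple relation triples from 2D bounding boxes. The `Box` type, the `spatial_relations` helper, the relation names, and the example coordinates are all hypothetical and are not taken from any paper listed here. It assumes image coordinates, where y grows downward.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Box:
    """Hypothetical axis-aligned bounding box in image coordinates (y grows downward)."""
    label: str
    x_min: float
    y_min: float
    x_max: float
    y_max: float


def spatial_relations(boxes):
    """Return (subject, relation, object) triples for every ordered pair of boxes."""
    triples = []
    for a in boxes:
        for b in boxes:
            if a is b:
                continue
            # a lies entirely to the left of b
            if a.x_max <= b.x_min:
                triples.append((a.label, "left_of", b.label))
            # a lies entirely above b (smaller y means higher in the image)
            if a.y_max <= b.y_min:
                triples.append((a.label, "above", b.label))
    return triples


# Illustrative scene; the coordinates are made up for the example.
scene = [
    Box("lamp", 10, 0, 30, 40),
    Box("table", 0, 50, 100, 90),
    Box("chair", 120, 30, 160, 95),
]
relations = spatial_relations(scene)
for triple in relations:
    print(triple)
```

Real systems typically work with 3D geometry, learned relation predictors, or scene graphs rather than hand-written box rules. The point here is only that the output is a set of relations between objects, not a list of isolated detections.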

Papers

Showing 1051–1100 of 1723 papers

Title	Date	Tasks	Status
DAWN: Vehicle Detection in Adverse Weather Nature Dataset	Aug 12, 2020	Autonomous Driving, Scene Understanding	Unverified
Data-Driven Scene Understanding with Adaptively Retrieved Exemplars	Feb 3, 2015	Scene Understanding, Semantic Segmentation	Unverified
OpenSplat3D: Open-Vocabulary 3D Instance Segmentation using Gaussian Splatting	Jun 9, 2025	3DGS, 3D Instance Segmentation	Unverified
OpenSU3D: Open World 3D Scene Understanding using Foundation Models	Jul 19, 2024	Scene Understanding, Spatial Reasoning	Unverified
OpenSUN3D: 1st Workshop Challenge on Open-Vocabulary 3D Scene Understanding	Feb 23, 2024	Scene Understanding	Unverified
Open Vocabulary 3D Scene Understanding via Geometry Guided Self-Distillation	Jul 18, 2024	Knowledge Distillation, Representation Learning	Unverified
Open-Vocabulary Octree-Graph for 3D Scene Understanding	Nov 25, 2024	Object, Scene Understanding	Unverified
Open-Vocabulary SAM3D: Towards Training-free Open-Vocabulary 3D Scene Understanding	May 24, 2024	Scene Understanding, Zero Shot Segmentation	Unverified
Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments	Mar 29, 2025	Navigate, Open Vocabulary Semantic Segmentation	Unverified
OW-Rep: Open World Object Detection with Instance Representation Learning	Sep 24, 2024	Novel Class Discovery, Object	Unverified
Optical flow and scene flow estimation: A survey	Feb 1, 2021	Action Recognition, Autonomous Driving	Unverified
Optimizing 3D Gaussian Splatting for Sparse Viewpoint Scene Reconstruction	Sep 5, 2024	3DGS, 3D Reconstruction	Unverified
DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference	Jul 16, 2021	Scene Understanding, Segmentation	Unverified
DaF-BEVSeg: Distortion-aware Fisheye Camera based Bird's Eye View Segmentation with Occlusion Reasoning	Apr 9, 2024	BEV Segmentation, Scene Understanding	Unverified
DAE-Fuse: An Adaptive Discriminative Autoencoder for Multi-Modality Image Fusion	Sep 16, 2024	Autonomous Driving, Autonomous Navigation	Unverified
Using Image Priors to Improve Scene Understanding	Oct 2, 2019	Autonomous Driving, Autonomous Vehicles	Unverified
Out of the Room: Generalizing Event-Based Dynamic Motion Segmentation for Complex Scenes	Mar 7, 2024	Motion Segmentation, Optical Flow Estimation	Unverified
CYCLO: Cyclic Graph Transformer Approach to Multi-Object Relationship Modeling in Aerial Videos	Jun 3, 2024	Graph Generation, Scene Graph Generation	Unverified
V3LMA: Visual 3D-enhanced Language Model for Autonomous Driving	Apr 30, 2025	Autonomous Driving, Decision Making	Unverified
Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation	Apr 2, 2025	3D Semantic Segmentation, Adversarial Attack	Unverified
Accelerating deep neural networks for efficient scene understanding in automotive cyber-physical systems	Jul 19, 2021	Model Compression, object-detection	Unverified
Cross-modal Learning for Multi-modal Video Categorization	Mar 7, 2020	Activity Recognition, object-detection	Unverified
Panoptic Edge Detection	Jun 3, 2019	Edge Detection, object-detection	Unverified
Cross-Dataset Collaborative Learning for Semantic Segmentation in Autonomous Driving	Mar 21, 2021	3D Semantic Segmentation, Autonomous Driving	Unverified
COUNT Forest: CO-Voting Uncertain Number of Targets Using Random Forest for Crowd Density Estimation	Dec 1, 2015	Density Estimation, Scene Understanding	Unverified
P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding	Dec 24, 2020	Contrastive Learning, Representation Learning	Unverified
PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing	May 11, 2018	Depth Estimation, Multi-Task Learning	Unverified
PADriver: Towards Personalized Autonomous Driving	May 8, 2025	Autonomous Driving, Language Modeling	Unverified
PAg-NeRF: Towards fast and efficient end-to-end panoptic 3D representations for agricultural robotics	Sep 11, 2023	3D Reconstruction, Camera Localization	Unverified
PanoContext-Former: Panoramic Total Scene Understanding with a Transformer	May 21, 2023	3D Object Detection, object-detection	Unverified
PanoGS: Gaussian-based Panoptic Segmentation for 3D Open Vocabulary Scene Understanding	Mar 23, 2025	3DGS, Decoder	Unverified
PanoMixSwap Panorama Mixing via Structural Swapping for Indoor Scene Understanding	Sep 18, 2023	Data Augmentation, Diversity	Unverified
CoT-Drive: Efficient Motion Forecasting for Autonomous Driving with LLMs and Chain-of-Thought Prompting	Mar 10, 2025	Autonomous Driving, Knowledge Distillation	Unverified
Learning Segmented 3D Gaussians via Efficient Feature Unprojection for Zero-shot Neural Scene Segmentation	Jan 11, 2024	Decoder, Panoptic Segmentation	Unverified
CoPa-SG: Dense Scene Graphs with Parametric and Proto-Relations	Jun 26, 2025	Graph Generation, Relation	Unverified
Convolutional Patch Networks with Spatial Prior for Road Detection and Urban Scene Understanding	Feb 23, 2015	Scene Understanding	Unverified
Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding	Jan 28, 2025	object-detection, Object Detection	Unverified
Panoptic Out-of-Distribution Segmentation	Oct 18, 2023	Data Augmentation, Instance Segmentation	Unverified
Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation	Apr 6, 2024	Image Captioning, Instance Segmentation	Unverified
PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction	Jul 1, 2024	3D Panoptic Segmentation, Instance Segmentation	Unverified
Context-Dependent Diffusion Network for Visual Relationship Detection	Sep 11, 2018	Diversity, Object	Unverified
Panoptic Segmentation Meets Remote Sensing	Nov 23, 2021	Panoptic Segmentation, Scene Understanding	Unverified
PanopticSplatting: End-to-End Panoptic Gaussian Splatting	Mar 23, 2025	global-optimization, NeRF	Unverified
Context-Aware Human Behavior Prediction Using Multimodal Large Language Models: Challenges and Insights	Apr 1, 2025	Activity Prediction, Domain Generalization	Unverified
Wireless Sensing With Deep Spectrogram Network and Primitive Based Autoregressive Hybrid Channel Model	Apr 21, 2021	Dataset Generation, Scene Understanding	Unverified
Content-Aware Preserving Image Generation	Nov 15, 2024	Image Generation, Scene Understanding	Unverified
Configurable 3D Scene Synthesis and 2D Image Rendering with Per-Pixel Ground Truth using Stochastic Grammars	Apr 1, 2017	Benchmarking, Object	Unverified
Real-time Approximate Bayesian Computation for Scene Understanding	May 22, 2019	Scene Understanding	Unverified
PAPooling: Graph-based Position Adaptive Aggregation of Local Geometry in Point Clouds	Nov 28, 2021	3D Shape Classification, graph construction	Unverified
VideoGameBunny: Towards vision assistants for video games	Jul 21, 2024	Image Captioning, Scene Understanding	Unverified

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified