Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 951–1000 of 1723 papers

Title	Date	Tasks	Status
MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation	Jul 10, 2025	NeRFObject	—Unverified
MVLidarNet: Real-Time Multi-Class Scene Understanding for Autonomous Driving Using Multiple Views	Jun 9, 2020	Autonomous DrivingDecoder	—Unverified
N2F2: Hierarchical Scene Understanding with Nested Neural Feature Fields	Mar 16, 2024	Scene Understanding	—Unverified
Natural Language Guided Visual Relationship Detection	Nov 16, 2017	Relationship DetectionScene Understanding	—Unverified
NavigationNet: A Large-scale Interactive Indoor Navigation Dataset	Aug 25, 2018	Deep Reinforcement Learningreinforcement-learning	—Unverified
Navigation-Oriented Scene Understanding for Robotic Autonomy: Learning to Segment Driveability in Egocentric Images	Sep 15, 2021	Autonomous NavigationDecision Making	—Unverified
DGOcc: Depth-aware Global Query-based Network for Monocular 3D Occupancy Prediction	Apr 10, 2025	GPUPrediction	—Unverified
Near, far: Patch-ordering enhances vision foundation models' scene understanding	Aug 20, 2024	GPUScene Understanding	—Unverified
Unsupervised Domain Adaptation for LiDAR Panoptic Segmentation	Sep 30, 2021	Autonomous DrivingAutonomous Vehicles	—Unverified
Designing DNNs for a trade-off between robustness and processing performance in embedded devices	Dec 4, 2024	Autonomous DrivingQuantization	—Unverified
Neural Implicit Dense Semantic SLAM	Apr 27, 2023	3D geometryScene Understanding	—Unverified
Neural Mesh Refiner for 6-DoF Pose Estimation	Mar 17, 2020	Autonomous DrivingInstance Segmentation	—Unverified
Neural Part Priors: Learning to Optimize Part-Based Object Completion in RGB-D Scans	Mar 17, 2022	3D Object Recognitionglobal-optimization	—Unverified
Neural Projection Mapping Using Reflectance Fields	Jun 11, 2023	Scene Understanding	—Unverified
Neural Radiance Field-based Visual Rendering: A Comprehensive Review	Mar 31, 2024	NeRFScene Understanding	—Unverified
Zero-Shot Semantic Segmentation via Spatial and Multi-Scale Aware Visual Class Embedding	Nov 30, 2021	Domain AdaptationLanguage Modeling	—Unverified
Neural Radiance Fields for the Real World: A Survey	Jan 22, 2025	Scene UnderstandingSurvey	—Unverified
Neural Rendering in a Room: Amodal 3D Understanding and Free-Viewpoint Rendering for the Closed Scene Composed of Pre-Captured Objects	May 5, 2022	Data AugmentationNeural Rendering	—Unverified
DEF-oriCORN: efficient 3D scene understanding for robust language-directed manipulation without demonstrations	Jul 31, 2024	Motion PlanningScene Understanding	—Unverified
Neural RGB(r)D Sensing: Depth and Uncertainty From a Video Camera	Jun 1, 2019	3D Reconstruction3D Scene Reconstruction	—Unverified
Neural Scene De-Rendering	Jul 1, 2017	DecoderImage Captioning	—Unverified
Neuromorphic Visual Scene Understanding with Resonator Networks	Aug 26, 2022	Scene UnderstandingTranslation	—Unverified
Designing Deep Networks for Surface Normal Estimation	Nov 18, 2014	Scene UnderstandingSurface Normal Estimation	—Unverified
Newtonian Scene Understanding: Unfolding the Dynamics of Objects in Static Images	Jun 1, 2016	ObjectScene Understanding	—Unverified
Next-Best-Trajectory Planning of Robot Manipulators for Effective Observation and Exploration	Mar 28, 2025	Computational EfficiencyObject Reconstruction	—Unverified
Unsupervised Foveal Vision Neural Networks with Top-Down Attention	Oct 18, 2020	ObjectObject Recognition	—Unverified
NIS-SLAM: Neural Implicit Semantic RGB-D SLAM for 3D Consistent Scene Understanding	Jul 30, 2024	Scene UnderstandingSimultaneous Localization and Mapping	—Unverified
Design and Evaluation of Deep Learning-Based Dual-Spectrum Image Fusion Methods	Jun 9, 2025	FairnessScene Understanding	—Unverified
Unsupervised Image Segmentation by Mutual Information Maximization and Adversarial Regularization	Jul 1, 2021	Image SegmentationScene Understanding	—Unverified
Non-maximum Suppression Also Closes the Variational Approximation Gap of Multi-object Variational Autoencoders	Jan 1, 2021	ObjectRepresentation Learning	—Unverified
Not All Relations are Equal: Mining Informative Labels for Scene Graph Generation	Nov 26, 2021	AllGraph Generation	—Unverified
Depth Not Needed - An Evaluation of RGB-D Feature Encodings for Off-Road Scene Understanding by Convolutional Neural Network	Jan 4, 2018	Autonomous Vehiclesroad scene understanding	—Unverified
Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects of Context in Classification and Segmentation	Dec 17, 2018	ClassificationData Augmentation	—Unverified
Not Using the Car to See the Sidewalk -- Quantifying and Controlling the Effects of Context in Classification and Segmentation	Jun 1, 2019	Data AugmentationGeneral Classification	—Unverified
Novel 3D Scene Understanding Applications From Recurrence in a Single Image	Oct 14, 2022	Scene UnderstandingTranslation	—Unverified
Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views	Aug 22, 2023	NeRFNeural Rendering	—Unverified
NuGrounding: A Multi-View 3D Visual Grounding Framework in Autonomous Driving	Mar 28, 2025	3D visual groundingAutonomous Driving	—Unverified
Number-Adaptive Prototype Learning for 3D Point Cloud Semantic Segmentation	Oct 18, 2022	3D Semantic SegmentationScene Understanding	—Unverified
Depth Estimation using Weighted-loss and Transfer Learning	Apr 11, 2024	Autonomous VehiclesDecoder	—Unverified
O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation	Apr 10, 2024	Image SegmentationObject	—Unverified
DepthCut: Improved Depth Edge Estimation Using Multiple Unreliable Channels	May 22, 2017	Scene UnderstandingSegmentation	—Unverified
Object-agnostic Affordance Categorization via Unsupervised Learning of Graph Embeddings	Mar 30, 2023	ObjectScene Understanding	—Unverified
Object as Distribution	Jul 25, 2019	Autonomous DrivingInstance Segmentation	—Unverified
Why my photos look sideways or upside down? Detecting Canonical Orientation of Images using Convolutional Neural Networks	Dec 4, 2017	Object RecognitionScene Understanding	—Unverified
Object-Aware DINO (Oh-A-Dino): Enhancing Self-Supervised Representations for Multi-Object Instance Retrieval	Mar 12, 2025	ObjectRetrieval	—Unverified
Object Aware Egocentric Online Action Detection	Jun 3, 2024	Action DetectionObject	—Unverified
AccidentGPT: Accident Analysis and Prevention from V2X Environmental Perception with Multi-modal Large Model	Dec 20, 2023	Autonomous DrivingScene Understanding	—Unverified
Object-Centric Scene Representations using Active Inference	Feb 7, 2023	ObjectScene Understanding	—Unverified
Deployment of Deep Neural Networks for Object Detection on Edge AI Devices with Runtime Optimization	Aug 18, 2021	2D Object Detection3D Object Detection	—Unverified
Object-level 3D Semantic Mapping using a Network of Smart Edge Sensors	Nov 21, 2022	ObjectPose Estimation	—Unverified

Show:10 25 50

← PrevPage 20 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified