Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers


Showing 1601–1650 of 1723 papers

Title	Date	Tasks	Status
FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation	Feb 13, 2025	Autonomous Driving, LIDAR Semantic Segmentation	Unverified
Global Context Aware Convolutions for 3D Point Cloud Understanding	Aug 7, 2020	Point Cloud Classification, Retrieval	Unverified
Fine-Grained Off-Road Semantic Segmentation and Mapping via Contrastive Learning	Mar 5, 2021	Binary Classification, Contrastive Learning	Unverified
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane	May 27, 2024	3DGS, feature selection	Unverified
Going Beyond Multi-Task Dense Prediction with Synergy Embedding Models	Jan 1, 2024	Scene Understanding	Unverified
Towards Scene Understanding: Unsupervised Monocular Depth Estimation With Semantic-Aware Representation	Jun 1, 2019	Depth Estimation, Monocular Depth Estimation	Unverified
FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment	Apr 11, 2025	3D geometry, Natural Language Queries	Unverified
GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding	Nov 20, 2023	Instance Segmentation, NeRF	Unverified
FHGS: Feature-Homogenized Gaussian Splatting	May 25, 2025	3DGS, Scene Understanding	Unverified
GPT-4V Explorations: Mining Autonomous Driving	Jun 24, 2024	Autonomous Driving, Decision Making	Unverified
GPT-4V Takes the Wheel: Promises and Challenges for Pedestrian Behavior Prediction	Nov 24, 2023	Autonomous Driving, Autonomous Vehicles	Unverified
GP-VLS: A general-purpose vision language model for surgery	Jul 27, 2024	Language Modeling, Language Modelling	Unverified
Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving	Nov 6, 2024	Autonomous Driving, Multi-Object Tracking	Unverified
Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection	Apr 25, 2022	3D Object Detection, Graph structure learning	Unverified
Graph-Grounded LLMs: Leveraging Graphical Function Calling to Minimize LLM Hallucinations	Mar 13, 2025	Autonomous Vehicles, Knowledge Graphs	Unverified
Towards Scene Understanding with Detailed 3D Object Representations	Nov 18, 2014	3D Pose Estimation, Object	Unverified
Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding	Oct 6, 2022	Scene Understanding	Unverified
Grounded Objects and Interactions for Video Captioning	Nov 16, 2017	Object, Scene Understanding	Unverified
Feature-Level Collaboration: Joint Unsupervised Learning of Optical Flow, Stereo Depth and Camera Motion	Jun 19, 2021	Camera Pose Estimation, Decoder	Unverified
Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction	Mar 8, 2025	3DGS, image-classification	Unverified
GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model	Jan 1, 2025	Attribute, Language Modeling	Unverified
Feature discovery and visualization of robot mission data using convolutional autoencoders and Bayesian nonparametric topic models	Nov 30, 2017	Scene Understanding, Topic Models	Unverified
Fast Object Detection with a Machine Learning Edge Device	Oct 5, 2024	Autonomous Navigation, CPU	Unverified
Fast Neural Architecture Search for Lightweight Dense Prediction Networks	Mar 3, 2022	Depth Estimation, Image Super-Resolution	Unverified
GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding	Mar 6, 2024	NeRF, Scene Understanding	Unverified
HAECcity: Open-Vocabulary Scene Understanding of City-Scale Point Clouds with Superpoint Graph Clustering	Apr 18, 2025	Clustering, Graph Clustering	Unverified
FastLGS: Speeding up Language Embedded Gaussians with Feature Grid Mapping	Jun 4, 2024	3DGS, Scene Understanding	Unverified
Hallucinated Humans as the Hidden Context for Labeling 3D Scenes	Jun 1, 2013	Attribute, Object	Unverified
HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning	May 21, 2025	Autonomous Driving, Mamba	Unverified
Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images	Aug 27, 2024	Organ Segmentation, Scene Segmentation	Unverified
HAtt-Flow: Hierarchical Attention-Flow Mechanism for Group Activity Scene Graph Generation in Videos	Nov 28, 2023	Graph Generation, Scene Graph Generation	Unverified
FALCON: Fairness Learning via Contrastive Attention Approach to Continual Semantic Scene Understanding	Nov 27, 2023	Continual Learning, Continual Semantic Segmentation	Unverified
TransAVS: End-To-End Audio-Visual Segmentation With Transformer	May 12, 2023	Scene Understanding, Segmentation	Unverified
HeLiMOS: A Dataset for Moving Object Segmentation in 3D Point Clouds From Heterogeneous LiDAR Sensors	Aug 12, 2024	Scene Understanding, Semantic Segmentation	Unverified
Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments	May 25, 2023	Continual Learning, Continual Semantic Segmentation	Unverified
Heterogeneous Visual Features Fusion via Sparse Multimodal Machine	Jun 1, 2013	Feature Importance, image-classification	Unverified
HGS-Mapping: Online Dense Mapping Using Hybrid Gaussian Representation in Urban Scenes	Mar 29, 2024	3DGS, Autonomous Vehicles	Unverified
Towards seamless multi-view scene analysis from satellite to street-level	May 23, 2017	Change Detection, Earth Observation	Unverified
Hierarchical Scene Parsing by Weakly Supervised Learning with Image Descriptions	Sep 27, 2017	Descriptive, Object	Unverified
Algorithmic Design and Implementation of Unobtrusive Multistatic Serial LiDAR Image	Nov 8, 2019	Scene Understanding	Unverified
Towards Trustworthy Automated Driving through Qualitative Scene Understanding and Explanations	Mar 25, 2024	Scene Understanding	Unverified
Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction	Mar 9, 2019	Denoising, Object	Unverified
High-Accuracy Facial Depth Models derived from 3D Synthetic Data	Mar 26, 2020	3D Reconstruction, Depth Estimation	Unverified
Highway Driving Dataset for Semantic Video Segmentation	Nov 2, 2020	Autonomous Driving, Image Segmentation	Unverified
Factor Graph based 3D Multi-Object Tracking in Point Clouds	Aug 12, 2020	3D Multi-Object Tracking, Multi-Object Tracking	Unverified
HIS-GPT: Towards 3D Human-In-Scene Multimodal Understanding	Mar 17, 2025	Question Answering, Scene Understanding	Unverified
Factored Neural Representation for Scene Understanding	Apr 21, 2023	Novel View Synthesis, Object	Unverified
Generalized 3D Self-supervised Learning Framework via Prompted Foreground-Aware Feature Contrast	Mar 11, 2023	3D Semantic Segmentation, Contrastive Learning	Unverified
HOIverse: A Synthetic Scene Graph Dataset With Human Object Interactions	Jun 24, 2025	Graph Generation, Human-Object Interaction Detection	Unverified
Towards Unseen Triples: Effective Text-Image-joint Learning for Scene Graph Generation	Jun 23, 2023	Graph Generation, Scene Graph Generation	Unverified


Datasets: Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation); ADE20K val; Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified