Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1601–1625 of 1723 papers

Title	Date	Tasks	Status
FLARES: Fast and Accurate LiDAR Multi-Range Semantic Segmentation	Feb 13, 2025	Autonomous Driving, LIDAR Semantic Segmentation	Unverified
Global Context Aware Convolutions for 3D Point Cloud Understanding	Aug 7, 2020	Point Cloud Classification, Retrieval	Unverified
Fine-Grained Off-Road Semantic Segmentation and Mapping via Contrastive Learning	Mar 5, 2021	Binary Classification, Contrastive Learning	Unverified
GOI: Find 3D Gaussians of Interest with an Optimizable Open-vocabulary Semantic-space Hyperplane	May 27, 2024	3DGS, feature selection	Unverified
Going Beyond Multi-Task Dense Prediction with Synergy Embedding Models	Jan 1, 2024	Scene Understanding	Unverified
Towards Scene Understanding: Unsupervised Monocular Depth Estimation With Semantic-Aware Representation	Jun 1, 2019	Depth Estimation, Monocular Depth Estimation	Unverified
FindAnything: Open-Vocabulary and Object-Centric Mapping for Robot Exploration in Any Environment	Apr 11, 2025	3D geometry, Natural Language Queries	Unverified
GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding	Nov 20, 2023	Instance Segmentation, NeRF	Unverified
FHGS: Feature-Homogenized Gaussian Splatting	May 25, 2025	3DGS, Scene Understanding	Unverified
GPT-4V Explorations: Mining Autonomous Driving	Jun 24, 2024	Autonomous Driving, Decision Making	Unverified
GPT-4V Takes the Wheel: Promises and Challenges for Pedestrian Behavior Prediction	Nov 24, 2023	Autonomous Driving, Autonomous Vehicles	Unverified
GP-VLS: A general-purpose vision language model for surgery	Jul 27, 2024	Language Modeling, Language Modelling	Unverified
Graph-Based Multi-Modal Sensor Fusion for Autonomous Driving	Nov 6, 2024	Autonomous Driving, Multi-Object Tracking	Unverified
Graph-DETR3D: Rethinking Overlapping Regions for Multi-View 3D Object Detection	Apr 25, 2022	3D Object Detection, Graph structure learning	Unverified
Graph-Grounded LLMs: Leveraging Graphical Function Calling to Minimize LLM Hallucinations	Mar 13, 2025	Autonomous Vehicles, Knowledge Graphs	Unverified
Towards Scene Understanding with Detailed 3D Object Representations	Nov 18, 2014	3D Pose Estimation, Object	Unverified
Feature-Realistic Neural Fusion for Real-Time, Open Set Scene Understanding	Oct 6, 2022	Scene Understanding	Unverified
Grounded Objects and Interactions for Video Captioning	Nov 16, 2017	Object, Scene Understanding	Unverified
Feature-Level Collaboration: Joint Unsupervised Learning of Optical Flow, Stereo Depth and Camera Motion	Jun 19, 2021	Camera Pose Estimation, Decoder	Unverified
Feature-EndoGaussian: Feature Distilled Gaussian Splatting in Surgical Deformable Scene Reconstruction	Mar 8, 2025	3DGS, image-classification	Unverified
GroundingFace: Fine-grained Face Understanding via Pixel Grounding Multimodal Large Language Model	Jan 1, 2025	Attribute, Language Modeling	Unverified
Feature discovery and visualization of robot mission data using convolutional autoencoders and Bayesian nonparametric topic models	Nov 30, 2017	Scene Understanding, Topic Models	Unverified
Fast Object Detection with a Machine Learning Edge Device	Oct 5, 2024	Autonomous Navigation, CPU	Unverified
Fast Neural Architecture Search for Lightweight Dense Prediction Networks	Mar 3, 2022	Depth Estimation, Image Super-Resolution	Unverified
GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding	Mar 6, 2024	NeRF, Scene Understanding	Unverified

Datasets: Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation); ADE20K val; Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified