Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 526–550 of 1723 papers

Title	Date	Tasks	Status
End-to-End Race Driving with Deep Reinforcement Learning	Jul 6, 2018	Deep Reinforcement LearningDomain Adaptation	—Unverified
End-to-end Autonomous Driving using Deep Learning: A Systematic Review	Aug 27, 2023	Autonomous Drivingobject-detection	—Unverified
Can LVLMs Obtain a Driver's License? A Benchmark Towards Reliable AGI for Autonomous Driving	Sep 4, 2024	Autonomous DrivingDecision Making	—Unverified
Endo-TTAP: Robust Endoscopic Tissue Tracking via Multi-Facet Guided Attention and Hybrid Flow-point Supervision	Mar 28, 2025	Optical Flow EstimationPoint Tracking	—Unverified
Can Large Multimodal Models Understand Agricultural Scenes? Benchmarking with AgroMind	May 18, 2025	BenchmarkingScene Understanding	—Unverified
A Reinforcement Learning Approach to Target Tracking in a Camera Network	Jul 26, 2018	Q-Learningreinforcement-learning	—Unverified
Empowering Large Language Models with 3D Situation Awareness	Mar 29, 2025	Scene Understanding	—Unverified
Empowering cyberphysical systems of systems with intelligence	Jul 5, 2021	Decision MakingManagement	—Unverified
Can Foundation Models Perform Zero-Shot Task Specification For Robot Manipulation?	Apr 23, 2022	Robot ManipulationScene Understanding	—Unverified
EML-NET:An Expandable Multi-Layer NETwork for Saliency Prediction	May 2, 2018	Saliency PredictionScene Understanding	—Unverified
Can DeepSeek Reason Like a Surgeon? An Empirical Evaluation for Vision-Language Understanding in Robotic-Assisted Surgery	Mar 29, 2025	Action UnderstandingInstrument Recognition	—Unverified
A Reflectance Based Method For Shadow Detection and Removal	Jul 11, 2018	Detecting ShadowsScene Understanding	—Unverified
A diffusion and clustering-based approach for finding coherent motions and understanding crowd scenes	Feb 16, 2016	ClusteringOptical Flow Estimation	—Unverified
Embracing Diffraction: A Paradigm Shift in Wireless Sensing and Communication	May 2, 2025	Scene Understanding	—Unverified
EmbRACE-3K: Embodied Reasoning and Action in Complex Environments	Jul 14, 2025	Scene UnderstandingSpatial Reasoning	—Unverified
Embodied Visual Active Learning for Semantic Segmentation	Dec 17, 2020	Active LearningDeep Reinforcement Learning	—Unverified
Embodied VideoAgent: Persistent Memory from Egocentric Videos and Embodied Sensors Enables Dynamic Scene Understanding	Dec 31, 2024	Robot ManipulationScene Understanding	—Unverified
Camera-Radar Perception for Autonomous Vehicles and ADAS: Concepts, Datasets and Metrics	Mar 8, 2023	Autonomous VehiclesScene Understanding	—Unverified
Are Cars Just 3D Boxes? - Jointly Estimating the 3D Shape of Multiple Objects	Jun 1, 2014	3D geometry3D Shape Modeling	—Unverified
Embodied Scene Understanding for Vision Language Models via MetaVQA	Jan 15, 2025	Decision MakingQuestion Answering	—Unverified
Camera-Only Bird's Eye View Perception: A Neural Approach to LiDAR-Free Environmental Mapping for Autonomous Vehicles	May 9, 2025	Autonomous NavigationAutonomous Vehicles	—Unverified
Camera Control at the Edge with Language Models for Scene Understanding	May 9, 2025	Language ModelingLanguage Modelling	—Unverified
Addressing the Sim2Real Gap in Robotic 3D Object Classification	Oct 28, 2019	3D Object ClassificationClassification	—Unverified
3D Shape Augmentation with Content-Aware Shape Resizing	May 15, 2024	3D GenerationScene Understanding	—Unverified
Elastic Interaction Energy-Informed Real-Time Traffic Scene Perception	Oct 2, 2023	Autonomous DrivingImage Segmentation	—Unverified

Show:10 25 50

← PrevPage 22 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified