Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1701–1723 of 1723 papers

Title	Date	Tasks	Status
AP-MTL: Attention Pruned Multi-task Learning Model for Real-time Instrument Detection and Segmentation in Robot-assisted Surgery	Mar 10, 2020	Multi-Task LearningScene Understanding	CodeCode Available
3D Object Detection from Point Cloud via Voting Step Diffusion	Mar 21, 2024	3D Object DetectionObject	CodeCode Available
JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields	Apr 1, 2019	3D Instance Segmentation3D Semantic Instance Segmentation	CodeCode Available
Joint stereo 3D object detection and implicit surface reconstruction	Nov 25, 2021	3D Object DetectionHallucination	CodeCode Available
Interpretable Visual Understanding with Cognitive Attention Network	Aug 6, 2021	Scene UnderstandingVisual Commonsense Reasoning	CodeCode Available
Weakly Supervised Affordance Detection	Jul 1, 2017	Affordance DetectionObject	CodeCode Available
Interactive Learning for Semantic Segmentation in Earth Observation	Sep 23, 2020	Domain AdaptationEarth Observation	CodeCode Available
Semantic Segmentation with High Inference Speed in Off-Road Environments	Apr 10, 2023	2D Semantic SegmentationAutonomous Vehicles	CodeCode Available
Cognitive TransFuser: Semantics-guided Transformer-based Sensor Fusion for Improved Waypoint Prediction	Aug 4, 2023	Imitation LearningScene Understanding	CodeCode Available
BlitzNet: A Real-Time Deep Network for Scene Understanding	Aug 9, 2017	Autonomous DrivingObject	CodeCode Available
Semantic Understanding of Foggy Scenes with Purely Synthetic Data	Oct 9, 2019	Scene UnderstandingSelf-Driving Cars	CodeCode Available
UAVid: A Semantic Segmentation Dataset for UAV Imagery	Oct 24, 2018	4kAutonomous Driving	CodeCode Available
Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph Consensus	Oct 2, 2020	Scene UnderstandingSemantic Segmentation	CodeCode Available
D-Net: A Generalised and Optimised Deep Network for Monocular Depth Estimation	Sep 29, 2021	Depth EstimationMonocular Depth Estimation	CodeCode Available
Three for one and one for three: Flow, Segmentation, and Surface Normals	Jul 19, 2018	Optical Flow EstimationScene Understanding	CodeCode Available
Bidirectional Multi-scale Attention Networks for Semantic Segmentation of Oblique UAV Imagery	Feb 5, 2021	Earth ObservationScene Understanding	CodeCode Available
Where Does It End? -- Reasoning About Hidden Surfaces by Object Intersection Constraints	Apr 9, 2020	ObjectScene Understanding	CodeCode Available
Distance Matters in Human-Object Interaction Detection	Jul 5, 2022	Human-Object Interaction DetectionObject	CodeCode Available
InfoNorm: Mutual Information Shaping of Normals for Sparse-View Reconstruction	Jul 17, 2024	Scene UnderstandingSurface Reconstruction	CodeCode Available
Inferring Distributions Over Depth from a Single Image	Dec 12, 2019	Autonomous VehiclesBinary Classification	CodeCode Available
Visual Translation Embedding Network for Visual Relation Detection	Feb 27, 2017	Objectobject-detection	CodeCode Available
Where Does It End? - Reasoning About Hidden Surfaces by Object Intersection Constraints	Jun 1, 2020	ObjectScene Understanding	CodeCode Available
Dirty Pixels: Towards End-to-End Image Processing and Perception	Jan 23, 2017	Autonomous DrivingDeblurring	CodeCode Available

Show:10 25 50

← PrevPage 35 of 35Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified