Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers


Showing 576–600 of 1723 papers

Title	Date	Tasks	Status	Score
Deep Video Deblurring for Hand-Held Cameras	Jul 1, 2017	Deblurring, Image Deblurring	Code Available	5
Deep Video Deblurring	Nov 25, 2016	Deblurring, Image Deblurring	Code Available	5
Deep Surface Normal Estimation with Hierarchical RGB-D Fusion	Apr 6, 2019	Scene Understanding, Surface Normal Estimation	Code Available	5
A Critical Assessment of Visual Sound Source Localization Models Including Negative Audio	Oct 1, 2024	Scene Understanding, Sound Source Localization	Code Available	5
Omni-Recon: Harnessing Image-based Rendering for General-Purpose Neural Radiance Fields	Mar 17, 2024	3D Reconstruction, NeRF	Code Available	5
Object Attribute Matters in Visual Question Answering	Dec 20, 2023	Attribute, Graph Neural Network	Code Available	5
Object-aware Sound Source Localization via Audio-Visual Scene Understanding	Jan 1, 2025	Scene Understanding, Sound Source Localization	Code Available	5
Benchmarking Feature Upsampling Methods for Vision Foundation Models using Interactive Segmentation	May 4, 2025	Benchmarking, Feature Upsampling	Code Available	5
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer	Apr 3, 2019	Deep Reinforcement Learning, Reinforcement Learning	Code Available	5
Non-central panorama indoor dataset	Jan 30, 2024	Scene Understanding	Code Available	5
NextStop: An Improved Tracker For Panoptic LIDAR Segmentation Data	Jan 8, 2025	Autonomous Driving, Instance Segmentation	Code Available	5
Deeply Supervised Multimodal Attentional Translation Embeddings for Visual Relationship Detection	Feb 15, 2019	Relationship Detection, Scene Understanding	Code Available	5
Neural Radiance Field Codebooks	Jan 10, 2023	Object, Representation Learning	Code Available	5
Neighbor-Vote: Improving Monocular 3D Object Detection through Neighbor Distance Voting	Jul 6, 2021	3D Object Detection, Autonomous Driving	Code Available	5
Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera	Jan 9, 2019	3D Reconstruction, 3D Scene Reconstruction	Code Available	5
Deep Learning based Switching Filter for Impulsive Noise Removal in Color Images	Dec 3, 2019	Denoising, Image Denoising	Code Available	5
BACS: Background Aware Continual Semantic Segmentation	Apr 19, 2024	Autonomous Driving, Continual Learning	Code Available	5
Deep Learning–Based Scene Simplification for Bionic Vision	Jan 30, 2021	Deep Learning, Depth Estimation	Code Available	5
DeepIPCv2: LiDAR-powered Robust Environmental Perception and Navigational Control for Autonomous Vehicle	Jul 13, 2023	Autonomous Driving, Scene Understanding	Code Available	5
AVS-Net: Point Sampling with Adaptive Voxel Size for 3D Scene Understanding	Feb 27, 2024	3D Object Detection, 3D Part Segmentation	Code Available	5
Multi-Resolution Multi-Modal Sensor Fusion For Remote Sensing Data With Label Uncertainty	May 2, 2018	Scene Understanding, Sensor Fusion	Code Available	5
Multi-task Geometric Estimation of Depth and Surface Normal from Monocular 360° Images	Nov 4, 2024	Multi-Task Learning, Scene Understanding	Code Available	5
Deep Depth from Defocus: how can defocus blur improve 3D estimation using dense neural networks?	Sep 5, 2018	3D Reconstruction, Depth Estimation	Code Available	5
AVQACL: A Novel Benchmark for Audio-Visual Question Answering Continual Learning	Jan 1, 2025	Audio-visual Question Answering, Continual Learning	Code Available	5
Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation	Mar 3, 2021	Autonomous Driving, Depth Estimation	Code Available	5


Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified