Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1626–1650 of 1723 papers

Title	Date	Tasks	Status
Flow-based GAN for 3D Point Cloud Generation from a Single Image	Oct 8, 2022	Point Cloud GenerationScene Understanding	CodeCode Available
Scene Graph Generation from Objects, Phrases and Region Captions	Jul 31, 2017	Graph Generationobject-detection	CodeCode Available
Fine-Grained is Too Coarse: A Novel Data-Centric Approach for Efficient Scene Graph Generation	May 30, 2023	Graph GenerationImage Generation	CodeCode Available
Auxiliary Tasks in Multi-task Learning	May 16, 2018	Depth EstimationMulti-Task Learning	CodeCode Available
Auto-Embedding Generative Adversarial Networks for High Resolution Image Synthesis	Mar 27, 2019	Generative Adversarial NetworkImage Generation	CodeCode Available
Implicit Background Estimation for Semantic Segmentation	May 23, 2019	Scene UnderstandingSegmentation	CodeCode Available
SceneNet RGB-D: 5M Photorealistic Images of Synthetic Indoor Trajectories with Ground Truth	Dec 15, 2016	3D ReconstructionCamera Pose Estimation	CodeCode Available
Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings	Jun 24, 2022	Scene UnderstandingSemantic Segmentation	CodeCode Available
SceneNet: Understanding Real World Indoor Scenes With Synthetic Data	Nov 22, 2015	Scene Understanding	CodeCode Available
Fast Scene Understanding for Autonomous Driving	Aug 8, 2017	Autonomous DrivingDecoder	CodeCode Available
Swiss DINO: Efficient and Versatile Vision Framework for On-device Personal Object Search	Jul 10, 2024	Few-Shot LearningGPU	CodeCode Available
Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory	Jul 4, 2021	Question AnsweringScene Understanding	CodeCode Available
Leveraging Acoustic Images for Effective Self-Supervised Audio Representation Learning	Aug 1, 2020	Cross-Modal RetrievalRepresentation Learning	CodeCode Available
Attend, Infer, Repeat: Fast Scene Understanding with Generative Models	Mar 28, 2016	Scene Understanding	CodeCode Available
A New Lightweight Hybrid Graph Convolutional Neural Network -- CNN Scheme for Scene Classification using Object Detection Inference	Jul 19, 2024	Autonomous Vehiclesobject-detection	CodeCode Available
False Negative Reduction in Video Instance Segmentation using Uncertainty Estimates	Jun 28, 2021	Depth EstimationInstance Segmentation	CodeCode Available
3D Semantic Segmentation of Modular Furniture using rjMCMC	May 15, 2017	3D Semantic Segmentationfurniture segmentation	CodeCode Available
Uncertainty-aware LiDAR Panoptic Segmentation	Oct 10, 2022	Autonomous DrivingPanoptic Segmentation	CodeCode Available
Facing the Void: Overcoming Missing Data in Multi-View Imagery	May 21, 2022	Classificationimage-classification	CodeCode Available
COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images	Jan 26, 2016	DiversityGeneral Classification	CodeCode Available
CNN-based Lidar Point Cloud De-Noising in Adverse Weather	Dec 9, 2019	Autonomous VehiclesScene Understanding	CodeCode Available
AdaptVision: Dynamic Input Scaling in MLLMs for Versatile Scene Understanding	Aug 30, 2024	Language ModellingLarge Language Model	CodeCode Available
An efficient solution for semantic segmentation: ShuffleNet V2 with atrous separable convolutions	Feb 20, 2019	Autonomous DrivingScene Understanding	CodeCode Available
SCIM: Simultaneous Clustering, Inference, and Mapping for Open-World Semantic Scene Understanding	Jun 21, 2022	ClusteringObject Discovery	CodeCode Available
Extremely Fine-Grained Visual Classification over Resembling Glyphs in the Wild	Aug 25, 2024	Contrastive LearningFine-Grained Image Classification	CodeCode Available

Show:10 25 50

← PrevPage 66 of 69Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified