Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 781–790 of 1723 papers

Title	Date	Tasks	Status	Hype
Towards Label-free Scene Understanding by Vision Foundation Models	Jun 6, 2023	Image Classification	Code Available	1
Disaster Anomaly Detector via Deeper FCDDs for Explainable Initial Responses	Jun 5, 2023	Anomaly Detection, Disaster Response	Unverified	0
Recyclable Semi-supervised Method Based on Multi-model Ensemble for Video Scene Parsing	Jun 5, 2023	Scene Parsing, Scene Understanding	Unverified	0
Multi-CLIP: Contrastive Vision-Language Pre-training for Question Answering tasks in 3D Scenes	Jun 4, 2023	Common Sense Reasoning, Question Answering	Unverified	0
Towards In-context Scene Understanding	Jun 2, 2023	Depth Estimation, In-Context Learning	Code Available	1
Self-supervised Vision Transformers for 3D Pose Estimation of Novel Objects	May 31, 2023	3D Pose Estimation, Contrastive Learning	Code Available	0
Point-GCC: Universal Self-supervised 3D Scene Pre-training via Geometry-Color Contrast	May 31, 2023	3D Instance Segmentation, 3D Object Detection	Code Available	1
Dynamic Clustering Transformer Network for Point Cloud Segmentation	May 30, 2023	Clustering, Decoder	Unverified	0
Fine-Grained is Too Coarse: A Novel Data-Centric Approach for Efficient Scene Graph Generation	May 30, 2023	Graph Generation, Image Generation	Code Available	0
Multi-Scale Attention for Audio Question Answering	May 29, 2023	Audio Question Answering, Question Answering	Code Available	1

Benchmark Results

Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

ADE20K val

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified