Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 941–950 of 1723 papers

Title	Date	Tasks	Status
FunnyNet-W: Multimodal Learning of Funny Moments in Videos in the Wild	Jan 8, 2024	Language ModellingLarge Language Model	CodeCode Available
FMGS: Foundation Model Embedded 3D Gaussian Splatting for Holistic 3D Scene Understanding	Jan 3, 2024	object-detectionObject Detection	—Unverified
Going Beyond Multi-Task Dense Prediction with Synergy Embedding Models	Jan 1, 2024	Scene Understanding	—Unverified
Unsupervised 3D Structure Inference from Category-Specific Image Collections	Jan 1, 2024	Graph MatchingObject	—Unverified
When Visual Grounding Meets Gigapixel-level Large-scale Scenes: Benchmark and Approach	Jan 1, 2024	Scene UnderstandingVisual Grounding	—Unverified
SceneFun3D: Fine-Grained Functionality and Affordance Understanding in 3D Scenes	Jan 1, 2024	Instance SegmentationMotion Estimation	—Unverified
PanoRecon: Real-Time Panoptic 3D Reconstruction from Monocular Video	Jan 1, 2024	3D Panoptic Segmentation3D Reconstruction	CodeCode Available
Bilateral Adaptation for Human-Object Interaction Detection with Occlusion-Robustness	Jan 1, 2024	Human-Object Interaction Detectionobject-detection	—Unverified
Towards CLIP-driven Language-free 3D Visual Grounding via 2D-3D Relational Enhancement and Consistency	Jan 1, 2024	3D visual groundingRelation	CodeCode Available
Omni-Q: Omni-Directional Scene Understanding for Unsupervised Visual Grounding	Jan 1, 2024	Scene UnderstandingVisual Grounding	—Unverified

Show:10 25 50

← PrevPage 95 of 173Next →

All datasets Semantic Scene Understanding Challenge (passive actuation & ground-truth localisation)ADE20K val Semantic Scene Understanding Challenge (active actuation & ground-truth localisation)

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.44	—	Unverified
2	Team VGAI (TCS Research)	OMQ	0.37	—	Unverified
3	Demo_semantic_SLAM	OMQ	0.11	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CPN(ResNet-101)	Mean IoU	46.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ACRV Baseline	OMQ	0.35	—	Unverified