SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 451460 of 1723 papers

TitleStatusHype
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene UnderstandingCode1
TextSLAM: Visual SLAM with Planar Text FeaturesCode1
Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth EstimationCode1
The Cityscapes Dataset for Semantic Urban Scene UnderstandingCode1
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene UnderstandingCode1
CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D NetworksCode1
F-ViTA: Foundation Model Guided Visible to Thermal TranslationCode1
Toronto-3D: A Large-scale Mobile LiDAR Dataset for Semantic Segmentation of Urban RoadwaysCode1
Grounded Situation Recognition with TransformersCode1
Image Masking for Robust Self-Supervised Monocular Depth EstimationCode1
Show:102550
← PrevPage 46 of 173Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified