SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 451475 of 1723 papers

TitleStatusHype
The Cityscapes Dataset for Semantic Urban Scene UnderstandingCode1
The Coralscapes Dataset: Semantic Scene Understanding in Coral ReefsCode1
ThreeDWorld: A Platform for Interactive Multi-Modal Physical SimulationCode1
Cityscapes-Panoptic-Parts and PASCAL-Panoptic-Parts datasets for Scene UnderstandingCode1
Efficient Multi-Task RGB-D Scene Analysis for Indoor EnvironmentsCode1
CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D NetworksCode1
From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object DetectionCode1
F-ViTA: Foundation Model Guided Visible to Thermal TranslationCode1
Egocentric Scene Understanding via Multimodal Spatial RectifierCode1
Towards In-context Scene UnderstandingCode1
CamContextI2V: Context-aware Controllable Video GenerationCode1
ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D DataCode1
3UR-LLM: An End-to-End Multimodal Large Language Model for 3D Scene UnderstandingCode1
Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth EstimationCode1
TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in RainCode1
From General to Specific: Informative Scene Graph Generation via Balance AdjustmentCode1
GroupContrast: Semantic-aware Self-supervised Representation Learning for 3D UnderstandingCode1
Training-Free Hierarchical Scene Understanding for Gaussian Splatting with Superpoint GraphsCode1
Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor SceneCode1
TransKD: Transformer Knowledge Distillation for Efficient Semantic SegmentationCode1
TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic SegmentationCode1
Instance Segmentation in 3D Scenes using Semantic Superpoint Tree NetworksCode1
Uncertainty-aware Panoptic SegmentationCode1
Uncertainty-Driven Active Vision for Implicit Scene ReconstructionCode1
Microsoft COCO: Common Objects in ContextCode1
Show:102550
← PrevPage 19 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified