SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 221-230 of 1723 papers

Title | Status | Hype
sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views | | 0
Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation | | 0
Event-aided Semantic Scene Completion | Code | 1
AquaticCLIP: A Vision-Language Foundation Model for Underwater Scene Analysis | | 0
Integrating LMM Planners and 3D Skill Policies for Generalizable Manipulation | | 0
Efficient Interactive 3D Multi-Object Removal | | 0
Contextual Self-paced Learning for Weakly Supervised Spatio-Temporal Video Grounding | | 0
PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding | | 0
Unveiling the Potential of iMarkers: Invisible Fiducial Markers for Advanced Robotics | | 0
HERMES: A Unified Self-Driving World Model for Simultaneous 3D Scene Understanding and Generation | Code | 3

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | | Unverified