SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 721730 of 1723 papers

TitleStatusHype
Multi-stage Factorized Spatio-Temporal Representation for RGB-D Action and Gesture RecognitionCode1
SUMMIT: Source-Free Adaptation of Uni-Modal Models to Multi-Modal TargetsCode1
Understanding Dark Scenes by Contrasting Multi-Modal ObservationsCode1
ScanNet++: A High-Fidelity Dataset of 3D Indoor ScenesCode2
Novel-view Synthesis and Pose Estimation for Hand-Object Interaction from Sparse Views0
Explore and Tell: Embodied Visual Captioning in 3D Environments0
Vision Relation Transformer for Unbiased Scene Graph GenerationCode1
Chat-3D: Data-efficiently Tuning Large Language Model for Universal Dialogue of 3D ScenesCode2
CASPNet++: Joint Multi-Agent Motion Prediction0
FocusFlow: Boosting Key-Points Optical Flow Estimation for Autonomous DrivingCode1
Show:102550
← PrevPage 73 of 173Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified