SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 291-300 of 1723 papers

Title | Status | Hype
Deep Learning for Event-based Vision: A Comprehensive Survey and Benchmarks | Code | 1
Dense Audio-Visual Event Localization under Cross-Modal Consistency and Multi-Temporal Granularity Collaboration | Code | 1
Complementary Random Masking for RGB-Thermal Semantic Segmentation | Code | 1
Detecting Human-Object Interaction via Fabricated Compositional Learning | Code | 1
3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation | Code | 1
Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection | Code | 1
DIP: Unsupervised Dense In-Context Post-training of Visual Representations | Code | 1
DI-V2X: Learning Domain-Invariant Representation for Vehicle-Infrastructure Collaborative 3D Object Detection | Code | 1
A Survey of World Models for Autonomous Driving | Code | 1
Affect2MM: Affective Analysis of Multimedia Content Using Emotion Causality | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | - | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | - | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | - | Unverified