SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 951960 of 1723 papers

TitleStatusHype
Robust Multi-Modal Image Stitching for Improved Scene Understanding0
Cloud-Device Collaborative Learning for Multimodal Large Language Models0
BridgeNet: Comprehensive and Effective Feature Interactions via Bridge Feature for Multi-task Dense Predictions0
Object Attribute Matters in Visual Question AnsweringCode0
AccidentGPT: Accident Analysis and Prevention from V2X Environmental Perception with Multi-modal Large Model0
Language-Assisted 3D Scene Understanding0
Weakly-Supervised 3D Visual Grounding based on Visual Linguistic Alignment0
Dietary Assessment with Multimodal ChatGPT: A Systematic Analysis0
Zoom in on the Plant: Fine-grained Analysis of Leaf, Stem and Vein InstancesCode0
VMT-Adapter: Parameter-Efficient Transfer Learning for Multi-Task Dense Scene Understanding0
Show:102550
← PrevPage 96 of 173Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified