SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 121130 of 1723 papers

TitleStatusHype
RS-RAG: Bridging Remote Sensing Imagery and Comprehensive Knowledge with a Multi-Modal Dataset and Retrieval-Augmented Generation Model0
DFormerv2: Geometry Self-Attention for RGBD Semantic SegmentationCode3
Planning Safety Trajectories with Dual-Phase, Physics-Informed, and Transportation Knowledge-Driven Large Language ModelsCode0
F-ViTA: Foundation Model Guided Visible to Thermal TranslationCode1
Multimodal Fusion and Vision-Language Models: A Survey for Robot VisionCode1
Scene-Centric Unsupervised Panoptic SegmentationCode2
CoMatcher: Multi-View Collaborative Feature Matching0
TransforMerger: Transformer-based Voice-Gesture Fusion for Robust Human-Robot Communication0
Overlap-Aware Feature Learning for Robust Unsupervised Domain Adaptation for 3D Semantic Segmentation0
Ross3D: Reconstructive Visual Instruction Tuning with 3D-Awareness0
Show:102550
← PrevPage 13 of 173Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified