SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 951-975 of 1723 papers

Title | Status | Hype
Semantic Segmentation-Assisted Instance Feature Fusion for Multi-Level 3D Part Instance Segmentation | Code | 1
TAG: Boosting Text-VQA via Text-aware Visual Question-answer Generation | Code | 1
AutoLaparo: A New Dataset of Integrated Multi-tasks for Image-guided Surgical Automation in Laparoscopic Hysterectomy | - | 0
Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer | Code | 2
MonteBoxFinder: Detecting and Filtering Primitives to Fit a Noisy Point Cloud | Code | 1
CENet: Toward Concise and Efficient LiDAR Semantic Segmentation for Autonomous Driving | Code | 1
CompNVS: Novel View Synthesis with Scene Completion | - | 0
Semantic Abstraction: Open-World 3D Scene Understanding from 2D Vision-Language Models | Code | 1
Panoptic Scene Graph Generation | Code | 2
Divide and Conquer: 3D Point Cloud Instance Segmentation With Point-Wise Binarization | Code | 1
Neural Groundplans: Persistent Neural Scene Representations from a Single Image | - | 0
SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany | - | 0
Egocentric Scene Understanding via Multimodal Spatial Rectifier | Code | 1
Adversarial Attacks on Monocular Pose Estimation | Code | 0
Efficient Multi-Task RGB-D Scene Analysis for Indoor Environments | Code | 1
BlindSpotNet: Seeing Where We Cannot See | - | 0
MCTS with Refinement for Proposals Selection Games in Scene Understanding | Code | 1
Toward Explainable and Fine-Grained 3D Grounding through Referring Textual Phrases | - | 0
Distance Matters in Human-Object Interaction Detection | Code | 0
Scene-Aware Prompt for Multi-modal Dialogue Understanding and Generation | - | 0
Uncertainty-aware Panoptic Segmentation | Code | 1
MGNet: Monocular Geometric Scene Understanding for Autonomous Driving | Code | 1
IBISCape: A Simulated Benchmark for multi-modal SLAM Systems Evaluation in Large-scale Dynamic Environments | Code | 1
Placental Vessel Segmentation and Registration in Fetoscopy: Literature Review and MICCAI FetReg2021 Challenge Findings | Code | 0
Panoramic Panoptic Segmentation: Insights Into Surrounding Parsing for Mobile Agents via Unsupervised Contrastive Learning | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | - | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | - | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | - | Unverified