SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 751775 of 1723 papers

TitleStatusHype
Improving Human-Object Interaction Detection via Phrase Learning and Label Composition0
Improving Building Segmentation for Off-Nadir Satellite Imagery0
Dense Supervision Propagation for Weakly Supervised Semantic Segmentation on 3D Point Clouds0
Label-Driven Reconstruction for Domain Adaptation in Semantic Segmentation0
Improving 6D Object Pose Estimation of metallic Household and Industry Objects0
Deep ensembles based on Stochastic Activation Selection for Polyp Segmentation0
LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding0
A Multiple-View Geometric Model for Specularity Prediction on General Curved Surfaces0
Looking Beyond the Visible Scene0
IMENet: Joint 3D Semantic Scene Completion and 2D Semantic Segmentation through Iterative Mutual Enhancement0
Image-to-Height Domain Translation for Synthetic Aperture Sonar0
Deep cross-domain building extraction for selective depth estimation from oblique aerial imagery0
Language-EXtended Indoor SLAM (LEXIS): A Versatile System for Real-time Visual Scene Understanding0
LargeAD: Large-Scale Cross-Sensor Data Pretraining for Autonomous Driving0
Large Language Models for Autonomous Driving (LLM4AD): Concept, Benchmark, Experiments, and Challenges0
Large Language Models (LLMs) as Traffic Control Systems at Urban Intersections: A New Paradigm0
Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems0
A Comprehensive Review of Modern Object Segmentation Approaches0
LCrowdV: Generating Labeled Videos for Simulation-based Crowd Behavior Learning0
Leader360V: The Large-scale, Real-world 360 Video Dataset for Multi-task Learning in Diverse Environment0
Image Parsing with Stochastic Scene Grammar0
Learning 3D Robotics Perception using Inductive Priors0
Deep Contextual Attention for Human-Object Interaction Detection0
Learning 3D Semantic Scene Graphs from 3D Indoor Reconstructions0
DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding0
Show:102550
← PrevPage 31 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified