SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 426450 of 1723 papers

TitleStatusHype
ThreeDWorld: A Platform for Interactive Multi-Modal Physical SimulationCode1
Learning and Reasoning with the Graph Structure Representation in Robotic SurgeryCode1
A Survey on Deep Learning for Localization and Mapping: Towards the Age of Spatial Machine IntelligenceCode1
Learning Visual Commonsense for Robust Scene Graph GenerationCode1
Benchmarking Unsupervised Object Representations for Video SequencesCode1
0-MMS: Zero-Shot Multi-Motion Segmentation With A Monocular Event CameraCode1
IDA-3D: Instance-Depth-Aware 3D Object Detection From Stereo Vision for Autonomous DrivingCode1
VTGNet: A Vision-based Trajectory Generation Network for Autonomous Vehicles in Urban EnvironmentsCode1
Cityscapes-Panoptic-Parts and PASCAL-Panoptic-Parts datasets for Scene UnderstandingCode1
Self-Supervised Scene De-occlusionCode1
Context Prior for Scene SegmentationCode1
PointGroup: Dual-Set Point Grouping for 3D Instance SegmentationCode1
Semantic Segmentation of Underwater Imagery: Dataset and BenchmarkCode1
Occlusion-Aware Depth Estimation with Adaptive Normal ConstraintsCode1
Distilled Semantics for Comprehensive Scene Understanding from VideosCode1
Learning Human-Object Interaction Detection using Interaction PointsCode1
LayoutMP3D: Layout Annotation of Matterport3DCode1
Multi-Path Region Mining For Weakly Supervised 3D Semantic Segmentation on Point CloudsCode1
CAKES: Channel-wise Automatic KErnel Shrinking for Efficient 3D NetworksCode1
SaccadeNet: A Fast and Accurate Object DetectorCode1
Who2com: Collaborative Perception via Learnable Handshake CommunicationCode1
Explainable Object-induced Action Decision for Autonomous VehiclesCode1
Toronto-3D: A Large-scale Mobile LiDAR Dataset for Semantic Segmentation of Urban RoadwaysCode1
Scene Completeness-Aware Lidar Depth Completion for Driving ScenarioCode1
Total3DUnderstanding: Joint Layout, Object Pose and Mesh Reconstruction for Indoor Scenes from a Single ImageCode1
Show:102550
← PrevPage 18 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified