SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 17011723 of 1723 papers

TitleStatusHype
AP-MTL: Attention Pruned Multi-task Learning Model for Real-time Instrument Detection and Segmentation in Robot-assisted SurgeryCode0
3D Object Detection from Point Cloud via Voting Step DiffusionCode0
JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with Multi-Task Pointwise Networks and Multi-Value Conditional Random FieldsCode0
Joint stereo 3D object detection and implicit surface reconstructionCode0
Interpretable Visual Understanding with Cognitive Attention NetworkCode0
Weakly Supervised Affordance DetectionCode0
Interactive Learning for Semantic Segmentation in Earth ObservationCode0
Semantic Segmentation with High Inference Speed in Off-Road EnvironmentsCode0
Cognitive TransFuser: Semantics-guided Transformer-based Sensor Fusion for Improved Waypoint PredictionCode0
BlitzNet: A Real-Time Deep Network for Scene UnderstandingCode0
Semantic Understanding of Foggy Scenes with Purely Synthetic DataCode0
UAVid: A Semantic Segmentation Dataset for UAV ImageryCode0
Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph ConsensusCode0
D-Net: A Generalised and Optimised Deep Network for Monocular Depth EstimationCode0
Three for one and one for three: Flow, Segmentation, and Surface NormalsCode0
Bidirectional Multi-scale Attention Networks for Semantic Segmentation of Oblique UAV ImageryCode0
Where Does It End? -- Reasoning About Hidden Surfaces by Object Intersection ConstraintsCode0
Distance Matters in Human-Object Interaction DetectionCode0
InfoNorm: Mutual Information Shaping of Normals for Sparse-View ReconstructionCode0
Inferring Distributions Over Depth from a Single ImageCode0
Visual Translation Embedding Network for Visual Relation DetectionCode0
Where Does It End? - Reasoning About Hidden Surfaces by Object Intersection ConstraintsCode0
Dirty Pixels: Towards End-to-End Image Processing and PerceptionCode0
Show:102550
← PrevPage 35 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified