SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 326350 of 1723 papers

TitleStatusHype
MSeg: A Composite Dataset for Multi-domain Semantic SegmentationCode1
Channel-Wise Attention-Based Network for Self-Supervised Monocular Depth EstimationCode1
Comprehensive Visual Question Answering on Point Clouds through Compositional Scene ManipulationCode1
ScanQA: 3D Question Answering for Spatial Scene UnderstandingCode1
Activation Modulation and Recalibration Scheme for Weakly Supervised Semantic SegmentationCode1
PolyphonicFormer: Unified Query Learning for Depth-aware Video Panoptic SegmentationCode1
Behind the Curtain: Learning Occluded Shapes for 3D Object DetectionCode1
AirObject: A Temporally Evolving Graph Embedding for Object IdentificationCode1
Instance-wise Occlusion and Depth Orders in Natural ScenesCode1
Cerberus Transformer: Joint Semantic, Affordance and Attribute ParsingCode1
Grounded Situation Recognition with TransformersCode1
ARKitScenes: A Diverse Real-World Dataset For 3D Indoor Scene Understanding Using Mobile RGB-D DataCode1
Learning Object-Centric Representations of Multi-Object Scenes from Multiple ViewsCode1
Panoptic 3D Scene Reconstruction From a Single RGB ImageCode1
3DP3: 3D Scene Perception via Probabilistic ProgrammingCode1
A Versatile and Efficient Reinforcement Learning Framework for Autonomous DrivingCode1
PlaneRecNet: Multi-Task Learning with Cross-Task Consistency for Piece-Wise Plane Detection and Reconstruction from a Single RGB ImageCode1
Structured Bird's-Eye-View Traffic Scene Understanding from Onboard ImagesCode1
KITTI-360: A Novel Dataset and Benchmarks for Urban Scene Understanding in 2D and 3DCode1
Semantic Segmentation-assisted Scene Completion for LiDAR Point CloudsCode1
Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal EstimationCode1
PQ-Transformer: Jointly Parsing 3D Objects and Layouts from Point CloudsCode1
Spatio-temporal Self-Supervised Representation Learning for 3D Point CloudsCode1
From General to Specific: Informative Scene Graph Generation via Balance AdjustmentCode1
DeepPanoContext: Panoramic 3D Scene Understanding with Holistic Scene Context Graph and Relation-based OptimizationCode1
Show:102550
← PrevPage 14 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified