SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 401425 of 1723 papers

TitleStatusHype
Robust Neural Routing Through Space Partitions for Camera Relocalization in Dynamic Indoor EnvironmentsCode1
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene UnderstandingCode1
Understanding Bird's-Eye View of Road Semantics using an Onboard CameraCode1
Towards Part-Based Understanding of RGB-D ScansCode1
Group Contextual Encoding for 3D Point CloudsCode1
RfD-Net: Point Scene Understanding by Semantic Instance ReconstructionCode1
Visual place recognition: A survey from deep learning perspectiveCode1
RELLIS-3D Dataset: Data, Benchmarks and AnalysisCode1
SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchmark under Multiple EnvironmentsCode1
Towards Efficient Scene Understanding via Squeeze ReasoningCode1
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic SegmentationCode1
Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce ModelCode1
RADIATE: A Radar Dataset for Automotive Perception in Bad WeatherCode1
ALFWorld: Aligning Text and Embodied Environments for Interactive LearningCode1
MLRSNet: A Multi-label High Spatial Resolution Remote Sensing Dataset for Semantic Scene UnderstandingCode1
BoMuDANet: Unsupervised Adaptation for Visual Scene Understanding in Unstructured Driving EnvironmentsCode1
Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and ChallengesCode1
Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor SceneCode1
Polysemy Deciphering Network for Robust Human-Object Interaction DetectionCode1
Pose-based Modular Network for Human-Object Interaction DetectionCode1
Polysemy Deciphering Network for Human-Object Interaction DetectionCode1
Weakly Supervised 3D Object Detection from Point CloudsCode1
Virtual Multi-view Fusion for 3D Semantic SegmentationCode1
Few-Shot Object Detection and Viewpoint Estimation for Objects in the WildCode1
PointContrast: Unsupervised Pre-training for 3D Point Cloud UnderstandingCode1
Show:102550
← PrevPage 17 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified