SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 251275 of 1723 papers

TitleStatusHype
All-Day Multi-Camera Multi-Target TrackingCode1
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic SegmentationCode1
F-ViTA: Foundation Model Guided Visible to Thermal TranslationCode1
Automatic Extrinsic Calibration Method for LiDAR and Camera Sensor SetupsCode1
From General to Specific: Informative Scene Graph Generation via Balance AdjustmentCode1
From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object DetectionCode1
AutoInst: Automatic Instance-Based Segmentation of LiDAR 3D ScansCode1
ALFWorld: Aligning Text and Embodied Environments for Interactive LearningCode1
FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier ConvolutionsCode1
Group Contextual Encoding for 3D Point CloudsCode1
Instance-wise Occlusion and Depth Orders in Natural ScenesCode1
A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed ImagesCode1
Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and ReasoningCode1
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene ContextsCode1
Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous DrivingCode1
Few-Shot Object Detection and Viewpoint Estimation for Objects in the WildCode1
A Two-Stage Masked Autoencoder Based Network for Indoor Depth CompletionCode1
AirObject: A Temporally Evolving Graph Embedding for Object IdentificationCode1
Event-based Motion Segmentation with Spatio-Temporal Graph CutsCode1
A Hybrid Sparse-Dense Monocular SLAM System for Autonomous DrivingCode1
Explainable Object-induced Action Decision for Autonomous VehiclesCode1
Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph AnalysisCode1
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene UnderstandingCode1
Enhancing Scene Graph Generation with Hierarchical Relationships and Commonsense KnowledgeCode1
Estimating and Exploiting the Aleatoric Uncertainty in Surface Normal EstimationCode1
Show:102550
← PrevPage 11 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified