SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 801825 of 1723 papers

TitleStatusHype
DeepContext: Context-Encoding Neural Pathways for 3D Holistic Scene Understanding0
Image-Graph-Image Translation via Auto-Encoding0
Learning to Detect Human-Object Interactions With Knowledge0
Learning to Exploit Stability for 3D Scene Parsing0
Learning to Interpret and Describe Abstract Scenes0
A model of saliency-based visual attention for rapid scene analysis0
Modeling Uncertainty in 3D Gaussian Splatting through Continuous Semantic Splatting0
Image Captioning with Integrated Bottom-Up and Multi-level Residual Top-Down Attention for Game Scene Understanding0
Deep Bayesian Image Set Classification: A Defence Approach against Adversarial Attacks0
Leverage Cross-Attention for End-to-End Open-Vocabulary Panoptic Reconstruction0
Discovery of Shared Semantic Spaces for Multi-Scene Video Query and Summarization0
An Exemplar-based CRF for Multi-instance Object Segmentation0
Leveraging Auxiliary Text for Deep Recognition of Unseen Visual Relationships0
IM2CAD0
Leveraging Large-Scale Pretrained Vision Foundation Models for Label-Efficient 3D Point Cloud Segmentation0
A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features0
Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding0
AVD2: Accident Video Diffusion for Accident Video Description0
Lifting GIS Maps into Strong Geometric Context for Scene Understanding0
MM-3DScene: 3D Scene Understanding by Customizing Masked Modeling with Informative-Preserved Reconstruction and Self-Distilled Consistency0
Identifying First-person Camera Wearers in Third-person Videos0
AmodalSynthDrive: A Synthetic Amodal Perception Dataset for Autonomous Driving0
MNEW: Multi-domain Neighborhood Embedding and Weighting for Sparse Point Clouds Segmentation0
LiveHPS: LiDAR-based Scene-level Human Pose and Shape Estimation in Free Environment0
DAWN: Vehicle Detection in Adverse Weather Nature Dataset0
Show:102550
← PrevPage 33 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified