SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 851875 of 1723 papers

TitleStatusHype
Camera-Radar Perception for Autonomous Vehicles and ADAS: Concepts, Datasets and Metrics0
CLIP-FO3D: Learning Free Open-world 3D Scene Representations from 2D Dense CLIP0
Traffic Scene Parsing through the TSP6K DatasetCode1
VTQA: Visual Text Question Answering via Entity Alignment and Cross-Media ReasoningCode0
Unified Perception: Efficient Depth-Aware Video Panoptic Segmentation with Minimal Annotation Costs0
Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning0
APARATE: Adaptive Adversarial Patch for CNN-based Monocular Depth Estimation for Autonomous Navigation0
Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors0
RemoteNet: Remote Sensing Image Segmentation Network based on Global-Local Information0
Open Challenges for Monocular Single-shot 6D Object Pose Estimation0
CEKD: Cross-Modal Edge-Privileged Knowledge Distillation for Semantic Scene Understanding Using Only Thermal ImagesCode1
Deep Learning for Event-based Vision: A Comprehensive Survey and BenchmarksCode1
Explicit3D: Graph Network with Spatial Inference for Single Image 3D Object Detection0
3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose EstimationCode1
Object-Centric Scene Representations using Active Inference0
Structured Generative Models for Scene Understanding0
A Flexible Framework for Virtual Omnidirectional Vision to Improve Operator Situation Awareness0
GALIP: Generative Adversarial CLIPs for Text-to-Image SynthesisCode2
Learning from Mistakes: Self-Regularizing Hierarchical Representations in Point Cloud Semantic Segmentation0
OvarNet: Towards Open-vocabulary Object Attribute RecognitionCode1
Unleash the Potential of Image Branch for Cross-modal 3D Object DetectionCode1
Model-based inexact graph matching on top of CNNs for semantic scene understandingCode0
Long Range Pooling for 3D Large-Scale Scene Understanding0
Diffusion-based Generation, Optimization, and Planning in 3D ScenesCode2
A Comprehensive Review of Modern Object Segmentation Approaches0
Show:102550
← PrevPage 35 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified