SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 14261450 of 1723 papers

TitleStatusHype
Weakly-Supervised 3D Visual Grounding based on Visual Linguistic Alignment0
Attributes-aware Visual Emotion Representation Learning0
SpatialLM: Training Large Language Models for Structured Indoor Modeling0
Spatial Sampling Network for Fast Scene Understanding0
Spatiotemporal Event Graphs for Dynamic Scene Understanding0
Spatiotemporal Learning of Dynamic Gestures from 3D Point Cloud Data0
Attention Mechanism based Cognition-level Scene Understanding0
SpectralGaussians: Semantic, spectral 3D Gaussian splatting for multi-spectral scene representation, visualization and analysis0
Attention-based transformer models for image captioning across languages: An in-depth survey and evaluation0
SpeedMachines: Anytime Structured Prediction0
Spherical World-Locking for Audio-Visual Localization in Egocentric Videos0
Attentional Graph Convolutional Network for Structure-aware Audio-Visual Scene Classification0
A transition towards virtual representations of visual scenes0
SplatTalk: 3D VQA with Gaussian Splatting0
SpSequenceNet: Semantic Segmentation Network on 4D Point Clouds0
A Task-Oriented Approach for Cost-Sensitive Recognition0
A Systematic Literature Review on Deep Learning-based Depth Estimation in Computer Vision0
sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views0
SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images0
A Survey on Knowledge Graph-based Methods for Automated Driving0
A Survey on Deep Learning Methods for Semantic Image Segmentation in Real-Time0
A Survey of Knowledge Representation in Service Robotics0
StandardSim: A Synthetic Dataset For Retail Environments0
Assessing the performance of CT image denoisers using Laguerre-Gauss Channelized Hotelling Observer for lesion detection0
Stochastic Future Prediction in Real World Driving Scenarios0
Show:102550
← PrevPage 58 of 69Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified