SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1501–1550 of 1723 papers

Title | Status | Hype
Incorporating Luminance, Depth and Color Information by a Fusion-based Network for Semantic Segmentation | Code | 0
A Variational Observation Model of 3D Object for Probabilistic Semantic SLAM | | 0
Context-Dependent Diffusion Network for Visual Relationship Detection | | 0
Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions | | 0
On the Importance of Visual Context for Data Augmentation in Scene Understanding | | 0
Modeling human intuitions about liquid flow with particle-based simulation | | 0
Deep Depth from Defocus: how can defocus blur improve 3D estimation using dense neural networks? | Code | 0
BOLD5000: A public fMRI dataset of 5000 images | Code | 0
Soft-PHOC Descriptor for End-to-End Word Spotting in Egocentric Scene Images | Code | 0
Multiple-gaze geometry: Inferring novel 3D locations from gazes observed in monocular video | | 0
Localization Guided Learning for Pedestrian Attribute Recognition | | 0
Single Shot Scene Text Retrieval | Code | 0
COFGA: Classification Of Fine-Grained Features In Aerial Images | | 0
NavigationNet: A Large-scale Interactive Indoor Navigation Dataset | | 0
Second-order Democratic Aggregation | | 0
Deep Learned Full-3D Object Completion from Single View | | 0
Learning Monocular Depth by Distilling Cross-domain Stereo Networks | Code | 0
Holistic 3D Scene Parsing and Reconstruction from a Single RGB Image | Code | 0
Parsing Geometry Using Structure-Aware Shape Templates | Code | 0
Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding | | 0
A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators | | 0
Unified Perceptual Parsing for Scene Understanding | Code | 1
A Reinforcement Learning Approach to Target Tracking in a Camera Network | | 0
Three for one and one for three: Flow, Segmentation, and Surface Normals | Code | 0
In pixels we trust: From Pixel Labeling to Object Localization and Scene Categorization | | 0
Visual Affordance and Function Understanding: A Survey | | 0
Visual Graphs from Motion (VGfM): Scene understanding with object geometry reasoning | Code | 1
A Reflectance Based Method For Shadow Detection and Removal | | 0
End-to-End Race Driving with Deep Reinforcement Learning | | 0
A Survey of Knowledge Representation in Service Robotics | | 0
Online Self-supervised Scene Segmentation for Micro Aerial Vehicles | | 0
Digging Into Self-Supervised Monocular Depth Estimation | Code | 1
3D-RCNN: Instance-Level 3D Object Reconstruction via Render-and-Compare | | 0
Inferring Shared Attention in Social Scene Videos | | 0
DenseASPP for Semantic Segmentation in Street Scenes | Code | 0
Scene Understanding Networks for Autonomous Driving based on Around View Monitoring System | | 0
Auxiliary Tasks in Multi-task Learning | Code | 0
Vision-based Automated Bridge Component Recognition Integrated With High-level Scene Understanding | | 0
PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing | | 0
Multi-Resolution Multi-Modal Sensor Fusion For Remote Sensing Data With Label Uncertainty | Code | 0
EML-NET:An Expandable Multi-Layer NETwork for Saliency Prediction | | 0
An Anti-fraud System for Car Insurance Claim Based on Visual Evidence | | 0
On the iterative refinement of densely connected representation levels for semantic segmentation | Code | 0
Spatiotemporal Learning of Dynamic Gestures from 3D Point Cloud Data | | 0
Deep cross-domain building extraction for selective depth estimation from oblique aerial imagery | | 0
VLocNet++: Deep Multitask Learning for Semantic Visual Localization and Odometry | | 0
LoST? Appearance-Invariant Place Recognition for Opposite Viewpoints using Visual Semantics | Code | 0
Learning Rigidity in Dynamic Scenes with a Moving Camera for 3D Motion Field Estimation | Code | 0
Learning Depth from Single Images with Deep Neural Network Embedding Focal Length | | 0
DeepScores -- A Dataset for Segmentation, Detection and Classification of Tiny Objects | Code | 1
Page 31 of 35

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | | Unverified