SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 15011550 of 1723 papers

TitleStatusHype
Multiview Based 3D Scene Understanding On Partial Point Sets0
ShelfNet for Fast Semantic SegmentationCode0
MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object LocalizationCode0
IDD: A Dataset for Exploring Problems of Autonomous Navigation in Unconstrained EnvironmentsCode0
A pooling based scene text proposal technique for scene text reading in the wild0
Artificial Color Constancy via GoogLeNet with Angular Loss FunctionCode0
Sensor Adaptation for Improved Semantic Segmentation of Overhead Imagery0
Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning0
Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose EstimationCode0
UAVid: A Semantic Segmentation Dataset for UAV ImageryCode0
Diagnostics in Semantic Segmentation0
Semantic and structural image segmentation for prosthetic vision0
Incorporating Luminance, Depth and Color Information by a Fusion-based Network for Semantic SegmentationCode0
A Variational Observation Model of 3D Object for Probabilistic Semantic SLAM0
Answering Visual What-If Questions: From Actions to Predicted Scene Descriptions0
Context-Dependent Diffusion Network for Visual Relationship Detection0
On the Importance of Visual Context for Data Augmentation in Scene Understanding0
Deep Depth from Defocus: how can defocus blur improve 3D estimation using dense neural networks?Code0
Modeling human intuitions about liquid flow with particle-based simulation0
BOLD5000: A public fMRI dataset of 5000 imagesCode0
Soft-PHOC Descriptor for End-to-End Word Spotting in Egocentric Scene ImagesCode0
Multiple-gaze geometry: Inferring novel 3D locations from gazes observed in monocular video0
Localization Guided Learning for Pedestrian Attribute Recognition0
COFGA: Classification Of Fine-Grained Features In Aerial Images0
Single Shot Scene Text RetrievalCode0
NavigationNet: A Large-scale Interactive Indoor Navigation Dataset0
Second-order Democratic Aggregation0
Deep Learned Full-3D Object Completion from Single View0
Learning Monocular Depth by Distilling Cross-domain Stereo NetworksCode0
Holistic 3D Scene Parsing and Reconstruction from a Single RGB ImageCode0
Parsing Geometry Using Structure-Aware Shape TemplatesCode0
Model Adaptation with Synthetic and Real Data for Semantic Dense Foggy Scene Understanding0
A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators0
A Reinforcement Learning Approach to Target Tracking in a Camera Network0
In pixels we trust: From Pixel Labeling to Object Localization and Scene Categorization0
Three for one and one for three: Flow, Segmentation, and Surface NormalsCode0
Visual Affordance and Function Understanding: A Survey0
A Reflectance Based Method For Shadow Detection and Removal0
End-to-End Race Driving with Deep Reinforcement Learning0
A Survey of Knowledge Representation in Service Robotics0
Online Self-supervised Scene Segmentation for Micro Aerial Vehicles0
DenseASPP for Semantic Segmentation in Street ScenesCode0
Inferring Shared Attention in Social Scene Videos0
3D-RCNN: Instance-Level 3D Object Reconstruction via Render-and-Compare0
Scene Understanding Networks for Autonomous Driving based on Around View Monitoring System0
Auxiliary Tasks in Multi-task LearningCode0
Vision-based Automated Bridge Component Recognition Integrated With High-level Scene Understanding0
PAD-Net: Multi-Tasks Guided Prediction-and-Distillation Network for Simultaneous Depth Estimation and Scene Parsing0
Multi-Resolution Multi-Modal Sensor Fusion For Remote Sensing Data With Label UncertaintyCode0
EML-NET:An Expandable Multi-Layer NETwork for Saliency Prediction0
Show:102550
← PrevPage 31 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified