SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 14511500 of 1723 papers

TitleStatusHype
Implicit Background Estimation for Semantic SegmentationCode0
Spatial Sampling Network for Fast Scene Understanding0
Bridging Stereo Matching and Optical Flow via Spatiotemporal CorrespondenceCode0
Real-time Approximate Bayesian Computation for Scene Understanding0
Unsupervised Domain Adaptation using Generative Adversarial Networks for Semantic Segmentation of Aerial ImagesCode0
A Joint Convolutional Neural Networks and Context Transfer for Street Scenes Labeling0
An Information-Theoretic Metric of Transferability for Task Transfer LearningCode0
Reasoning About Physical Interactions with Object-Centric Models0
Segmenting the FutureCode0
DirectShape: Direct Photometric Alignment of Shape Priors for Visual Vehicle Pose and Shape Estimation0
Deep Optics for Monocular Depth Estimation and 3D Object Detection0
DSNet: An Efficient CNN for Road Scene Segmentation0
Deep Surface Normal Estimation with Hierarchical RGB-D FusionCode0
Structured agents for physical construction0
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a SupercomputerCode0
SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR SequencesCode0
JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with Multi-Task Pointwise Networks and Multi-Value Conditional Random FieldsCode0
ResUNet-a: a deep learning framework for semantic segmentation of remotely sensed dataCode0
Road Scene Understanding by Occupancy Grid Learning from Sparse Radar Clusters using Semantic SegmentationCode0
Do Deep Neural Networks Model Nonlinear Compositionality in the Neural Representation of Human-Object Interactions?0
Auto-Embedding Generative Adversarial Networks for High Resolution Image SynthesisCode0
Veritatem Dies Aperit- Temporally Consistent Depth Prediction Enabled by a Multi-Task Geometric and Semantic Scene Understanding ApproachCode0
Scene Understanding for Autonomous Manipulation with Deep Learning0
Monocular 3D Object Detection with Pseudo-LiDAR Point CloudCode0
Quantitative Depth Quality Assessment of RGBD Cameras At Close Range Using 3D Printed FixturesCode0
Affordance Learning In Direct Perception for Autonomous Driving0
Real time backbone for semantic segmentation0
Instance- and Category-level 6D Object Pose Estimation0
Building an Affordances Map with Interactive Perception0
Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction0
The H3D Dataset for Full-Surround 3D Multi-Object Detection and Tracking in Crowded Urban Scenes0
An efficient solution for semantic segmentation: ShuffleNet V2 with atrous separable convolutionsCode0
Deeply Supervised Multimodal Attentional Translation Embeddings for Visual Relationship DetectionCode0
Gated2Depth: Real-time Dense Lidar from Gated ImagesCode0
Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications0
Single Network Panoptic Segmentation for Street Scene UnderstandingCode0
Real-time 3D Traffic Cone Detection for Autonomous DrivingCode0
VrR-VG: Refocusing Visually-Relevant Relationships0
Skip-GANomaly: Skip Connected and Adversarially Trained Encoder-Decoder Anomaly DetectionCode0
Neural RGB->D Sensing: Depth and Uncertainty from a Video CameraCode0
Learning Spatial Common Sense with Geometry-Aware Recurrent Networks0
Impact of Ground Truth Annotation Quality on Performance of Semantic Image Segmentation of Traffic ConditionsCode0
Reasoning About Physical Interactions with Object-Oriented Prediction and Planning0
Learning Direct Optimization for Scene Understanding0
Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects of Context in Classification and Segmentation0
An Intelligent Safety System for Human-Centered Semi-Autonomous Vehicles0
Counterfactual Critic Multi-Agent Training for Scene Graph Generation0
The Right (Angled) Perspective: Improving the Understanding of Road Scenes Using Boosted Inverse Perspective Mapping0
Learning to Exploit Stability for 3D Scene Parsing0
Submodular Field Grammars: Representation, Inference, and Application to Image Parsing0
Show:102550
← PrevPage 30 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified