SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1451-1500 of 1723 papers

Title | Status | Hype
Deep Reinforcement Learning on a Budget: 3D Control and Reasoning Without a Supercomputer | Code | 0
SemanticKITTI: A Dataset for Semantic Scene Understanding of LiDAR Sequences | Code | 0
JSIS3D: Joint Semantic-Instance Segmentation of 3D Point Clouds with Multi-Task Pointwise Networks and Multi-Value Conditional Random Fields | Code | 0
ResUNet-a: a deep learning framework for semantic segmentation of remotely sensed data | Code | 0
Do Deep Neural Networks Model Nonlinear Compositionality in the Neural Representation of Human-Object Interactions? | - | 0
Road Scene Understanding by Occupancy Grid Learning from Sparse Radar Clusters using Semantic Segmentation | Code | 0
Auto-Embedding Generative Adversarial Networks for High Resolution Image Synthesis | Code | 0
Veritatem Dies Aperit - Temporally Consistent Depth Prediction Enabled by a Multi-Task Geometric and Semantic Scene Understanding Approach | Code | 0
Monocular 3D Object Detection with Pseudo-LiDAR Point Cloud | Code | 0
Scene Understanding for Autonomous Manipulation with Deep Learning | - | 0
Quantitative Depth Quality Assessment of RGBD Cameras At Close Range Using 3D Printed Fixtures | Code | 0
Affordance Learning In Direct Perception for Autonomous Driving | - | 0
Real time backbone for semantic segmentation | - | 0
Instance- and Category-level 6D Object Pose Estimation | - | 0
Building an Affordances Map with Interactive Perception | - | 0
Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction | - | 0
The H3D Dataset for Full-Surround 3D Multi-Object Detection and Tracking in Crowded Urban Scenes | - | 0
An efficient solution for semantic segmentation: ShuffleNet V2 with atrous separable convolutions | Code | 0
Deeply Supervised Multimodal Attentional Translation Embeddings for Visual Relationship Detection | Code | 0
Gated2Depth: Real-time Dense Lidar from Gated Images | Code | 0
Software-Defined FPGA Accelerator Design for Mobile Deep Learning Applications | - | 0
Single Network Panoptic Segmentation for Street Scene Understanding | Code | 0
Real-time 3D Traffic Cone Detection for Autonomous Driving | Code | 0
VrR-VG: Refocusing Visually-Relevant Relationships | - | 0
Skip-GANomaly: Skip Connected and Adversarially Trained Encoder-Decoder Anomaly Detection | Code | 0
Neural RGB->D Sensing: Depth and Uncertainty from a Video Camera | Code | 0
Curriculum Model Adaptation with Synthetic and Real Data for Semantic Foggy Scene Understanding | Code | 1
Learning Spatial Common Sense with Geometry-Aware Recurrent Networks | - | 0
Impact of Ground Truth Annotation Quality on Performance of Semantic Image Segmentation of Traffic Conditions | Code | 0
Reasoning About Physical Interactions with Object-Oriented Prediction and Planning | - | 0
Learning Direct Optimization for Scene Understanding | - | 0
Not Using the Car to See the Sidewalk: Quantifying and Controlling the Effects of Context in Classification and Segmentation | - | 0
An Intelligent Safety System for Human-Centered Semi-Autonomous Vehicles | - | 0
Counterfactual Critic Multi-Agent Training for Scene Graph Generation | - | 0
The Right (Angled) Perspective: Improving the Understanding of Road Scenes Using Boosted Inverse Perspective Mapping | - | 0
Submodular Field Grammars: Representation, Inference, and Application to Image Parsing | - | 0
Learning to Exploit Stability for 3D Scene Parsing | - | 0
Multiview Based 3D Scene Understanding On Partial Point Sets | - | 0
ShelfNet for Fast Semantic Segmentation | Code | 0
MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization | Code | 0
IDD: A Dataset for Exploring Problems of Autonomous Navigation in Unconstrained Environments | Code | 0
A pooling based scene text proposal technique for scene text reading in the wild | - | 0
Artificial Color Constancy via GoogLeNet with Angular Loss Function | Code | 0
Sensor Adaptation for Improved Semantic Segmentation of Overhead Imagery | - | 0
Toward Driving Scene Understanding: A Dataset for Learning Driver Behavior and Causal Reasoning | - | 0
Cooperative Holistic Scene Understanding: Unifying 3D Object, Layout, and Camera Pose Estimation | Code | 0
UAVid: A Semantic Segmentation Dataset for UAV Imagery | Code | 0
Multi-Task Learning as Multi-Objective Optimization | Code | 2
Diagnostics in Semantic Segmentation | - | 0
Semantic and structural image segmentation for prosthetic vision | - | 0
Page 30 of 35

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 | - | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 | - | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 | - | Unverified