SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 1201-1250 of 1723 papers

Title | Status | Hype
Unsupervised Single-shot Depth Estimation using Perceptual Reconstruction | Code | 0
Moving Beyond Navigation with Active Neural SLAM |  | 0
Towards holistic scene understanding: Semantic segmentation and beyond |  | 0
Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety |  | 0
Scene Graph Generation: A Comprehensive Survey |  | 0
Glass Segmentation Using Intensity and Spectral Polarization Cues |  | 0
Segment-Fusion: Hierarchical Context Fusion for Robust 3D Semantic Segmentation |  | 0
Weakly Supervised Segmentation on Outdoor 4D Point Clouds With Temporal Matching and Spatial Graph Propagation | Code | 0
HSPACE: Synthetic Parametric Humans Animated in Complex Environments |  | 0
Distillation of Human-Object Interaction Contexts for Action Recognition |  | 0
Improving Human-Object Interaction Detection via Phrase Learning and Label Composition |  | 0
Image-to-Height Domain Translation for Synthetic Aperture Sonar |  | 0
3D Scene Understanding at Urban Intersection using Stereo Vision and Digital Map |  | 0
Roominoes: Generating Novel 3D Floor Plans From Existing 3D Rooms |  | 0
4DContrast: Contrastive Learning with Dynamic Correspondences for 3D Scene Understanding |  | 0
Joint Modeling of Visual Objects and Relations for Scene Graph Generation |  | 0
Contrastive Instance Association for 4D Panoptic Segmentation using Sequences of 3D LiDAR Scans | Code | 0
REMIPS: Physically Consistent 3D Reconstruction of Multiple Interacting People under Weak Supervision |  | 0
Both Style and Fog Matter: Cumulative Domain Adaptation for Semantic Foggy Scene Understanding |  | 0
Zero-Shot Semantic Segmentation via Spatial and Multi-Scale Aware Visual Class Embedding |  | 0
DiffSDFSim: Differentiable Rigid-Body Dynamics With Implicit Shapes |  | 0
PAPooling: Graph-based Position Adaptive Aggregation of Local Geometry in Point Clouds |  | 0
Not All Relations are Equal: Mining Informative Labels for Scene Graph Generation |  | 0
Joint stereo 3D object detection and implicit surface reconstruction | Code | 0
Panoptic Segmentation Meets Remote Sensing |  | 0
Talk-to-Resolve: Combining scene understanding and spatial dialogue to resolve granular task ambiguity for a collocated robot |  | 0
Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion |  | 0
Robust deep learning-based semantic organ segmentation in hyperspectral images |  | 0
DriveGuard: Robustification of Automated Driving Systems with Deep Spatio-Temporal Convolutional Autoencoder |  | 0
When Neural Networks Using Different Sensors Create Similar Features |  | 0
Semantic Detection of Potential Wind-borne Debris in Construction Jobsites: Digital Twining for Hurricane Preparedness and Jobsite Safety |  | 0
Adversarial Scene Reconstruction and Object Detection System for Assisting Autonomous Vehicle |  | 0
Monocular Depth Estimation with Sharp Boundary |  | 0
Unsupervised Domain Adaptation for LiDAR Panoptic Segmentation |  | 0
Semantic Dense Reconstruction with Consistent Scene Segments |  | 0
Referring Self-supervised Learning on 3D Point Cloud |  | 0
D-Net: A Generalised and Optimised Deep Network for Monocular Depth Estimation | Code | 0
Efficient Point Transformer for Large-scale 3D Scene Understanding |  | 0
Audio-Visual Collaborative Representation Learning for Dynamic Saliency Prediction |  | 0
Label-Attention Transformer with Geometrically Coherent Objects for Image Captioning | Code | 0
Navigation-Oriented Scene Understanding for Robotic Autonomy: Learning to Segment Driveability in Egocentric Images |  | 0
On the Sins of Image Synthesis Loss for Self-supervised Depth Estimation |  | 0
Residual 3D Scene Flow Learning with Context-Aware Feature Extraction |  | 0
Single Image 3D Object Estimation with Primitive Graph Networks | Code | 0
RefineCap: Concept-Aware Refinement for Image Captioning |  | 0
Improving Building Segmentation for Off-Nadir Satellite Imagery |  | 0
Binaural SoundNet: Predicting Semantics, Depth and Motion with Binaural Sounds |  | 0
Multi-task learning from fixed-wing UAV images for 2D/3D city modeling |  | 0
Deep Bayesian Image Set Classification: A Defence Approach against Adversarial Attacks |  | 0
A Multiple-View Geometric Model for Specularity Prediction on General Curved Surfaces |  | 0
Page 25 of 35

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.44 |  | Unverified
2 | Team VGAI (TCS Research) | OMQ | 0.37 |  | Unverified
3 | Demo_semantic_SLAM | OMQ | 0.11 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CPN(ResNet-101) | Mean IoU | 46.3 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ACRV Baseline | OMQ | 0.35 |  | Unverified