SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 12011250 of 1723 papers

TitleStatusHype
Scale-aware Neural Network for Semantic Segmentation of Multi-resolution Remote Sensing Images0
Monte Carlo Scene Search for 3D Scene UnderstandingCode1
Affect2MM: Affective Analysis of Multimedia Content Using Emotion CausalityCode1
Holistic 3D Scene Understanding from a Single Image with Implicit RepresentationCode1
Exploiting Edge-Oriented Reasoning for 3D Point-based Scene Graph AnalysisCode1
Fine-Grained Off-Road Semantic Segmentation and Mapping via Contrastive Learning0
Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth EstimationCode0
Simulation-to-Real domain adaptation with teacher-student learning for endoscopic instrument segmentation0
FPS-Net: A Convolutional Fusion Network for Large-Scale LiDAR Point Cloud SegmentationCode1
Panoramic Panoptic Segmentation: Towards Complete Surrounding Understanding via Unsupervised Contrastive LearningCode1
A Kinematic Bottleneck Approach For Pose Regression of Flexible Surgical Instruments directly from Images0
Boundary-induced and scene-aggregated network for monocular depth predictionCode1
4D Panoptic LiDAR SegmentationCode1
RGB-D Railway Platform Monitoring and Scene Understanding for Enhanced Passenger SafetyCode1
Weakly Supervised Learning of Rigid 3D Scene FlowCode1
A2-FPN for Semantic Segmentation of Fine-Resolution Remotely Sensed ImagesCode1
Audiovisual Highlight Detection in Videos0
Single-Shot Cuboids: Geodesics-based End-to-end Manhattan Aligned Layout Estimation from Spherical PanoramasCode1
Bidirectional Multi-scale Attention Networks for Semantic Segmentation of Oblique UAV ImageryCode0
Optical flow and scene flow estimation: A survey0
Deep Learning--Based Scene Simplification for Bionic VisionCode0
OpenGF: An Ultra-Large-Scale Ground Filtering Dataset Built Upon Open ALS Point Clouds Around the WorldCode1
The Ikshana Hypothesis of Human Scene UnderstandingCode0
Rethinking Semantic Segmentation Evaluation for Explainability and Model Selection0
SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from Monocular images0
Automatic Extrinsic Calibration Method for LiDAR and Camera Sensor SetupsCode1
Grounding Consistency: Distilling Spatial Common Sense for Precise Visual Relationship DetectionCode1
BUTLER: Building Understanding in TextWorld via Language for Embodied Reasoning0
Non-maximum Suppression Also Closes the Variational Approximation Gap of Multi-object Variational Autoencoders0
Pseudo Label-Guided Multi Task Learning for Scene Understanding0
Scene Text Detection for Augmented Reality -- Character Bigram Approach to reduce False Positive Rate0
P4Contrast: Contrastive Learning with Pairs of Point-Pixel Pairs for RGB-D Scene Understanding0
Classification of Single-View Object Point Clouds0
Embodied Visual Active Learning for Semantic Segmentation0
Event-based Motion Segmentation with Spatio-Temporal Graph CutsCode1
Exploring Data-Efficient 3D Scene Understanding with Contrastive Scene ContextsCode1
Practical Auto-Calibration for Spatial Scene-Understanding from Crowdsourced Dashcamera Videos0
Image-Graph-Image Translation via Auto-Encoding0
Competitive Simplicity for Multi-Task Learning for Real-Time Foggy Scene Understanding via Domain Adaptation0
Multi-Model Learning for Real-Time Automotive Semantic Foggy Scene Understanding via Domain Adaptation0
Robust Neural Routing Through Space Partitions for Camera Relocalization in Dynamic Indoor EnvironmentsCode1
FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene UnderstandingCode1
Understanding Bird's-Eye View of Road Semantics using an Onboard CameraCode1
Towards Part-Based Understanding of RGB-D ScansCode1
Group Contextual Encoding for 3D Point CloudsCode1
RfD-Net: Point Scene Understanding by Semantic Instance ReconstructionCode1
Exploring Deep 3D Spatial Encodings for Large-Scale 3D Scene Understanding0
Multi-task GANs for Semantic Segmentation and Depth Completion with Cycle Consistency0
Visual place recognition: A survey from deep learning perspectiveCode1
The Devil is in the Boundary: Exploiting Boundary Representation for Basis-based Instance Segmentation0
Show:102550
← PrevPage 25 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified