SOTAVerified

Scene Understanding

Scene understanding involves interpreting the visual information of a scene, including objects, their spatial relationships, and the overall layout. It goes beyond simple object recognition by considering the context and how objects relate to each other and the environment.

Papers

Showing 12511300 of 1723 papers

TitleStatusHype
Bridging Scene Understanding and Task Execution with Flexible Simulation Environments0
RELLIS-3D Dataset: Data, Benchmarks and AnalysisCode1
SeasonDepth: Cross-Season Monocular Depth Prediction Dataset and Benchmark under Multiple EnvironmentsCode1
FlowCaps: Optical Flow Estimation with Capsule Networks For Action Recognition0
Towards Efficient Scene Understanding via Squeeze ReasoningCode1
Hypersim: A Photorealistic Synthetic Dataset for Holistic Indoor Scene UnderstandingCode2
S3-Net: A Fast and Lightweight Video Scene Understanding Network by Single-shot Segmentation0
Learning Regional Purity for Instance Segmentation on 3D Point CloudsCode0
Highway Driving Dataset for Semantic Video Segmentation0
Real-time Semantic Segmentation with Context Aggregation Network0
Into the Wild with AudioScope: Unsupervised Audio-Visual Separation of On-Screen Sounds0
Auto-Panoptic: Cooperative Multi-Component Architecture Search for Panoptic SegmentationCode1
Monocular Depth Estimation via Listwise Ranking using the Plackett-Luce ModelCode1
Axiom Learning and Belief Tracing for Transparent Decision Making in Robotics0
RADIATE: A Radar Dataset for Automotive Perception in Bad WeatherCode1
Unsupervised Foveal Vision Neural Networks with Top-Down Attention0
Learning Panoptic Segmentation from Instance ContoursCode0
DynaSLAM II: Tightly-Coupled Multi-Object Tracking and SLAM0
Constructing a Visual Relationship Authenticity DatasetCode0
Be Your Own Best Competitor! Multi-Branched Adversarial Knowledge Transfer0
ALFWorld: Aligning Text and Embodied Environments for Interactive LearningCode1
Weakly Supervised Learning of Multi-Object 3D Scene Decompositions Using Deep Shape Priors0
Semi-Supervised Learning for Multi-Task Scene Understanding by Neural Graph ConsensusCode0
MLRSNet: A Multi-label High Spatial Resolution Remote Sensing Dataset for Semantic Scene UnderstandingCode1
Semi-Supervised Learning of Multi-Object 3D Scene Representations0
Learning Category- and Instance-Aware Pixel Embedding for Fast Panoptic Segmentation0
A Survey on Deep Learning Methods for Semantic Image Segmentation in Real-Time0
Towards General Purpose Geometry-Preserving Single-View Depth Estimation0
Interactive Learning for Semantic Segmentation in Earth ObservationCode0
BoMuDANet: Unsupervised Adaptation for Visual Scene Understanding in Unstructured Driving EnvironmentsCode1
ePointDA: An End-to-End Simulation-to-Real Domain Adaptation Framework for LiDAR Point Cloud Segmentation0
Towards Semantic Segmentation of Urban-Scale 3D Point Clouds: A Dataset, Benchmarks and ChallengesCode1
On the Structures of Representation for the Robustness of Semantic Segmentation to Input CorruptionCode0
Deep Learning Techniques for Geospatial Data Analysis0
Minimal Adversarial Examples for Deep Learning on 3D Point Clouds0
TORNADO-Net: mulTiview tOtal vaRiatioN semAntic segmentation with Diamond inceptiOn module0
m2caiSeg: Semantic Segmentation of Laparoscopic Images using Convolutional Neural NetworksCode0
MLM: A Benchmark Dataset for Multitask Learning with Multiple Languages and ModalitiesCode0
DAWN: Vehicle Detection in Adverse Weather Nature Dataset0
Factor Graph based 3D Multi-Object Tracking in Point Clouds0
Campus3D: A Photogrammetry Point Cloud Benchmark for Hierarchical Understanding of Outdoor SceneCode1
Polysemy Deciphering Network for Robust Human-Object Interaction DetectionCode1
Global Context Aware Convolutions for 3D Point Cloud Understanding0
Pose-based Modular Network for Human-Object Interaction DetectionCode1
Leveraging Acoustic Images for Effective Self-Supervised Audio Representation LearningCode0
Polysemy Deciphering Network for Human-Object Interaction DetectionCode1
Weakly Supervised 3D Object Detection from Point CloudsCode1
Virtual Multi-view Fusion for 3D Semantic SegmentationCode1
OpenRooms: An End-to-End Open Framework for Photorealistic Indoor Scene Datasets0
Few-Shot Object Detection and Viewpoint Estimation for Objects in the WildCode1
Show:102550
← PrevPage 26 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.44Unverified
2Team VGAI (TCS Research)OMQ0.37Unverified
3Demo_semantic_SLAMOMQ0.11Unverified
#ModelMetricClaimedVerifiedStatus
1CPN(ResNet-101)Mean IoU46.3Unverified
#ModelMetricClaimedVerifiedStatus
1ACRV BaselineOMQ0.35Unverified