SOTAVerified

Scene Parsing

Scene parsing segments an image into regions, each associated with a semantic category such as sky, road, person, or bed. (Description source: MIT)

Papers

Showing 1-50 of 199 papers

Title | Status | Hype
OneFormer: One Transformer to Rule Universal Image Segmentation | Code | 3
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Code | 3
Robust Shape Fitting for 3D Scene Abstraction | Code | 2
Kimera: from SLAM to Spatial Perception with 3D Dynamic Scene Graphs | Code | 2
OCNet: Object Context Network for Scene Parsing | Code | 2
Editable Free-viewpoint Video Using a Layered Neural Representation | Code | 1
Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction | Code | 1
A Dense Material Segmentation Dataset for Indoor and Outdoor Scene Parsing | Code | 1
Pointly-supervised 3D Scene Parsing with Viewpoint Bottleneck | Code | 1
Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning | Code | 1
Sketching Image Gist: Human-Mimetic Hierarchical Scene Graph Generation | Code | 1
Traffic Scene Parsing through the TSP6K Dataset | Code | 1
Resource Efficient Mountainous Skyline Extraction using Shallow Learning | Code | 1
BORM: Bayesian Object Relation Model for Indoor Scene Recognition | Code | 1
Pyramidal Convolution: Rethinking Convolutional Neural Networks for Visual Recognition | Code | 1
Global Aggregation then Local Distribution for Scene Parsing | Code | 1
GFF: Gated Fully Fusion for Semantic Segmentation | Code | 1
Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene Parsing | Code | 1
Strip Pooling: Rethinking Spatial Pooling for Scene Parsing | Code | 1
Deep Dual-resolution Networks for Real-time and Accurate Semantic Segmentation of Road Scenes | Code | 1
Panoptic Segmentation | Code | 1
Plane Geometry Diagram Parsing | Code | 1
EPSNet: Efficient Panoptic Segmentation Network with Cross-layer Attention Fusion | Code | 1
Semantic Flow for Fast and Accurate Scene Parsing | Code | 1
CascadePSP: Toward Class-Agnostic and Very High-Resolution Segmentation via Global and Local Refinement | Code | 1
TO-Scene: A Large-scale Dataset for Understanding 3D Tabletop Scenes | Code | 1
VIBUS: Data-efficient 3D Scene Parsing with VIewpoint Bottleneck and Uncertainty-Spectrum Modeling | Code | 1
RT-K-Net: Revisiting K-Net for Real-Time Panoptic Segmentation | Code | 1
3D-to-2D Distillation for Indoor Scene Parsing | Code | 1
Part-aware Panoptic Segmentation | Code | 1
Fast and Accurate Scene Parsing via Bi-direction Alignment Networks | Code | 1
Context-Aware Synthesis and Placement of Object Instances | Code | 1
AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing | Code | 1
Evidential fully convolutional network for semantic segmentation | Code | 1
DPF: Learning Dense Prediction Fields with Weak Supervision | Code | 1
Mesh Convolution with Continuous Filters for 3D Surface Parsing | Code | 1
Edge-aware Guidance Fusion Network for RGB Thermal Scene Parsing | Code | 1
Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning | Code | 1
Minimal Solvers for Single-View Lens-Distorted Camera Auto-Calibration | Code | 1
EGFNet: Edge-Aware Guidance Fusion Network for RGB–Thermal Urban Scene Parsing | Code | 1
Pyramid Scene Parsing Network | Code | 1
Boosting Night-time Scene Parsing with Learnable Frequency | Code | 1
Compositional Factorization of Visual Scenes with Convolutional Sparse Coding and Resonator Networks | | 0
A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors | | 0
Adaptive Context Network for Scene Parsing | | 0
Class Attention Network for Semantic Segmentation of Remote Sensing Images | | 0
CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration | | 0
An Evolution of CNN Object Classifiers on Low-Resolution Images | | 0
3D Scene Parsing via Class-Wise Adaptation | | 0
CaseNet: Content-Adaptive Scale Interaction Networks for Scene Parsing | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PGDPNet | Total Accuracy | 84.7 | | Unverified
2 | Inter-GPS | Total Accuracy | 27.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VCD No Coarse | mIoU | 82.3 | | Unverified
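
For readers unfamiliar with the mIoU metric reported above, the following is a minimal sketch of how mean intersection-over-union is commonly computed from a predicted and a ground-truth label map. The function name `mean_iou`, the toy label maps, and the class meanings are illustrative assumptions, not taken from any listed paper or benchmark. Actual benchmark protocols may differ, for example by ignoring certain labels or accumulating counts over a whole dataset before averaging.

```python
def mean_iou(pred, target, num_classes):
    """Mean IoU over the classes that occur in pred or target.

    pred and target are equally sized 2D lists of integer class ids.
    This is a hypothetical helper for illustration only.
    """
    intersection = [0] * num_classes
    pred_count = [0] * num_classes
    target_count = [0] * num_classes

    # Count per-class pixels and per-class agreements.
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            pred_count[p] += 1
            target_count[t] += 1
            if p == t:
                intersection[p] += 1

    # IoU = intersection / union, skipping classes absent from both maps.
    ious = []
    for c in range(num_classes):
        union = pred_count[c] + target_count[c] - intersection[c]
        if union > 0:
            ious.append(intersection[c] / union)
    return sum(ious) / len(ious) if ious else 0.0


# Toy 2x3 label maps (assumed classes: 0 = sky, 1 = road, 2 = person).
target = [[0, 0, 1],
          [1, 1, 2]]
pred = [[0, 1, 1],
        [1, 1, 1]]

score = mean_iou(pred, target, num_classes=3)
print(f"mIoU = {score:.4f}")
```

Leaderboards typically report this value as a percentage, so a score such as 82.3 would correspond to 0.823 on the scale returned by this sketch.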