SOTAVerified

Scene Parsing

Scene parsing segments an image into regions, each associated with a semantic category such as sky, road, person, or bed. (Description source: MIT)

Papers

Showing 1–50 of 199 papers

| Title | Status | Hype |
|---|---|---|
| A Comprehensive Survey on Video Scene Parsing: Advances, Challenges, and Prospects | | 0 |
| DepthMatch: Semi-Supervised RGB-D Scene Parsing through Depth-Guided Regularization | | 0 |
| MCCD: Multi-Agent Collaboration-based Compositional Diffusion for Complex Text-to-Image Generation | | 0 |
| Fully Exploiting Vision Foundation Model's Profound Prior Knowledge for Generalizable RGB-Depth Driving Scene Parsing | | 0 |
| Hardware implementation of timely reliable Bayesian decision-making using memristors | | 0 |
| OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing | | 0 |
| RoadFormer+: Delivering RGB-X Scene Parsing through Scale-Aware Information Decoupling and Advanced Heterogeneous Feature Fusion | | 0 |
| Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning | Code | 1 |
| PIG: Prompt Images Guidance for Night-Time Scene Parsing | Code | 0 |
| 1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation | | 0 |
| Radar Spectra-Language Model for Automotive Scene Parsing | | 0 |
| Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024 | | 0 |
| Few-Shot Fruit Segmentation via Transfer Learning | Code | 0 |
| Compositional Factorization of Visual Scenes with Convolutional Sparse Coding and Resonator Networks | | 0 |
| HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature Fusion | Code | 0 |
| Robust Shape Fitting for 3D Scene Abstraction | Code | 2 |
| Feature boosting with efficient attention for scene parsing | | 0 |
| Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene Parsing | Code | 0 |
| LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras | Code | 0 |
| SAI3D: Segment Any Instance in 3D Scenes | | 0 |
| A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors | | 0 |
| A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing | Code | 0 |
| Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Code | 3 |
| CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration | | 0 |
| RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene Parsing | | 0 |
| CACFNet: Cross-Modal Attention Cascaded Fusion Network for RGB-T Urban Scene Parsing | | 0 |
| EGFNet: Edge-Aware Guidance Fusion Network for RGB–Thermal Urban Scene Parsing | Code | 1 |
| Improving Panoptic Segmentation for Nighttime or Low-Illumination Urban Driving Scenes | Code | 0 |
| Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach | | 0 |
| Recyclable Semi-supervised Method Based on Multi-model Ensemble for Video Scene Parsing | | 0 |
| Cross-CBAM: A Lightweight network for Scene Segmentation | | 0 |
| Treasure What You Have: Exploiting Similarity in Deep Neural Networks for Efficient Video Processing | | 0 |
| RT-K-Net: Revisiting K-Net for Real-Time Panoptic Segmentation | Code | 1 |
| Local and Global Contextual Features Fusion for Pedestrian Intention Prediction | | 0 |
| DPF: Learning Dense Prediction Fields with Weak Supervision | Code | 1 |
| Traffic Scene Parsing through the TSP6K Dataset | Code | 1 |
| Visual Traffic Knowledge Graph Generation from Scene Images | | 0 |
| Re:PolyWorld - A Graph Neural Network for Polygonal Scene Parsing | | 0 |
| Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction | Code | 1 |
| Weakly Supervised Class-Agnostic Motion Prediction for Autonomous Driving | | 0 |
| Multi-Sem Fusion: Multimodal Semantic Fusion for 3D Object Detection | | 0 |
| OneFormer: One Transformer to Rule Universal Image Segmentation | Code | 3 |
| GEBNet: Graph-Enhancement Branch Network for RGB-T Scene Parsing | | 0 |
| VIBUS: Data-efficient 3D Scene Parsing with VIewpoint Bottleneck and Uncertainty-Spectrum Modeling | Code | 1 |
| Boosting Night-time Scene Parsing with Learnable Frequency | Code | 1 |
| A Dense Material Segmentation Dataset for Indoor and Outdoor Scene Parsing | Code | 1 |
| Plane Geometry Diagram Parsing | Code | 1 |
| FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing | Code | 0 |
| TO-Scene: A Large-scale Dataset for Understanding 3D Tabletop Scenes | Code | 1 |
| Boundary Corrected Multi-scale Fusion Network for Real-time Semantic Segmentation | | 0 |

Benchmark Results

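| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PGDPNet | Total Accuracy | 84.7 | | Unverified |
| 2 | Inter-GPS | Total Accuracy | 27.3 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | VCD No Coarse | mIoU | 82.3 | | Unverified |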