SOTAVerified

Scene Parsing

Scene parsing is to segment and parse an image into different image regions associated with semantic categories, such as sky, road, person, and bed. MIT Description

Papers

Showing 5175 of 199 papers

TitleStatusHype
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation0
Radar Spectra-Language Model for Automotive Scene Parsing0
Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW20240
Few-Shot Fruit Segmentation via Transfer LearningCode0
Compositional Factorization of Visual Scenes with Convolutional Sparse Coding and Resonator Networks0
HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature FusionCode0
Feature boosting with efficient attention for scene parsing0
Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene ParsingCode0
LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field CamerasCode0
SAI3D: Segment Any Instance in 3D Scenes0
A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors0
A Data-efficient Framework for Robotics Large-scale LiDAR Scene ParsingCode0
CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration0
RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene Parsing0
CACFNet: Cross-Modal Attention Cascaded Fusion Network for RGB-T Urban Scene Parsing0
Improving Panoptic Segmentation for Nighttime or Low-Illumination Urban Driving ScenesCode0
Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach0
Recyclable Semi-supervised Method Based on Multi-model Ensemble for Video Scene Parsing0
Cross-CBAM: A Lightweight network for Scene Segmentation0
Treasure What You Have: Exploiting Similarity in Deep Neural Networks for Efficient Video Processing0
Local and Global Contextual Features Fusion for Pedestrian Intention Prediction0
Weakly Supervised Class-Agnostic Motion Prediction for Autonomous Driving0
Re:PolyWorld - A Graph Neural Network for Polygonal Scene Parsing0
Visual Traffic Knowledge Graph Generation from Scene Images0
Multi-Sem Fusion: Multimodal Semantic Fusion for 3D Object Detection0
Show:102550
← PrevPage 3 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1PGDPNetTotal Accuracy84.7Unverified
2Inter-GPSTotal Accuracy27.3Unverified
#ModelMetricClaimedVerifiedStatus
1VCD No CoarsemIoU82.3Unverified