SOTAVerified

Scene Parsing

Scene parsing is the task of segmenting an image into regions, each associated with a semantic category such as sky, road, person, or bed. (Source: MIT description)

Papers

Showing 51–100 of 199 papers

Title | Status | Hype
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation |  | 0
Radar Spectra-Language Model for Automotive Scene Parsing |  | 0
Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024 |  | 0
Few-Shot Fruit Segmentation via Transfer Learning | Code | 0
Compositional Factorization of Visual Scenes with Convolutional Sparse Coding and Resonator Networks |  | 0
HAPNet: Toward Superior RGB-Thermal Scene Parsing via Hybrid, Asymmetric, and Progressive Heterogeneous Feature Fusion | Code | 0
Feature boosting with efficient attention for scene parsing |  | 0
Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene Parsing | Code | 0
LF Tracy: A Unified Single-Pipeline Approach for Salient Object Detection in Light Field Cameras | Code | 0
SAI3D: Segment Any Instance in 3D Scenes |  | 0
A Data-efficient Framework for Robotics Large-scale LiDAR Scene Parsing | Code | 0
A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors |  | 0
CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration |  | 0
RoadFormer: Duplex Transformer for RGB-Normal Semantic Road Scene Parsing |  | 0
CACFNet: Cross-Modal Attention Cascaded Fusion Network for RGB-T Urban Scene Parsing |  | 0
Improving Panoptic Segmentation for Nighttime or Low-Illumination Urban Driving Scenes | Code | 0
Semantic Segmentation on VSPW Dataset through Contrastive Loss and Multi-dataset Training Approach |  | 0
Recyclable Semi-supervised Method Based on Multi-model Ensemble for Video Scene Parsing |  | 0
Cross-CBAM: A Lightweight network for Scene Segmentation |  | 0
Treasure What You Have: Exploiting Similarity in Deep Neural Networks for Efficient Video Processing |  | 0
Local and Global Contextual Features Fusion for Pedestrian Intention Prediction |  | 0
Re:PolyWorld - A Graph Neural Network for Polygonal Scene Parsing |  | 0
Visual Traffic Knowledge Graph Generation from Scene Images |  | 0
Weakly Supervised Class-Agnostic Motion Prediction for Autonomous Driving |  | 0
Multi-Sem Fusion: Multimodal Semantic Fusion for 3D Object Detection |  | 0
GEBNet: Graph-Enhancement Branch Network for RGB-T Scene Parsing |  | 0
FLOAT: Factorized Learning of Object Attributes for Improved Multi-object Multi-part Scene Parsing | Code | 0
Boundary Corrected Multi-scale Fusion Network for Real-time Semantic Segmentation |  | 0
Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling |  | 0
Fully Decoupled Residual ConvNet for Real-Time Railway Scene Parsing of UAV Aerial Images |  | 0
ESCNet: Gaze Target Detection With the Understanding of 3D Scenes |  | 0
MSP : Refine Boundary Segmentation via Multiscale Superpixel |  | 0
TBN-ViT: Temporal Bilateral Network with Vision Transformer for Video Scene Parsing |  | 0
Exploiting Spatial-Temporal Semantic Consistency for Video Scene Parsing |  | 0
Semantic Segmentation on VSPW Dataset through Aggregation of Transformer Models |  | 0
Memory Based Video Scene Parsing |  | 0
Window Detection In Facade Imagery: A Deep Learning Approach Using Mask R-CNN |  | 0
VSPW: A Large-scale Dataset for Video Scene Parsing in the Wild |  | 0
Aerial-PASS: Panoramic Annular Scene Segmentation in Drone Videos |  | 0
Perception Framework through Real-Time Semantic Segmentation and Scene Recognition on a Wearable System for the Visually Impaired |  | 0
SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from Monocular images |  | 0
ORDNet: Capturing Omni-Range Dependencies for Scene Parsing |  | 0
An Evolution of CNN Object Classifiers on Low-Resolution Images |  | 0
Interaction via Bi-Directional Graph of Semantic Region Affinity for Scene Parsing |  | 0
Class Attention Network for Semantic Segmentation of Remote Sensing Images |  | 0
Multi-layer Feature Aggregation for Deep Scene Parsing Models |  | 0
A Dilated Residual Hierarchically Fashioned Segmentation Framework for Extracting Gleason Tissues and Grading Prostate Cancer from Whole Slide Images | Code | 0
LID 2020: The Learning from Imperfect Data Challenge Results |  | 0
Automatic Quantification of Settlement Damage using Deep Learning of Satellite Images |  | 0
GINet: Graph Interaction Network for Scene Parsing | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | PGDPNet | Total Accuracy | 84.7 |  | Unverified
2 | Inter-GPS | Total Accuracy | 27.3 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VCD No Coarse | mIoU | 82.3 |  | Unverified