SOTAVerified

Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Showing 551575 of 895 papers

TitleStatusHype
D3S - A Discriminative Single Shot Segmentation Tracker0
DAVOS: Semi-Supervised Video Object Segmentation via Adversarial Domain Adaptation0
Decoupled Motion Expression Video Segmentation0
Deep End2End Voxel2Voxel Prediction0
Deep learning approaches to surgical video segmentation and object detection: A Scoping Review0
Deep Learning Markov Random Field for Semantic Segmentation0
Deeply Interleaved Two-Stream Encoder for Referring Video Segmentation0
DeepPyramid+: Medical Image Segmentation using Pyramid View Fusion and Deformable Pyramid Reception0
Deep Spatio-Temporal Random Fields for Efficient Video Segmentation0
Deep Transport Network for Unsupervised Video Object Segmentation0
Deep Unfolding-Aided Parameter Tuning for Plug-and-Play-Based Video Snapshot Compressive Imaging0
Design Pseudo Ground Truth with Motion Cue for Unsupervised Video Object Segmentation0
DeU-Net: Deformable U-Net for 3D Cardiac MRI Video Segmentation0
DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation0
Discrete-Continuous ADMM for Transductive Inference in Higher-Order MRFs0
Discriminative Online Learning for Fast Video Object Segmentation0
Distortion-Aware Network Pruning and Feature Reuse for Real-time Video Segmentation0
DNN-Driven Compressive Offloading for Edge-Assisted Semantic Video Segmentation0
Harnessing Vision-Language Pretrained Models with Temporal-Aware Adaptation for Referring Video Object Segmentation0
Dual Temporal Memory Network for Efficient Video Object Segmentation0
A Semi-Self-Supervised Approach for Dense-Pattern Video Object Segmentation0
Dynamic Face Video Segmentation via Reinforcement Learning0
Dynamic Video Segmentation Network0
Efficient Heterogeneous Video Segmentation at the Edge0
Efficient Long-Short Temporal Attention Network for Unsupervised Video Object Segmentation0
Show:102550
← PrevPage 23 of 36Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TMANet-50mIoU80.3Unverified
2DeltaDist-DDRNet-39mIoU79.9Unverified
3TDNet-50 [9]mIoU79.9Unverified
4PSPNet-101 [20]mIoU79.7Unverified
5PSPNet-50 [20]mIoU78.1Unverified
6LVS [12]mIoU76.8Unverified
7GRFP [15]mIoU73.6Unverified
8FCN-50 [14]mIoU70.1Unverified
9DFF [22]mIoU69.2Unverified
#ModelMetricClaimedVerifiedStatus
1TMANet-50Mean IoU76.5Unverified
2ETC-MobileNetMean IoU76.3Unverified
3TDNet-50Mean IoU76.2Unverified
4PSPNet-50Mean IoU76Unverified
5NetwarpMean IoU74.7Unverified
6GRFPMean IoU67.1Unverified
#ModelMetricClaimedVerifiedStatus
1DVIS++(VIT-L)mIoU63.8Unverified
2UniVS(Swin-L)mIoU59.8Unverified
3Tube-Link(Swin-large)mIoU59.6Unverified
4MRCFA(MiT-B5)mIoU49.9Unverified
5CFFM(MiT-B5)mIoU49.3Unverified
#ModelMetricClaimedVerifiedStatus
1WaSR-T (ResNet-101)Q60.1Unverified
2TMANet (ResNet-50)Q57.5Unverified
3CSANet (ResNet-101)Q49.1Unverified
#ModelMetricClaimedVerifiedStatus
1MVNet(DeepLabV3)mIoU54.52Unverified
2MVNet(PSPNet)mIoU54.36Unverified
3MVNet(FCN)mIoU53.9Unverified