SOTAVerified

Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Showing 826850 of 895 papers

TitleStatusHype
Selective Video Object Cutout0
FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos0
Semantic Video Segmentation by Gated Recurrent Flow Propagation0
Unsupervised Video Segmentation via Spatio-Temporally Nonlocal Appearance Learning0
Video Propagation Networks0
Learning Video Object Segmentation from Static ImagesCode0
Convolutional Gated Recurrent Networks for Video Segmentation0
Can Ground Truth Label Propagation from Video help Semantic Segmentation?0
Towards Segmenting Consumer Stereo Videos: Benchmark, Baselines and Ensembles0
STFCN: Spatio-Temporal FCN for Semantic Video SegmentationCode0
Object Detection, Tracking, and Motion Segmentation for Object-level Video Segmentation0
Analyzing Linear Dynamical Systems: From Modeling to Coding and LearningCode0
Approximate Policy Iteration for Budgeted Semantic Video Segmentation0
Click Carving: Segmenting Objects in Video with Point Clicks0
Deep Learning Markov Random Field for Semantic Segmentation0
FOMTrace: Interactive Video Segmentation By Image Graphs and Fuzzy Object Models0
Point-wise mutual information-based video segmentation with high temporal consistency0
Semi-Supervised Domain Adaptation for Weakly Labeled Semantic Video Object Segmentation0
Instance-Level Video Segmentation From Object Tracks0
A Benchmark Dataset and Evaluation Methodology for Video Object SegmentationCode0
Recurrent Fully Convolutional Networks for Video Segmentation0
Video Segmentation via Object Flow0
Coherent Parametric Contours for Interactive Video Object Segmentation0
Bilateral Space Video Segmentation0
Feature Space Optimization for Semantic Video SegmentationCode0
Show:102550
← PrevPage 34 of 36Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TMANet-50mIoU80.3Unverified
2TDNet-50 [9]mIoU79.9Unverified
3DeltaDist-DDRNet-39mIoU79.9Unverified
4PSPNet-101 [20]mIoU79.7Unverified
5PSPNet-50 [20]mIoU78.1Unverified
6LVS [12]mIoU76.8Unverified
7GRFP [15]mIoU73.6Unverified
8FCN-50 [14]mIoU70.1Unverified
9DFF [22]mIoU69.2Unverified
#ModelMetricClaimedVerifiedStatus
1TMANet-50Mean IoU76.5Unverified
2ETC-MobileNetMean IoU76.3Unverified
3TDNet-50Mean IoU76.2Unverified
4PSPNet-50Mean IoU76Unverified
5NetwarpMean IoU74.7Unverified
6GRFPMean IoU67.1Unverified
#ModelMetricClaimedVerifiedStatus
1DVIS++(VIT-L)mIoU63.8Unverified
2UniVS(Swin-L)mIoU59.8Unverified
3Tube-Link(Swin-large)mIoU59.6Unverified
4MRCFA(MiT-B5)mIoU49.9Unverified
5CFFM(MiT-B5)mIoU49.3Unverified
#ModelMetricClaimedVerifiedStatus
1WaSR-T (ResNet-101)Q60.1Unverified
2TMANet (ResNet-50)Q57.5Unverified
3CSANet (ResNet-101)Q49.1Unverified
#ModelMetricClaimedVerifiedStatus
1MVNet(DeepLabV3)mIoU54.52Unverified
2MVNet(PSPNet)mIoU54.36Unverified
3MVNet(FCN)mIoU53.9Unverified