SOTAVerified

Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Showing 851875 of 895 papers

TitleStatusHype
LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-trainingCode0
Local Memory Attention for Fast Video Semantic SegmentationCode0
Ground-truth or DAER: Selective Re-query of Secondary InformationCode0
DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-ResolutionCode0
D3S -- A Discriminative Single Shot Segmentation TrackerCode0
Video Object Segmentation With Dynamic Memory Networks and Adaptive Object AlignmentCode0
Unsupervised Online Video Object Segmentation with Motion Property UnderstandingCode0
Spatiotemporal CNN for Video Object SegmentationCode0
arcjetCV: an open-source software to analyze material ablationCode0
Annolid: Annotate, Segment, and Track Anything You NeedCode0
Spatio-Temporal Pixel-Level Contrastive Learning-based Source-Free Domain Adaptation for Video Semantic SegmentationCode0
AGSS-VOS: Attention Guided Single-Shot Video Object SegmentationCode0
Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and TrackingCode0
Unsupervised Video Object Segmentation for Deep Reinforcement LearningCode0
A Generative Appearance Model for End-to-end Video Object SegmentationCode0
Learning Video Object Segmentation from Static ImagesCode0
Stable Mean Teacher for Semi-supervised Video Action DetectionCode0
Learning Unsupervised Video Object Segmentation Through Visual AttentionCode0
Curriculum Learning for Recurrent Video Object SegmentationCode0
Biomedical SAM 2: Segment Anything in Biomedical Images and VideosCode0
STFCN: Spatio-Temporal FCN for Semantic Video SegmentationCode0
Stochastic Block Models are a Discrete Surface TensionCode0
Representation Recycling for Streaming Video AnalysisCode0
Video Object Segmentation with Re-identificationCode0
An Image Processing Pipeline for Camera Trap Time-Lapse RecordingsCode0
Show:102550
← PrevPage 35 of 36Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TMANet-50mIoU80.3Unverified
2DeltaDist-DDRNet-39mIoU79.9Unverified
3TDNet-50 [9]mIoU79.9Unverified
4PSPNet-101 [20]mIoU79.7Unverified
5PSPNet-50 [20]mIoU78.1Unverified
6LVS [12]mIoU76.8Unverified
7GRFP [15]mIoU73.6Unverified
8FCN-50 [14]mIoU70.1Unverified
9DFF [22]mIoU69.2Unverified
#ModelMetricClaimedVerifiedStatus
1TMANet-50Mean IoU76.5Unverified
2ETC-MobileNetMean IoU76.3Unverified
3TDNet-50Mean IoU76.2Unverified
4PSPNet-50Mean IoU76Unverified
5NetwarpMean IoU74.7Unverified
6GRFPMean IoU67.1Unverified
#ModelMetricClaimedVerifiedStatus
1DVIS++(VIT-L)mIoU63.8Unverified
2UniVS(Swin-L)mIoU59.8Unverified
3Tube-Link(Swin-large)mIoU59.6Unverified
4MRCFA(MiT-B5)mIoU49.9Unverified
5CFFM(MiT-B5)mIoU49.3Unverified
#ModelMetricClaimedVerifiedStatus
1WaSR-T (ResNet-101)Q60.1Unverified
2TMANet (ResNet-50)Q57.5Unverified
3CSANet (ResNet-101)Q49.1Unverified
#ModelMetricClaimedVerifiedStatus
1MVNet(DeepLabV3)mIoU54.52Unverified
2MVNet(PSPNet)mIoU54.36Unverified
3MVNet(FCN)mIoU53.9Unverified