SOTAVerified

Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Showing 851895 of 895 papers

TitleStatusHype
LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-trainingCode0
Local Memory Attention for Fast Video Semantic SegmentationCode0
Ground-truth or DAER: Selective Re-query of Secondary InformationCode0
DaBiT: Depth and Blur informed Transformer for Joint Refocusing and Super-ResolutionCode0
D3S -- A Discriminative Single Shot Segmentation TrackerCode0
Video Object Segmentation With Dynamic Memory Networks and Adaptive Object AlignmentCode0
Unsupervised Online Video Object Segmentation with Motion Property UnderstandingCode0
Spatiotemporal CNN for Video Object SegmentationCode0
arcjetCV: an open-source software to analyze material ablationCode0
Annolid: Annotate, Segment, and Track Anything You NeedCode0
Spatio-Temporal Pixel-Level Contrastive Learning-based Source-Free Domain Adaptation for Video Semantic SegmentationCode0
AGSS-VOS: Attention Guided Single-Shot Video Object SegmentationCode0
Leveraging Vision-Language Models for Open-Vocabulary Instance Segmentation and TrackingCode0
Unsupervised Video Object Segmentation for Deep Reinforcement LearningCode0
A Generative Appearance Model for End-to-end Video Object SegmentationCode0
Learning Video Object Segmentation from Static ImagesCode0
Stable Mean Teacher for Semi-supervised Video Action DetectionCode0
Learning Unsupervised Video Object Segmentation Through Visual AttentionCode0
Curriculum Learning for Recurrent Video Object SegmentationCode0
Biomedical SAM 2: Segment Anything in Biomedical Images and VideosCode0
STFCN: Spatio-Temporal FCN for Semantic Video SegmentationCode0
Stochastic Block Models are a Discrete Surface TensionCode0
Representation Recycling for Streaming Video AnalysisCode0
Video Object Segmentation with Re-identificationCode0
An Image Processing Pipeline for Camera Trap Time-Lapse RecordingsCode0
Adaptive ROI Generation for Video Object Segmentation Using Reinforcement LearningCode0
Sub-token ViT Embedding via Stochastic Resonance TransformersCode0
When SAM2 Meets Video Shadow and Mirror DetectionCode0
CRVOS: Clue Refining Network for Video Object SegmentationCode0
UVid-Net: Enhanced Semantic Segmentation of UAV Aerial Videos by Embedding Temporal InformationCode0
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited SamplesCode0
Learning Correspondence from the Cycle-Consistency of TimeCode0
Infer from What You Have Seen Before: Temporally-dependent Classifier for Semi-supervised Video SegmentationCode0
Continuous Spatio-Temporal Memory Networks for 4D Cardiac Cine MRI SegmentationCode0
Implicit Motion-Compensated Network for Unsupervised Video Object SegmentationCode0
Hybrid-S2S: Video Object Segmentation with Recurrent Networks and Correspondence MatchingCode0
Adaptive Memory Management for Video Object SegmentationCode0
Holistic Prototype Attention Network for Few-Shot VOSCode0
Hierarchical Graph Pattern Understanding for Zero-Shot VOSCode0
Temporal-aware Hierarchical Mask Classification for Video Semantic SegmentationCode0
Vanishing-Point-Guided Video Semantic Segmentation of Driving ScenesCode0
GSVNet: Guided Spatially-Varying Convolution for Fast Semantic Segmentation on VideoCode0
Co-attention Propagation Network for Zero-Shot Video Object SegmentationCode0
VideoSAM: A Large Vision Foundation Model for High-Speed Video SegmentationCode0
GL-Fusion: Global-Local Fusion Network for Multi-view Echocardiogram Video SegmentationCode0
Show:102550
← PrevPage 18 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TMANet-50mIoU80.3Unverified
2TDNet-50 [9]mIoU79.9Unverified
3DeltaDist-DDRNet-39mIoU79.9Unverified
4PSPNet-101 [20]mIoU79.7Unverified
5PSPNet-50 [20]mIoU78.1Unverified
6LVS [12]mIoU76.8Unverified
7GRFP [15]mIoU73.6Unverified
8FCN-50 [14]mIoU70.1Unverified
9DFF [22]mIoU69.2Unverified
#ModelMetricClaimedVerifiedStatus
1TMANet-50Mean IoU76.5Unverified
2ETC-MobileNetMean IoU76.3Unverified
3TDNet-50Mean IoU76.2Unverified
4PSPNet-50Mean IoU76Unverified
5NetwarpMean IoU74.7Unverified
6GRFPMean IoU67.1Unverified
#ModelMetricClaimedVerifiedStatus
1DVIS++(VIT-L)mIoU63.8Unverified
2UniVS(Swin-L)mIoU59.8Unverified
3Tube-Link(Swin-large)mIoU59.6Unverified
4MRCFA(MiT-B5)mIoU49.9Unverified
5CFFM(MiT-B5)mIoU49.3Unverified
#ModelMetricClaimedVerifiedStatus
1WaSR-T (ResNet-101)Q60.1Unverified
2TMANet (ResNet-50)Q57.5Unverified
3CSANet (ResNet-101)Q49.1Unverified
#ModelMetricClaimedVerifiedStatus
1MVNet(DeepLabV3)mIoU54.52Unverified
2MVNet(PSPNet)mIoU54.36Unverified
3MVNet(FCN)mIoU53.9Unverified