Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 801–825 of 895 papers

Title	Date	Tasks	Status
Fast Pixel-Matching for Video Object Segmentation	Jul 9, 2021	ObjectSemantic Segmentation	CodeCode Available
SegFlow: Joint Learning for Video Object Segmentation and Optical Flow	Sep 20, 2017	Image SegmentationObject	CodeCode Available
Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation	Jul 15, 2023	DecoderSegmentation	CodeCode Available
Fast Interactive Video Object Segmentation with Graph Neural Networks	Mar 5, 2021	Graph Neural NetworkInteractive Video Object Segmentation	CodeCode Available
Multigrid Predictive Filter Flow for Unsupervised Learning on Videos	Apr 2, 2019	Optical Flow EstimationPose Tracking	CodeCode Available
Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy	Jan 6, 2025	Video SegmentationVideo Semantic Segmentation	CodeCode Available
Analyzing Linear Dynamical Systems: From Modeling to Coding and Learning	Aug 3, 2016	Dictionary LearningGeneral Classification	CodeCode Available
Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation	Sep 20, 2023	Image SegmentationSegmentation	CodeCode Available
Fast and Accurate Online Video Object Segmentation via Tracking Parts	Jun 6, 2018	Semantic SegmentationSemi-Supervised Video Object Segmentation	CodeCode Available
Exploiting Temporality for Semi-Supervised Video Segmentation	Aug 29, 2019	DecoderImage Segmentation	CodeCode Available
Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation	Jan 9, 2025	Referring Video Object SegmentationSemantic Segmentation	CodeCode Available
Self-supervised Amodal Video Object Segmentation	Oct 23, 2022	ObjectSegmentation	CodeCode Available
Self-supervised Learning for Video Correspondence Flow	May 2, 2019	Self-Supervised LearningSemi-Supervised Video Object Segmentation	CodeCode Available
Expression Prompt Collaboration Transformer for Universal Referring Video Object Segmentation	Aug 8, 2023	Contrastive LearningObject	CodeCode Available
MSN: Efficient Online Mask Selection Network for Video Instance Segmentation	Jun 19, 2021	Instance SegmentationSegmentation	CodeCode Available
MSEG-VCUQ: Multimodal SEGmentation with Enhanced Vision Foundation Models, Convolutional Neural Networks, and Uncertainty Quantification for High-Speed Video Phase Detection Data	Nov 12, 2024	SegmentationUncertainty Quantification	CodeCode Available
Self-supervised Video Object Segmentation	Jun 22, 2020	ObjectOne-shot visual object segmentation	CodeCode Available
Borrowing from yourself: Faster future video segmentation with partial channel update	Feb 11, 2022	Future predictionSemantic Segmentation	CodeCode Available
MHP-VOS: Multiple Hypotheses Propagation for Video Object Segmentation	Apr 17, 2019	Decision MakingObject	CodeCode Available
MoDA: Leveraging Motion Priors from Videos for Advancing Unsupervised Domain Adaptation in Semantic Segmentation	Sep 21, 2023	Domain AdaptationImage Segmentation	CodeCode Available
Efficient Video Object Segmentation via Network Modulation	Feb 4, 2018	ObjectSegmentation	CodeCode Available
Meta Learning Deep Visual Words for Fast Video Object Segmentation	Dec 4, 2018	Meta-LearningObject	CodeCode Available
Trusted Video Inpainting Localization via Deep Attentive Noise Learning	Jun 19, 2024	Semantic SegmentationVideo Inpainting	CodeCode Available
Strike the Balance: On-the-Fly Uncertainty based User Interactions for Long-Term Video Object Segmentation	Jul 31, 2024	ObjectSegmentation	CodeCode Available
Efficient Frame Extraction: A Novel Approach Through Frame Similarity and Surgical Tool Tracking for Video Segmentation	Jan 19, 2025	Video SegmentationVideo Semantic Segmentation	CodeCode Available

Show:10 25 50

← PrevPage 33 of 36Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified