Video Semantic Segmentation

The goal of video semantic segmentation is to assign one of a predefined set of classes to every pixel in every frame of a video. The model must therefore predict accurate segmentation masks and also keep those masks temporally consistent across frames. The task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.
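
As a rough illustration of the two requirements described here (per-pixel accuracy and temporal consistency), the sketch below computes a mean IoU accumulated over all frames of a clip, plus a crude label flip rate between consecutive frames. The function names `mean_iou` and `label_flip_rate` are hypothetical, and neither function is claimed to match the exact evaluation protocol of any benchmark listed on this page. The flip rate in particular is only a proxy: it applies no motion compensation, so genuine scene changes also count as flips.

```python
from typing import List, Sequence

Frame = List[List[int]]  # H x W grid of integer class ids (assumed layout)


def mean_iou(pred_video: Sequence[Frame], gt_video: Sequence[Frame],
             num_classes: int) -> float:
    """Mean IoU over classes, accumulated over every pixel of every frame."""
    intersection = [0] * num_classes
    union = [0] * num_classes
    for pred, gt in zip(pred_video, gt_video):
        for pred_row, gt_row in zip(pred, gt):
            for p, g in zip(pred_row, gt_row):
                if p == g:
                    # Correct pixel: counts once in both intersection and union.
                    intersection[p] += 1
                    union[p] += 1
                else:
                    # Wrong pixel: enlarges the union of both classes involved.
                    union[p] += 1
                    union[g] += 1
    # Classes that never appear in prediction or ground truth are skipped.
    ious = [i / u for i, u in zip(intersection, union) if u > 0]
    return sum(ious) / len(ious) if ious else 0.0


def label_flip_rate(pred_video: Sequence[Frame]) -> float:
    """Fraction of pixels whose predicted class changes between adjacent frames."""
    changed = 0
    total = 0
    for prev, curr in zip(pred_video, pred_video[1:]):
        for prev_row, curr_row in zip(prev, curr):
            for a, b in zip(prev_row, curr_row):
                changed += int(a != b)
                total += 1
    return changed / total if total else 0.0


# Toy two-frame clip with two classes; one pixel is mislabeled in frame 2.
gt_clip = [[[0, 0], [1, 1]], [[0, 0], [1, 1]]]
pred_clip = [[[0, 0], [1, 1]], [[0, 1], [1, 1]]]
score = mean_iou(pred_clip, gt_clip, num_classes=2)
flips = label_flip_rate(pred_clip)
```

On the toy clip, a single wrong pixel lowers the IoU of both classes it touches, and it also registers as a flip because the prediction at that location differs from the previous frame.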

Papers


Showing 426–450 of 895 papers

Title	Date	Tasks	Status
Every Frame Counts: Joint Learning of Video Segmentation and Optical Flow	Nov 28, 2019	Optical Flow Estimation, Segmentation	Unverified
Explore Synergistic Interaction Across Frames for Interactive Video Object Segmentation	Jan 23, 2024	Interactive Video Object Segmentation, Semantic Segmentation	Unverified
Eye Tracking Assisted Extraction of Attentionally Important Objects From Videos	Jun 1, 2015	Clustering, Object	Unverified
F2Net: Learning to Focus on the Foreground for Unsupervised Video Object Segmentation	Dec 4, 2020	Semantic Segmentation, Unsupervised Video Object Segmentation	Unverified
Real-Time Segmentation Networks should be Latency Aware	Apr 6, 2020	Autonomous Vehicles, Scene Segmentation	Unverified
Fast Action Proposals for Human Action Detection and Search	Jun 1, 2015	Action Detection, Video Segmentation	Unverified
Fast Sprite Decomposition from Animated Graphics	Aug 7, 2024	Semantic Segmentation, Video Object Segmentation	Unverified
Fast Video Object Segmentation via Dynamic Targeting Network	Oct 1, 2019	Object, Segmentation	Unverified
Fast Video Object Segmentation via Mask Transfer Network	Aug 28, 2019	Object, Semantic Segmentation	Unverified
Fast video object segmentation with Spatio-Temporal GANs	Mar 28, 2019	Descriptive, Object	Unverified
Fast Video Object Segmentation With Temporal Aggregation Network and Dynamic Template Matching	Jul 11, 2020	Object, Object Tracking	Unverified
FlowCut: Unsupervised Video Instance Segmentation via Temporal Mask Matching	May 19, 2025	Instance Segmentation, Segmentation	Unverified
Flow-free Video Object Segmentation	Jun 29, 2017	Clustering, Object	Unverified
Flow-guided Semi-supervised Video Object Segmentation	Jan 25, 2023	Decoder, Object	Unverified
FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object Segmentation	Nov 20, 2021	Object, Optical Flow Estimation	Unverified
FODVid: Flow-guided Object Discovery in Videos	Jul 10, 2023	Object, Object Discovery	Unverified
FOMTrace: Interactive Video Segmentation By Image Graphs and Fuzzy Object Models	Jun 10, 2016	Object, Object Tracking	Unverified
FoodMem: Near Real-time and Precise Food Video Segmentation	Jul 16, 2024	Segmentation, Semantic Segmentation	Unverified
Fully Automated 2D and 3D Convolutional Neural Networks Pipeline for Video Segmentation and Myocardial Infarction Detection in Echocardiography	Mar 26, 2021	Binary Classification, Myocardial infarction detection	Unverified
Fully Connected Object Proposals for Video Segmentation	Dec 1, 2015	Object, Segmentation	Unverified
Fully Hyperbolic Convolutional Neural Networks	May 24, 2019	Depth Estimation, General Classification	Unverified
Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation	Sep 21, 2023	Object, Referring Video Object Segmentation	Unverified
FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos	Jan 19, 2017	Segmentation, Structured Prediction	Unverified
FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos	Jul 1, 2017	Segmentation, Structured Prediction	Unverified
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution	Apr 13, 2025	Segmentation, Semantic Segmentation	Unverified


Datasets: Cityscapes val, CamVid, VSPW, LaRS, Multispectral Video Semantic Segmentation

Benchmark Results

Dataset: Cityscapes val
#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
3	TDNet-50 [9]	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

Dataset: CamVid
#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

Dataset: VSPW
#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

Dataset: LaRS
#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

Dataset: Multispectral Video Semantic Segmentation
#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified