Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 601–625 of 895 papers

Title	Date	Tasks	Status
FODVid: Flow-guided Object Discovery in Videos	Jul 10, 2023	ObjectObject Discovery	—Unverified
FOMTrace: Interactive Video Segmentation By Image Graphs and Fuzzy Object Models	Jun 10, 2016	ObjectObject Tracking	—Unverified
FoodMem: Near Real-time and Precise Food Video Segmentation	Jul 16, 2024	SegmentationSemantic Segmentation	—Unverified
Fully Automated 2D and 3D Convolutional Neural Networks Pipeline for Video Segmentation and Myocardial Infarction Detection in Echocardiography	Mar 26, 2021	Binary ClassificationMyocardial infarction detection	—Unverified
Fully Connected Object Proposals for Video Segmentation	Dec 1, 2015	ObjectSegmentation	—Unverified
Fully Hyperbolic Convolutional Neural Networks	May 24, 2019	Depth EstimationGeneral Classification	—Unverified
Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation	Sep 21, 2023	ObjectReferring Video Object Segmentation	—Unverified
FusionSeg: Learning to combine motion and appearance for fully automatic segmention of generic objects in videos	Jan 19, 2017	SegmentationStructured Prediction	—Unverified
FusionSeg: Learning to Combine Motion and Appearance for Fully Automatic Segmentation of Generic Objects in Videos	Jul 1, 2017	SegmentationStructured Prediction	—Unverified
FVOS for MOSE Track of 4th PVUW Challenge: 3rd Place Solution	Apr 13, 2025	SegmentationSemantic Segmentation	—Unverified
Gamifying Video Object Segmentation	Jan 5, 2016	Interactive Video Object SegmentationObject	—Unverified
GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting	Nov 12, 2024	3DGSgraph construction	—Unverified
GenDeF: Learning Generative Deformation Field for Video Generation	Dec 7, 2023	DisentanglementVideo Editing	—Unverified
Generalized Product-of-Experts for Learning Multimodal Representations in Noisy Environments	Nov 7, 2022	3D Hand Pose EstimationHand Pose Estimation	—Unverified
Generating Masks from Boxes by Mining Spatio-Temporal Consistencies in Videos	Jan 6, 2021	ObjectSegmentation	—Unverified
Generative Video Propagation	Dec 27, 2024	Image to Video GenerationVideo Generation	—Unverified
Geodesic Distance Histogram Feature for Video Segmentation	Mar 31, 2017	SegmentationSuperpixels	—Unverified
Geometric Algebra Planes: Convex Implicit Neural Volumes	Nov 20, 2024	DecoderVideo Segmentation	—Unverified
Geometric Context from Videos	Oct 25, 2015	SegmentationVideo Segmentation	—Unverified
Global Motion Understanding in Large-Scale Video Object Segmentation	May 11, 2024	Instance SegmentationOptical Flow Estimation	—Unverified
Global Optimality Guarantees for Nonconvex Unsupervised Video Segmentation	Jul 9, 2019	ObjectSegmentation	—Unverified
GroPrompt: Efficient Grounded Prompting and Adaptation for Referring Video Object Segmentation	Jun 18, 2024	Contrastive LearningObject	—Unverified
Grouping-Based Low-Rank Trajectory Completion and 3D Reconstruction	Dec 1, 2014	3D ReconstructionClustering	—Unverified
Guess What Moves: Unsupervised Video and Image Segmentation by Anticipating Motion	May 16, 2022	Image SegmentationOptical Flow Estimation	—Unverified
HD-EPIC: A Highly-Detailed Egocentric Video Dataset	Feb 6, 2025	Action RecognitionNutrition	—Unverified

Show:10 25 50

← PrevPage 25 of 36Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified