Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–375 of 895 papers

Title	Date	Tasks	Status	Hype
Reliability-Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation	Mar 25, 2023	Semantic SegmentationVideo Object Segmentation	CodeCode Available	1
CrOC: Cross-View Online Clustering for Dense Visual Representation Learning	Mar 23, 2023	ClusteringOnline Clustering	CodeCode Available	1
Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation	Mar 22, 2023	Contrastive LearningSegmentation	CodeCode Available	1
Two-shot Video Object Segmentation	Mar 21, 2023	ObjectPseudo Label	CodeCode Available	1
Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation	Mar 18, 2023	ObjectOptical Flow Estimation	CodeCode Available	1
Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation	Mar 17, 2023	SegmentationSelf-Supervised Learning	CodeCode Available	0
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation	Mar 16, 2023	Knowledge DistillationOpen Vocabulary Semantic Segmentation	CodeCode Available	1
Guided Slot Attention for Unsupervised Video Object Segmentation	Mar 15, 2023	ObjectSemantic Segmentation	CodeCode Available	1
InstMove: Instance Motion for Object-centric Video Segmentation	Mar 14, 2023	ObjectOptical Flow Estimation	CodeCode Available	2
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation	Mar 14, 2023	Contrastive LearningKnowledge Distillation	—Unverified	0
Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos	Mar 13, 2023	SegmentationSemantic Segmentation	CodeCode Available	1
Tsanet: Temporal and Scale Alignment for Unsupervised Video Object Segmentation	Mar 8, 2023	DecoderObject	—Unverified	0
A Threefold Review on Deep Semantic Segmentation: Efficiency-oriented, Temporal and Depth-aware design	Mar 8, 2023	Autonomous DrivingAutonomous Vehicles	—Unverified	0
Learning to Adapt to Online Streams with Distribution Shifts	Mar 2, 2023	BenchmarkingMeta-Learning	—Unverified	0
One-Shot Video Inpainting	Feb 28, 2023	ObjectSegmentation	—Unverified	0
Video-SwinUNet: Spatio-temporal Deep Learning Framework for VFSS Instance Segmentation	Feb 22, 2023	DecoderImage Segmentation	CodeCode Available	1
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation	Feb 14, 2023	DecoderImage Segmentation	CodeCode Available	1
Self-Supervised Unseen Object Instance Segmentation via Long-Term Robot Interaction	Feb 7, 2023	Instance SegmentationMulti-Object Tracking	CodeCode Available	1
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes	Feb 3, 2023	ObjectSegmentation	CodeCode Available	2
Audio-Visual Segmentation with Semantics	Jan 30, 2023	SegmentationSemantic Segmentation	CodeCode Available	2
Approximating DTW with a convolutional neural network on EEG data	Jan 30, 2023	Anomaly DetectionComputational Efficiency	—Unverified	0
Maximal Cliques on Multi-Frame Proposal Graph for Unsupervised Video Object Segmentation	Jan 29, 2023	Instance SegmentationObject	—Unverified	0
Flow-guided Semi-supervised Video Object Segmentation	Jan 25, 2023	DecoderObject	—Unverified	0
A Comprehensive Review of Modern Object Segmentation Approaches	Jan 13, 2023	Image SegmentationObject	—Unverified	0
Video Semantic Segmentation with Inter-Frame Feature Fusion and Inner-Frame Feature Refinement	Jan 10, 2023	Optical Flow EstimationSemantic Segmentation	CodeCode Available	0

Show:10 25 50

← PrevPage 15 of 36Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified