Video Semantic Segmentation

The goal of video semantic segmentation is to assign a predefined class to each pixel in all frames of a video. This requires the model not only to predict accurate segmentation masks but also to ensure that these masks remain temporally consistent across frames. This task has broad applications in areas such as autonomous driving, medical video analysis, and AR/VR.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 351–400 of 895 papers

Title	Date	Tasks	Status	Hype
Spatio-Temporal Pixel-Level Contrastive Learning-based Source-Free Domain Adaptation for Video Semantic Segmentation	Mar 25, 2023	Contrastive LearningDomain Adaptation	CodeCode Available	0
CrOC: Cross-View Online Clustering for Dense Visual Representation Learning	Mar 23, 2023	ClusteringOnline Clustering	CodeCode Available	1
Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation	Mar 22, 2023	Contrastive LearningSegmentation	CodeCode Available	1
Two-shot Video Object Segmentation	Mar 21, 2023	ObjectPseudo Label	CodeCode Available	1
Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation	Mar 18, 2023	ObjectOptical Flow Estimation	CodeCode Available	1
Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation	Mar 17, 2023	SegmentationSelf-Supervised Learning	CodeCode Available	0
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation	Mar 16, 2023	Knowledge DistillationOpen Vocabulary Semantic Segmentation	CodeCode Available	1
Guided Slot Attention for Unsupervised Video Object Segmentation	Mar 15, 2023	ObjectSemantic Segmentation	CodeCode Available	1
InstMove: Instance Motion for Object-centric Video Segmentation	Mar 14, 2023	ObjectOptical Flow Estimation	CodeCode Available	2
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation	Mar 14, 2023	Contrastive LearningKnowledge Distillation	—Unverified	0
Efficient Semantic Segmentation by Altering Resolutions for Compressed Videos	Mar 13, 2023	SegmentationSemantic Segmentation	CodeCode Available	1
Tsanet: Temporal and Scale Alignment for Unsupervised Video Object Segmentation	Mar 8, 2023	DecoderObject	—Unverified	0
A Threefold Review on Deep Semantic Segmentation: Efficiency-oriented, Temporal and Depth-aware design	Mar 8, 2023	Autonomous DrivingAutonomous Vehicles	—Unverified	0
Learning to Adapt to Online Streams with Distribution Shifts	Mar 2, 2023	BenchmarkingMeta-Learning	—Unverified	0
One-Shot Video Inpainting	Feb 28, 2023	ObjectSegmentation	—Unverified	0
Video-SwinUNet: Spatio-temporal Deep Learning Framework for VFSS Instance Segmentation	Feb 22, 2023	DecoderImage Segmentation	CodeCode Available	1
PolyFormer: Referring Image Segmentation as Sequential Polygon Generation	Feb 14, 2023	DecoderImage Segmentation	CodeCode Available	1
Self-Supervised Unseen Object Instance Segmentation via Long-Term Robot Interaction	Feb 7, 2023	Instance SegmentationMulti-Object Tracking	CodeCode Available	1
MOSE: A New Dataset for Video Object Segmentation in Complex Scenes	Feb 3, 2023	ObjectSegmentation	CodeCode Available	2
Audio-Visual Segmentation with Semantics	Jan 30, 2023	SegmentationSemantic Segmentation	CodeCode Available	2
Approximating DTW with a convolutional neural network on EEG data	Jan 30, 2023	Anomaly DetectionComputational Efficiency	—Unverified	0
Maximal Cliques on Multi-Frame Proposal Graph for Unsupervised Video Object Segmentation	Jan 29, 2023	Instance SegmentationObject	—Unverified	0
Flow-guided Semi-supervised Video Object Segmentation	Jan 25, 2023	DecoderObject	—Unverified	0
A Comprehensive Review of Modern Object Segmentation Approaches	Jan 13, 2023	Image SegmentationObject	—Unverified	0
Video Semantic Segmentation with Inter-Frame Feature Fusion and Inner-Frame Feature Refinement	Jan 10, 2023	Optical Flow EstimationSemantic Segmentation	CodeCode Available	0
TarViS: A Unified Approach for Target-based Video Segmentation	Jan 6, 2023	Instance SegmentationPanoptic Segmentation	CodeCode Available	1
Object Segmentation with Audio Context	Jan 4, 2023	audio-visual learningDecoder	—Unverified	0
SegGPT: Towards Segmenting Everything in Context	Jan 1, 2023	Few-Shot Semantic SegmentationIn-Context Learning	—Unverified	0
Unsupervised Video Object Segmentation with Online Adversarial Self-Tuning	Jan 1, 2023	ObjectPseudo Label	—Unverified	0
Video State-Changing Object Segmentation	Jan 1, 2023	ObjectRepresentation Learning	CodeCode Available	0
Robust Referring Video Object Segmentation with Cyclic Structural Consensus	Jan 1, 2023	ObjectReferring Video Object Segmentation	—Unverified	0
HTML: Hybrid Temporal-scale Multimodal Learning Framework for Referring Video Object Segmentation	Jan 1, 2023	multimodal interactionObject	—Unverified	0
Video Object Segmentation-aware Video Frame Interpolation	Jan 1, 2023	ObjectPose Estimation	CodeCode Available	1
Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation	Jan 1, 2023	RetrievalSemantic Segmentation	—Unverified	0
Segment Every Reference Object in Spatial and Temporal Spaces	Jan 1, 2023	Image SegmentationObject	—Unverified	0
End-to-End Video Matting With Trimap Propagation	Jan 1, 2023	Image MattingSegmentation	CodeCode Available	1
Multispectral Video Semantic Segmentation: A Benchmark Dataset and Baseline	Jan 1, 2023	SegmentationSemantic Segmentation	CodeCode Available	1
Simultaneously Short- and Long-Term Temporal Modeling for Semi-Supervised Video Semantic Segmentation	Jan 1, 2023	Pseudo LabelSemantic Segmentation	—Unverified	0
Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation	Jan 1, 2023	Instance SegmentationMulti-Object Tracking	CodeCode Available	1
NewsNet: A Novel Dataset for Hierarchical Temporal Segmentation	Jan 1, 2023	Video SegmentationVideo Semantic Segmentation	CodeCode Available	0
A Class-wise Non-salient Region Generalized Framework for Video Semantic Segmentation	Dec 29, 2022	Domain GeneralizationSegmentation	—Unverified	0
1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation	Dec 27, 2022	ObjectReferring Video Object Segmentation	CodeCode Available	1
Video Segmentation Learning Using Cascade Residual Convolutional Neural Network	Dec 20, 2022	Action RecognitionAnomaly Detection	—Unverified	0
Improving Unsupervised Video Object Segmentation with Motion-Appearance Synergy	Dec 17, 2022	MisconceptionsObject	—Unverified	0
Learning a Fast 3D Spectral Approach to Object Segmentation and Tracking over Space and Time	Dec 15, 2022	ClusteringGPU	—Unverified	0
Look Before You Match: Instance Understanding Matters in Video Object Segmentation	Dec 13, 2022	Instance SegmentationSegmentation	—Unverified	0
Breaking the "Object" in Video Object Segmentation	Dec 12, 2022	ObjectSemantic Segmentation	—Unverified	0
Tencent AVS: A Holistic Ads Video Dataset for Multi-modal Scene Segmentation	Dec 9, 2022	Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION	—Unverified	0
Video Object of Interest Segmentation	Dec 6, 2022	DecoderObject	—Unverified	0
Learning to Learn Better for Video Object Segmentation	Dec 5, 2022	Inductive LearningObject	CodeCode Available	1

Show:10 25 50

← PrevPage 8 of 18Next →

All datasets Cityscapes val CamVid VSPW LaRS Multispectral Video Semantic Segmentation

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	mIoU	80.3	—	Unverified
2	TDNet-50 [9]	mIoU	79.9	—	Unverified
3	DeltaDist-DDRNet-39	mIoU	79.9	—	Unverified
4	PSPNet-101 [20]	mIoU	79.7	—	Unverified
5	PSPNet-50 [20]	mIoU	78.1	—	Unverified
6	LVS [12]	mIoU	76.8	—	Unverified
7	GRFP [15]	mIoU	73.6	—	Unverified
8	FCN-50 [14]	mIoU	70.1	—	Unverified
9	DFF [22]	mIoU	69.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	TMANet-50	Mean IoU	76.5	—	Unverified
2	ETC-MobileNet	Mean IoU	76.3	—	Unverified
3	TDNet-50	Mean IoU	76.2	—	Unverified
4	PSPNet-50	Mean IoU	76	—	Unverified
5	Netwarp	Mean IoU	74.7	—	Unverified
6	GRFP	Mean IoU	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DVIS++(VIT-L)	mIoU	63.8	—	Unverified
2	UniVS(Swin-L)	mIoU	59.8	—	Unverified
3	Tube-Link(Swin-large)	mIoU	59.6	—	Unverified
4	MRCFA(MiT-B5)	mIoU	49.9	—	Unverified
5	CFFM(MiT-B5)	mIoU	49.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	WaSR-T (ResNet-101)	Q	60.1	—	Unverified
2	TMANet (ResNet-50)	Q	57.5	—	Unverified
3	CSANet (ResNet-101)	Q	49.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MVNet(DeepLabV3)	mIoU	54.52	—	Unverified
2	MVNet(PSPNet)	mIoU	54.36	—	Unverified
3	MVNet(FCN)	mIoU	53.9	—	Unverified