Semi-Supervised Video Object Segmentation

In the semi-supervised setting, the user provides a full mask of the object(s) of interest in the first frame of a video sequence. Methods must then produce a segmentation mask for each of those objects in all subsequent frames.
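To make the input/output contract of the task concrete, here is a minimal sketch of a deliberately naive baseline that reuses the first-frame mask for every later frame. It is not any of the methods listed on this page; the function name, array shapes, and label convention (0 for background, positive integers for object IDs) are assumptions made for illustration.

```python
import numpy as np


def propagate_first_frame_mask(frames: np.ndarray, first_mask: np.ndarray) -> np.ndarray:
    """Naive baseline: copy the first-frame mask to every subsequent frame.

    frames:     (T, H, W, 3) video frames (assumed layout).
    first_mask: (H, W) integer labels, 0 = background, k > 0 = object k (assumed).
    Returns:    (T - 1, H, W) predicted masks for frames 1..T-1.
    """
    if frames.ndim != 4:
        raise ValueError("frames must have shape (T, H, W, C)")
    if frames.shape[1:3] != first_mask.shape:
        raise ValueError("first_mask must match the frame height and width")
    num_later_frames = frames.shape[0] - 1
    # A real method would use the frame content; this baseline ignores it.
    return np.repeat(first_mask[None, ...], num_later_frames, axis=0)


# Toy usage: a 4-frame video with one object annotated in frame 0.
video = np.zeros((4, 5, 6, 3), dtype=np.uint8)
mask0 = np.zeros((5, 6), dtype=np.int64)
mask0[1:3, 2:4] = 1
predictions = propagate_first_frame_mask(video, mask0)
```

Any actual method replaces the body of this function with a model that conditions on both the first-frame mask and the content of each later frame; the shapes of the inputs and outputs stay the same.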

Papers

Showing 51–100 of 147 papers

Title	Date	Tasks	Status	Hype
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration	Oct 13, 2020	Object, One-shot visual object segmentation	Code Available	1
Kernelized Memory Network for Video Object Segmentation	Jul 16, 2020	Object, Semantic Segmentation	Code Available	1
Fast Template Matching and Update for Video Object Tracking and Segmentation	Apr 16, 2020	Object Tracking, reinforcement-learning	Code Available	1
A Transductive Approach for Video Object Segmentation	Apr 15, 2020	Instance Segmentation, Object	Code Available	1
Learning What to Learn for Video Object Segmentation	Mar 25, 2020	Few-Shot Learning, Object	Code Available	1
Collaborative Video Object Segmentation by Foreground-Background Integration	Mar 18, 2020	Object, One-shot visual object segmentation	Code Available	1
Learning Video Object Segmentation from Unlabeled Videos	Mar 10, 2020	Object, Representation Learning	Code Available	1
State-Aware Tracker for Real-Time Video Object Segmentation	Mar 1, 2020	Segmentation, Semantic Segmentation	Code Available	1
Learning Fast and Robust Target Models for Video Object Segmentation	Feb 27, 2020	One-shot visual object segmentation, Segmentation	Code Available	1
MAST: A Memory-Augmented Self-supervised Tracker	Feb 18, 2020	Semantic Segmentation, Semi-Supervised Video Object Segmentation	Code Available	1
Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation	Feb 17, 2020	GPU, One-shot visual object segmentation	Code Available	1
Fast Video Object Segmentation using the Global Context Module	Jan 30, 2020	Object, Segmentation	Code Available	1
UnOVOST: Unsupervised Offline Video Object Segmentation and Tracking	Jan 15, 2020	Object, Segmentation	Code Available	1
Video Object Segmentation using Space-Time Memory Networks	Apr 1, 2019	Interactive Video Object Segmentation, Object	Code Available	1
YouTube-VOS: Sequence-to-Sequence Video Object Segmentation	Sep 3, 2018	Image Segmentation, Object	Code Available	1
One-Shot Video Object Segmentation	Nov 16, 2016	Foreground Segmentation, Object	Code Available	1
THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation	Jun 7, 2025	Segmentation, Semantic Segmentation	Unverified	0
Memory Matching is not Enough: Jointly Improving Memory Matching and Decoding for Video Object Segmentation	Sep 22, 2024	Semantic Segmentation, Semi-Supervised Video Object Segmentation	Unverified	0
Global Motion Understanding in Large-Scale Video Object Segmentation	May 11, 2024	Instance Segmentation, Optical Flow Estimation	Unverified	0
Spatial-Temporal Multi-level Association for Video Object Segmentation	Apr 9, 2024	Object, Segmentation	Unverified	0
SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution	Oct 23, 2023	Object, Semantic Segmentation	Unverified	0
Sub-token ViT Embedding via Stochastic Resonance Transformers	Oct 6, 2023	Depth Estimation, Depth Prediction	Code Available	0
Memory-Efficient Continual Learning Object Segmentation for Long Video	Sep 26, 2023	Continual Learning, Object	Unverified	0
Hierarchical Spatiotemporal Transformers for Video Object Segmentation	Jul 17, 2023	Inductive Bias, Object	Unverified	0
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation	Jul 5, 2023	Object, Position	Unverified	0
TrickVOS: A Bag of Tricks for Video Object Segmentation	Jun 27, 2023	Decoder, Object	Unverified	0
READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation	May 22, 2023	Semantic Segmentation, Semi-Supervised Video Object Segmentation	Code Available	0
Robust and Efficient Memory Network for Video Object Segmentation	Apr 24, 2023	Object, Semantic Segmentation	Unverified	0
CLVOS23: A Long Video Object Segmentation Dataset for Continual Learning	Apr 9, 2023	Continual Learning, Semantic Segmentation	Code Available	0
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation	Mar 14, 2023	Contrastive Learning, Knowledge Distillation	Unverified	0
Flow-guided Semi-supervised Video Object Segmentation	Jan 25, 2023	Decoder, Object	Unverified	0
Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation	Jan 1, 2023	Retrieval, Semantic Segmentation	Unverified	0
Look Before You Match: Instance Understanding Matters in Video Object Segmentation	Dec 13, 2022	Instance Segmentation, Segmentation	Unverified	0
Pixel-Level Equalized Matching for Video Object Segmentation	Sep 4, 2022	Object, Semantic Segmentation	Unverified	0
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation	Aug 1, 2022	Object, Optical Flow Estimation	Unverified	0
Region Aware Video Object Segmentation with Deep Motion Modeling	Jul 21, 2022	Decoder, Object	Unverified	0
The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge--Track 3: Referring Video Object Segmentation	Jun 24, 2022	Object, object-detection	Unverified	0
Collaborative Attention Memory Network for Video Object Segmentation	May 17, 2022	Object, Segmentation	Unverified	0
Boosting Video Object Segmentation based on Scale Inconsistency	May 2, 2022	Object, Semantic Segmentation	Code Available	0
Adaptive Memory Management for Video Object Segmentation	Apr 13, 2022	Management, Object	Code Available	0
Siamese Network with Interactive Transformer for Video Object Segmentation	Dec 28, 2021	Decoder, Object	Code Available	0
MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation	Nov 29, 2021	Object, Semantic Segmentation	Unverified	0
FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object Segmentation	Nov 20, 2021	Object, Optical Flow Estimation	Unverified	0
DAVOS: Semi-Supervised Video Object Segmentation via Adversarial Domain Adaptation	May 21, 2021	Domain Adaptation, Semantic Segmentation	Unverified	0
Learning Position and Target Consistency for Memory-based Video Object Segmentation	Apr 9, 2021	Object, One-shot visual object segmentation	Unverified	0
Separable Structure Modeling for Semi-supervised Video Object Segmentation	Feb 18, 2021	Object, One-shot visual object segmentation	Code Available	0
Video Object Segmentation With Dynamic Memory Networks and Adaptive Object Alignment	Jan 1, 2021	Object, Semantic Segmentation	Code Available	0
Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation	Dec 10, 2020	Graph Neural Network, Object	Unverified	0
PMVOS: Pixel-Level Matching-Based Video Object Segmentation	Sep 18, 2020	Object, One-shot visual object segmentation	Unverified	0
LSMVOS: Long-Short-Term Similarity Matching for Video Object	Sep 2, 2020	Object, Optical Flow Estimation	Code Available	0

Datasets: DAVIS 2017 (val), DAVIS 2016, DAVIS-2017 (test-dev), YouTube-VOS 2018, DAVIS (no YouTube-VOS training), YouTube-VOS 2019, VOT2020, MOSE, Long Video Dataset, YouTube, DAVIS 2017, BURST-test

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SAM2	J&F	90.7	—	Unverified
2	Cutie+ (base)	J&F	90.5	—	Unverified
3	ISVOS (BL30K, MS)	J&F	89.8	—	Unverified
4	XMem (BL30K, MS)	J&F	89.5	—	Unverified
5	ISVOS (MS)	J&F	88.6	—	Unverified
6	ISVOS (BL30K)	J&F	88.2	—	Unverified
7	XMem (MS)	J&F	88.2	—	Unverified
8	Cutie+ (base, MEGA)	J&F	88.1	—	Unverified
9	JIMD	J&F	88.1	—	Unverified
10	Cutie (base)	J&F	87.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SwinB-AOST (L'=3, MS)	J&F	93	—	Unverified
2	SwinB-AOTv2-L (MS)	J&F	93	—	Unverified
3	SwinB-DeAOT-L	J&F	92.9	—	Unverified
4	XMem (MS)	J&F	92.7	—	Unverified
5	SwinB-AOTv2-L	J&F	92.4	—	Unverified
6	SwinB-AOST (L'=3)	J&F	92.4	—	Unverified
7	R50-DeAOT-L	J&F	92.3	—	Unverified
8	R50-AOST (L'=3)	J&F	92.1	—	Unverified
9	R50-AOST (L'=2)	J&F	92	—	Unverified
10	DeAOT-L	J&F	92	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Cutie+ (base, MEGA)	J&F	88.1	—	Unverified
2	Cutie (base, MEGA)	J&F	86.1	—	Unverified
3	Cutie+ (base)	J&F	85.9	—	Unverified
4	SwinB-AOST (L'=3, MS)	J&F	84.7	—	Unverified
5	SwinB-AOTv2-L	J&F	84.5	—	Unverified
6	JIMD-R50	J&F	83.9	—	Unverified
7	XMem (BL30K, MS)	J&F	83.7	—	Unverified
8	DEVA	J&F	83.2	—	Unverified
9	XMem (MS)	J&F	83.1	—	Unverified
10	SwinB-DeAOT-L	J&F	82.8	—	Unverified
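
The J&F metric in the tables above follows the DAVIS benchmark convention: it is the mean of region similarity J (the Jaccard index, or intersection over union, between predicted and ground-truth masks) and contour accuracy F (a boundary F-measure). The sketch below computes J for a single binary mask pair and averages already-computed J and F scores. The function names are hypothetical, the boundary-based F computation is omitted, and scoring two empty masks as 1.0 is an assumed convention.

```python
import numpy as np


def region_similarity(pred: np.ndarray, gt: np.ndarray) -> float:
    """Jaccard index (IoU) between two binary masks of the same shape."""
    pred = np.asarray(pred, dtype=bool)
    gt = np.asarray(gt, dtype=bool)
    if pred.shape != gt.shape:
        raise ValueError("pred and gt must have the same shape")
    union = np.logical_or(pred, gt).sum()
    if union == 0:
        # Assumed convention: two empty masks count as a perfect match.
        return 1.0
    intersection = np.logical_and(pred, gt).sum()
    return float(intersection / union)


def j_and_f(j_mean: float, f_mean: float) -> float:
    """Average of mean region similarity and mean contour accuracy."""
    return (j_mean + f_mean) / 2.0


# Toy usage: the prediction covers two pixels, one of which is correct.
pred_mask = np.array([[1, 1], [0, 0]])
gt_mask = np.array([[1, 0], [0, 0]])
j_score = region_similarity(pred_mask, gt_mask)
```

In a full evaluation, J and F are computed per object and per frame, then averaged over the dataset before being combined.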