Semi-Supervised Video Object Segmentation

The semi-supervised scenario assumes the user provides a full mask of the object(s) of interest in the first frame of a video sequence. Methods must then produce the segmentation mask for the same object(s) in every subsequent frame.
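The input/output contract described above can be sketched with a deliberately trivial baseline that copies the first-frame mask to every frame. This is only an illustration of the task interface, not any listed paper's method; the function name `propagate_first_frame_mask` and the list-of-lists mask encoding (0 for background, k > 0 for object k) are assumptions made for this example.

```python
from typing import List

# Hypothetical mask encoding: an H x W grid where 0 is background
# and a positive integer k marks pixels belonging to object k.
Mask = List[List[int]]


def propagate_first_frame_mask(num_frames: int, first_mask: Mask) -> List[Mask]:
    """Return one mask per frame by repeating the annotated first-frame mask.

    A real method would look at the frames and update the mask over time;
    this baseline ignores the video content and only shows the contract:
    one annotated frame in, one mask per frame out.
    """
    if num_frames < 1:
        raise ValueError("a video needs at least one frame")
    # Copy each row so the returned masks do not alias the input or each other.
    return [[row[:] for row in first_mask] for _ in range(num_frames)]


first_mask = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 1, 0],
]
masks = propagate_first_frame_mask(5, first_mask)
print(len(masks), len(masks[0]), len(masks[0][0]))
```

Frame 0 of the output is the user-provided annotation itself; the quality of a method is judged on the masks it predicts for the remaining frames.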

Papers


Showing 1–50 of 147 papers

Title	Date	Tasks	Status	Hype	Score
SAM 2: Segment Anything in Images and Videos	Aug 1, 2024	Image Segmentation, Robot Manipulation Generalization	Code Available	12	5
A Distractor-Aware Memory for Visual Object Tracking with SAM2	Nov 26, 2024	Object Tracking, Semi-Supervised Video Object Segmentation	Code Available	3	5
Putting the Object Back into Video Object Segmentation	Oct 19, 2023	Object, Segmentation	Code Available	3	5
XMem: Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model	Jul 14, 2022	2D Human Pose Estimation, 2D Object Detection	Code Available	3	5
Tracking Anything with Decoupled Video Segmentation	Sep 7, 2023	Open-Vocabulary Video Segmentation, Open-World Video Segmentation	Code Available	3	5
Fast Online Object Tracking and Segmentation: A Unifying Approach	Dec 12, 2018	Object, Object Tracking	Code Available	2	5
XMem++: Production-level Video Segmentation From Few Annotated Frames	Jul 29, 2023	Segmentation, Semantic Segmentation	Code Available	2	5
Decoupling Features in Hierarchical Propagation for Video Object Segmentation	Oct 18, 2022	Object, Semantic Segmentation	Code Available	2	5
MixFormer: End-to-End Tracking with Iterative Mixed Attention	Mar 21, 2022	Semi-Supervised Video Object Segmentation, Video Object Tracking	Code Available	2	5
Tracking Anything in High Quality	Jul 26, 2023	Object, Object Tracking	Code Available	2	5
ODTrack: Online Dense Temporal Token Learning for Visual Tracking	Jan 3, 2024	Semi-Supervised Video Object Segmentation, Video Object Tracking	Code Available	2	5
Efficient Video Object Segmentation via Modulated Cross-Attention Memory	Mar 26, 2024	GPU, Object	Code Available	2	5
Scalable Video Object Segmentation with Identification Mechanism	Mar 22, 2022	Object, Segmentation	Code Available	2	5
Exploring Enhanced Contextual Information for Video-Level Object Tracking	Dec 15, 2024	Object, Object Tracking	Code Available	2	5
Video Object Segmentation in Panoptic Wild Scenes	May 8, 2023	Object, Semantic Segmentation	Code Available	2	5
A Transductive Approach for Video Object Segmentation	Apr 15, 2020	Instance Segmentation, Object	Code Available	1	5
Associating Objects with Transformers for Video Object Segmentation	Jun 4, 2021	Object, One-shot visual object segmentation	Code Available	1	5
Learning to Learn Better for Video Object Segmentation	Dec 5, 2022	Inductive Learning, Object	Code Available	1	5
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object Segmentation	Jun 9, 2021	Semantic Segmentation, Semi-Supervised Video Object Segmentation	Code Available	1	5
Reliable Propagation-Correction Modulation for Video Object Segmentation	Dec 6, 2021	Object, Semantic Segmentation	Code Available	1	5
Self-Supervised Video Object Segmentation by Motion-Aware Mask Propagation	Jul 27, 2021	Segmentation, Semantic Segmentation	Code Available	1	5
SSTVOS: Sparse Spatiotemporal Transformers for Video Object Segmentation	Jan 21, 2021	Inductive Bias, Motion Segmentation	Code Available	1	5
Learning Quality-aware Dynamic Memory for Video Object Segmentation	Jul 16, 2022	Segmentation, Semantic Segmentation	Code Available	1	5
Recurrent Dynamic Embedding for Video Object Segmentation	May 8, 2022	Object, Semantic Segmentation	Code Available	1	5
Per-Clip Video Object Segmentation	Aug 3, 2022	Object, Segmentation	Code Available	1	5
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background Integration	Oct 13, 2020	Object, One-shot visual object segmentation	Code Available	1	5
Fast Template Matching and Update for Video Object Tracking and Segmentation	Apr 16, 2020	Object Tracking, reinforcement-learning	Code Available	1	5
Pixel-Level Bijective Matching for Video Object Segmentation	Oct 4, 2021	Object, Semantic Segmentation	Code Available	1	5
Delving into the Cyclic Mechanism in Semi-supervised Video Object Segmentation	Oct 23, 2020	Object, One-shot visual object segmentation	Code Available	1	5
Dense Unsupervised Learning for Video Segmentation	Nov 11, 2021	Segmentation, Semantic Segmentation	Code Available	1	5
MAST: A Memory-Augmented Self-supervised Tracker	Feb 18, 2020	Semantic Segmentation, Semi-Supervised Video Object Segmentation	Code Available	1	5
FAMINet: Learning Real-time Semi-supervised Video Object Segmentation with Steepest Optimized Optical Flow	Nov 20, 2021	Optical Flow Estimation, Segmentation	Code Available	1	5
Exploring the Semi-supervised Video Object Segmentation Problem from a Cyclic Perspective	Nov 2, 2021	Segmentation, Semantic Segmentation	Code Available	1	5
Alpha-Refine: Boosting Tracking Performance by Precise Bounding Box Estimation	Dec 12, 2020	Object Tracking, Semi-Supervised Video Object Segmentation	Code Available	1	5
Accelerating Video Object Segmentation with Compressed Video	Jul 26, 2021	Object, Segmentation	Code Available	1	5
Lester: rotoscope animation through video object segmentation and tracking	Feb 15, 2024	3D Human Pose Estimation, Object	Code Available	1	5
Hierarchical Memory Matching Network for Video Object Segmentation	Sep 23, 2021	Object, Retrieval	Code Available	1	5
Learning What to Learn for Video Object Segmentation	Mar 25, 2020	Few-Shot Learning, Object	Code Available	1	5
LiVOS: Light Video Object Segmentation with Gated Linear Matching	Nov 5, 2024	GPU, Semantic Segmentation	Code Available	1	5
Efficient Regional Memory Network for Video Object Segmentation	Mar 24, 2021	Object, One-shot visual object segmentation	Code Available	1	5
Kernelized Memory Network for Video Object Segmentation	Jul 16, 2020	Object, Semantic Segmentation	Code Available	1	5
Do Different Tracking Tasks Require Different Appearance Models?	Jul 5, 2021	Multi-Object Tracking, Multi-Object Tracking and Segmentation	Code Available	1	5
Joint Inductive and Transductive Learning for Video Object Segmentation	Aug 8, 2021	Inductive Learning, Object	Code Available	1	5
Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware Fusion	Mar 14, 2021	Interactive Video Object Segmentation, Semantic Segmentation	Code Available	1	5
Fast Video Object Segmentation using the Global Context Module	Jan 30, 2020	Object, Segmentation	Code Available	1	5
One-Shot Video Object Segmentation	Nov 16, 2016	Foreground Segmentation, Object	Code Available	1	5
Learning Dynamic Network Using a Reuse Gate Function in Semi-supervised Video Object Segmentation	Dec 21, 2020	One-shot visual object segmentation, Segmentation	Code Available	1	5
Learning Fast and Robust Target Models for Video Object Segmentation	Feb 27, 2020	One-shot visual object segmentation, Segmentation	Code Available	1	5
Collaborative Video Object Segmentation by Foreground-Background Integration	Mar 18, 2020	Object, One-shot visual object segmentation	Code Available	1	5
Directional Deep Embedding and Appearance Learning for Fast Video Object Segmentation	Feb 17, 2020	GPU, One-shot visual object segmentation	Code Available	1	5


All datasets DAVIS 2017 (val) DAVIS 2016 DAVIS-2017 (test-dev) YouTube-VOS 2018 DAVIS (no YouTube-VOS training) YouTube-VOS 2019 VOT2020 MOSE Long Video Dataset YouTube DAVIS 2017 BURST-test

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SAM2	J&F	90.7	—	Unverified
2	Cutie+ (base)	J&F	90.5	—	Unverified
3	ISVOS (BL30K, MS)	J&F	89.8	—	Unverified
4	XMem (BL30K, MS)	J&F	89.5	—	Unverified
5	ISVOS (MS)	J&F	88.6	—	Unverified
6	ISVOS (BL30K)	J&F	88.2	—	Unverified
7	XMem (MS)	J&F	88.2	—	Unverified
8	Cutie+ (base, MEGA)	J&F	88.1	—	Unverified
9	JIMD	J&F	88.1	—	Unverified
10	Cutie (base)	J&F	87.9	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SwinB-AOST (L'=3, MS)	J&F	93	—	Unverified
2	SwinB-AOTv2-L (MS)	J&F	93	—	Unverified
3	SwinB-DeAOT-L	J&F	92.9	—	Unverified
4	XMem (MS)	J&F	92.7	—	Unverified
5	SwinB-AOTv2-L	J&F	92.4	—	Unverified
6	SwinB-AOST (L'=3)	J&F	92.4	—	Unverified
7	R50-DeAOT-L	J&F	92.3	—	Unverified
8	R50-AOST (L'=3)	J&F	92.1	—	Unverified
9	QDMN	J&F	92	—	Unverified
10	DeAOT-L	J&F	92	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Cutie+ (base, MEGA)	J&F	88.1	—	Unverified
2	Cutie (base, MEGA)	J&F	86.1	—	Unverified
3	Cutie+ (base)	J&F	85.9	—	Unverified
4	SwinB-AOST (L'=3, MS)	J&F	84.7	—	Unverified
5	SwinB-AOTv2-L	J&F	84.5	—	Unverified
6	JIMD-R50	J&F	83.9	—	Unverified
7	XMem (BL30K, MS)	J&F	83.7	—	Unverified
8	DEVA	J&F	83.2	—	Unverified
9	XMem (MS)	J&F	83.1	—	Unverified
10	SwinB-DeAOT-L	J&F	82.8	—	Unverified