SOTAVerified

Semi-Supervised Video Object Segmentation

The semi-supervised scenario assumes the user inputs a full mask of the object(s) of interest in the first frame of a video sequence. Methods have to produce the segmentation mask for that object(s) in the subsequent frames.

Papers

Showing 51100 of 147 papers

TitleStatusHype
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background IntegrationCode1
Kernelized Memory Network for Video Object SegmentationCode1
Fast Template Matching and Update for Video Object Tracking and SegmentationCode1
A Transductive Approach for Video Object SegmentationCode1
Learning What to Learn for Video Object SegmentationCode1
Collaborative Video Object Segmentation by Foreground-Background IntegrationCode1
Learning Video Object Segmentation from Unlabeled VideosCode1
State-Aware Tracker for Real-Time Video Object SegmentationCode1
Learning Fast and Robust Target Models for Video Object SegmentationCode1
MAST: A Memory-Augmented Self-supervised TrackerCode1
Directional Deep Embedding and Appearance Learning for Fast Video Object SegmentationCode1
Fast Video Object Segmentation using the Global Context ModuleCode1
UnOVOST: Unsupervised Offline Video Object Segmentation and TrackingCode1
Video Object Segmentation using Space-Time Memory NetworksCode1
YouTube-VOS: Sequence-to-Sequence Video Object SegmentationCode1
One-Shot Video Object SegmentationCode1
THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation0
Memory Matching is not Enough: Jointly Improving Memory Matching and Decoding for Video Object Segmentation0
Global Motion Understanding in Large-Scale Video Object Segmentation0
Spatial-Temporal Multi-level Association for Video Object Segmentation0
SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution0
Sub-token ViT Embedding via Stochastic Resonance TransformersCode0
Memory-Efficient Continual Learning Object Segmentation for Long Video0
Hierarchical Spatiotemporal Transformers for Video Object Segmentation0
ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation0
TrickVOS: A Bag of Tricks for Video Object Segmentation0
READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object SegmentationCode0
Robust and Efficient Memory Network for Video Object Segmentation0
CLVOS23: A Long Video Object Segmentation Dataset for Continual LearningCode0
MobileVOS: Real-Time Video Object Segmentation Contrastive Learning meets Knowledge Distillation0
Flow-guided Semi-supervised Video Object Segmentation0
Alignment Before Aggregation: Trajectory Memory Retrieval Network for Video Object Segmentation0
Look Before You Match: Instance Understanding Matters in Video Object Segmentation0
Pixel-Level Equalized Matching for Video Object Segmentation0
BATMAN: Bilateral Attention Transformer in Motion-Appearance Neighboring Space for Video Object Segmentation0
Region Aware Video Object Segmentation with Deep Motion Modeling0
The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge--Track 3: Referring Video Object Segmentation0
Collaborative Attention Memory Network for Video Object Segmentation0
Boosting Video Object Segmentation based on Scale InconsistencyCode0
Adaptive Memory Management for Video Object SegmentationCode0
Siamese Network with Interactive Transformer for Video Object SegmentationCode0
MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation0
FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object Segmentation0
DAVOS: Semi-Supervised Video Object Segmentation via Adversarial Domain Adaptation0
Learning Position and Target Consistency for Memory-based Video Object Segmentation0
Separable Structure Modeling for Semi-supervised Video Object SegmentationCode0
Video Object Segmentation With Dynamic Memory Networks and Adaptive Object AlignmentCode0
Spatiotemporal Graph Neural Network based Mask Reconstruction for Video Object Segmentation0
PMVOS: Pixel-Level Matching-Based Video Object Segmentation0
LSMVOS: Long-Short-Term Similarity Matching for Video ObjectCode0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SAM2J&F90.7Unverified
2Cutie+ (base)J&F90.5Unverified
3ISVOS (BL30K, MS)J&F89.8Unverified
4XMem (BL30K, MS)J&F89.5Unverified
5ISVOS (MS)J&F88.6Unverified
6ISVOS (BL30K)J&F88.2Unverified
7XMem (MS)J&F88.2Unverified
8Cutie+ (base, MEGA)J&F88.1Unverified
9JIMDJ&F88.1Unverified
10Cutie (base)J&F87.9Unverified
#ModelMetricClaimedVerifiedStatus
1SwinB-AOST (L'=3, MS)J&F93Unverified
2SwinB-AOTv2-L (MS)J&F93Unverified
3SwinB-DeAOT-LJ&F92.9Unverified
4XMem (MS)J&F92.7Unverified
5SwinB-AOTv2-LJ&F92.4Unverified
6SwinB-AOST (L'=3)J&F92.4Unverified
7R50-DeAOT-LJ&F92.3Unverified
8R50-AOST (L'=3)J&F92.1Unverified
9QDMNJ&F92Unverified
10DeAOT-LJ&F92Unverified
#ModelMetricClaimedVerifiedStatus
1Cutie+ (base, MEGA)J&F88.1Unverified
2Cutie (base, MEGA)J&F86.1Unverified
3Cutie+ (base)J&F85.9Unverified
4SwinB-AOST (L'=3, MS)J&F84.7Unverified
5SwinB-AOTv2-LJ&F84.5Unverified
6JIMD-R50J&F83.9Unverified
7XMem (BL30K, MS)J&F83.7Unverified
8DEVAJ&F83.2Unverified
9XMem (MS)J&F83.1Unverified
10SwinB-DeAOT-LJ&F82.8Unverified