SOTAVerified

Semi-Supervised Video Object Segmentation

In the semi-supervised setting, the user supplies a full mask for each object of interest in the first frame of a video sequence. Methods must then produce the segmentation mask for each of those objects in all subsequent frames.
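The protocol described above can be sketched as a simple loop. This is only an illustration of the task interface, not any listed method: the names `segment_video` and `copy_previous_mask` are hypothetical, frames and masks are modelled as plain nested lists, and the "propagation" step is a trivial baseline that assumes the object never moves.

```python
from typing import Callable, List

Mask = List[List[int]]   # binary mask, 1 = object pixel (assumed representation)
Frame = List[List[int]]  # grayscale frame, a stand-in for real image data


def segment_video(
    frames: List[Frame],
    first_frame_mask: Mask,
    propagate: Callable[[Frame, Mask, Frame], Mask],
) -> List[Mask]:
    """Semi-supervised protocol: the mask for frame 0 is given,
    masks for frames 1..N-1 are predicted one frame at a time."""
    if not frames:
        return []
    masks = [first_frame_mask]
    for t in range(1, len(frames)):
        # Each step sees the previous frame, its mask, and the current frame.
        masks.append(propagate(frames[t - 1], masks[-1], frames[t]))
    return masks


def copy_previous_mask(prev_frame: Frame, prev_mask: Mask, frame: Frame) -> Mask:
    """Hypothetical trivial baseline: assume the object does not move."""
    return [row[:] for row in prev_mask]


if __name__ == "__main__":
    video = [[[0, 0], [0, 0]] for _ in range(3)]  # three dummy 2x2 frames
    given = [[1, 0], [0, 0]]                      # user-provided first-frame mask
    print(segment_video(video, given, copy_previous_mask))
```

A real method would replace `copy_previous_mask` with a learned model, often one that also keeps a memory of earlier frames rather than only the previous one.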

Papers

Showing 1-25 of 147 papers

| Title | Status | Hype |
|---|---|---|
| THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation | | 0 |
| Exploring Enhanced Contextual Information for Video-Level Object Tracking | Code | 2 |
| A Distractor-Aware Memory for Visual Object Tracking with SAM2 | Code | 3 |
| LiVOS: Light Video Object Segmentation with Gated Linear Matching | Code | 1 |
| Memory Matching is not Enough: Jointly Improving Memory Matching and Decoding for Video Object Segmentation | | 0 |
| SAM 2: Segment Anything in Images and Videos | Code | 11 |
| Global Motion Understanding in Large-Scale Video Object Segmentation | | 0 |
| Spatial-Temporal Multi-level Association for Video Object Segmentation | | 0 |
| Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Code | 2 |
| Video Object Segmentation with Dynamic Query Modulation | Code | 1 |
| Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything | Code | 1 |
| Lester: rotoscope animation through video object segmentation and tracking | Code | 1 |
| ODTrack: Online Dense Temporal Token Learning for Visual Tracking | Code | 2 |
| SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution | | 0 |
| Putting the Object Back into Video Object Segmentation | Code | 3 |
| Sub-token ViT Embedding via Stochastic Resonance Transformers | Code | 0 |
| Memory-Efficient Continual Learning Object Segmentation for Long Video | | 0 |
| Tracking Anything with Decoupled Video Segmentation | Code | 3 |
| XMem++: Production-level Video Segmentation From Few Annotated Frames | Code | 2 |
| Tracking Anything in High Quality | Code | 2 |
| Hierarchical Spatiotemporal Transformers for Video Object Segmentation | | 0 |
| ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation | | 0 |
| TrickVOS: A Bag of Tricks for Video Object Segmentation | | 0 |
| READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation | Code | 0 |
| Video Object Segmentation in Panoptic Wild Scenes | Code | 2 |

Benchmark Results

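| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SAM2 | J&F | 90.7 | | Unverified |
| 2 | Cutie+ (base) | J&F | 90.5 | | Unverified |
| 3 | ISVOS (BL30K, MS) | J&F | 89.8 | | Unverified |
| 4 | XMem (BL30K, MS) | J&F | 89.5 | | Unverified |
| 5 | ISVOS (MS) | J&F | 88.6 | | Unverified |
| 6 | ISVOS (BL30K) | J&F | 88.2 | | Unverified |
| 7 | XMem (MS) | J&F | 88.2 | | Unverified |
| 8 | Cutie+ (base, MEGA) | J&F | 88.1 | | Unverified |
| 9 | JIMD | J&F | 88.1 | | Unverified |
| 10 | Cutie (base) | J&F | 87.9 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SwinB-AOST (L'=3, MS) | J&F | 93 | | Unverified |
| 2 | SwinB-AOTv2-L (MS) | J&F | 93 | | Unverified |
| 3 | SwinB-DeAOT-L | J&F | 92.9 | | Unverified |
| 4 | XMem (MS) | J&F | 92.7 | | Unverified |
| 5 | SwinB-AOTv2-L | J&F | 92.4 | | Unverified |
| 6 | SwinB-AOST (L'=3) | J&F | 92.4 | | Unverified |
| 7 | R50-DeAOT-L | J&F | 92.3 | | Unverified |
| 8 | R50-AOST (L'=3) | J&F | 92.1 | | Unverified |
| 9 | QDMN | J&F | 92 | | Unverified |
| 10 | DeAOT-L | J&F | 92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Cutie+ (base, MEGA) | J&F | 88.1 | | Unverified |
| 2 | Cutie (base, MEGA) | J&F | 86.1 | | Unverified |
| 3 | Cutie+ (base) | J&F | 85.9 | | Unverified |
| 4 | SwinB-AOST (L'=3, MS) | J&F | 84.7 | | Unverified |
| 5 | SwinB-AOTv2-L | J&F | 84.5 | | Unverified |
| 6 | JIMD-R50 | J&F | 83.9 | | Unverified |
| 7 | XMem (BL30K, MS) | J&F | 83.7 | | Unverified |
| 8 | DEVA | J&F | 83.2 | | Unverified |
| 9 | XMem (MS) | J&F | 83.1 | | Unverified |
| 10 | SwinB-DeAOT-L | J&F | 82.8 | | Unverified |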
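
The page does not define the J&F metric used in these tables. The sketch below assumes the common DAVIS-style convention: J is region similarity (intersection over union of predicted and ground-truth masks), F is a boundary F-measure, and J&F is the mean of the two. Only J is computed here; F requires boundary matching and is taken as a precomputed input. The function names `jaccard` and `j_and_f` are hypothetical, and masks are modelled as nested lists of 0/1.

```python
from typing import List, Sequence

Mask = List[List[int]]  # binary mask, 1 = object pixel (assumed representation)


def jaccard(pred: Mask, gt: Mask) -> float:
    """Region similarity J: intersection over union of two binary masks."""
    inter = 0
    union = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            if p and g:
                inter += 1
            if p or g:
                union += 1
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union


def j_and_f(j_scores: Sequence[float], f_scores: Sequence[float]) -> float:
    """J&F: average of mean J and mean F, on the same scale as the inputs."""
    mean_j = sum(j_scores) / len(j_scores)
    mean_f = sum(f_scores) / len(f_scores)
    return (mean_j + mean_f) / 2


if __name__ == "__main__":
    pred = [[1, 1], [0, 0]]
    gt = [[1, 0], [0, 0]]
    print(jaccard(pred, gt))  # one shared pixel out of two covered pixels
```

The tables report scores on a 0-100 scale, so a fraction returned by `j_and_f` would be multiplied by 100 to compare against them.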