SOTAVerified

Semi-Supervised Video Object Segmentation

The semi-supervised scenario assumes the user inputs a full mask of the object(s) of interest in the first frame of a video sequence. Methods have to produce the segmentation mask for that object(s) in the subsequent frames.

Papers

Showing 110 of 147 papers

TitleStatusHype
THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation0
Exploring Enhanced Contextual Information for Video-Level Object TrackingCode2
A Distractor-Aware Memory for Visual Object Tracking with SAM2Code3
LiVOS: Light Video Object Segmentation with Gated Linear MatchingCode1
Memory Matching is not Enough: Jointly Improving Memory Matching and Decoding for Video Object Segmentation0
SAM 2: Segment Anything in Images and VideosCode11
Global Motion Understanding in Large-Scale Video Object Segmentation0
Spatial-Temporal Multi-level Association for Video Object Segmentation0
Efficient Video Object Segmentation via Modulated Cross-Attention MemoryCode2
Video Object Segmentation with Dynamic Query ModulationCode1
Show:102550
← PrevPage 1 of 15Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SAM2J&F77.9Unverified
2Cutie+ (base, MEGA)J&F71.7Unverified
3Cutie+ (small, MEGA)J&F70.3Unverified
4Cutie (base, MEGA)J&F69.9Unverified
5Cutie (small, MEGA)J&F68.6Unverified
6Cutie (base, with mose)J&F68.3Unverified
7Cutie (small, with mose)J&F67.4Unverified
8DEVA (with OVIS)J&F66.5Unverified
9Cutie (base)J&F64Unverified
10Cutie (small)J&F62.2Unverified