SOTAVerified

Video Object Segmentation

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.
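As a rough illustration of the binary-labeling view above, the sketch below treats one frame as a grid of 0/1 labels (1 = foreground, 0 = background) and scores a predicted mask against a reference mask with the Jaccard index (intersection over union), the region measure that appears in the benchmark tables on this page. The function name `jaccard`, the toy masks, and the empty-mask convention are assumptions made for illustration; actual benchmark protocols typically also average over frames and objects and may combine the result with a boundary F-measure.

```python
def jaccard(pred, truth):
    """Intersection over union of two binary masks (nested lists of 0/1)."""
    inter = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                inter += 1  # pixel is foreground in both masks
            if p or t:
                union += 1  # pixel is foreground in at least one mask
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union


# Hypothetical 3x4 frame: 1 = foreground object, 0 = background.
truth = [
    [0, 1, 1, 0],
    [0, 1, 1, 0],
    [0, 0, 0, 0],
]
pred = [
    [0, 1, 1, 0],
    [0, 1, 0, 0],
    [0, 0, 1, 0],
]

score = jaccard(pred, truth)  # 3 shared foreground pixels out of 5 in the union
print(f"Jaccard: {score:.2f}")
```

Leaderboards usually report this value as a percentage, so a per-frame score like the one above would be averaged over a sequence and multiplied by 100.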

For leaderboards, please refer to the individual subtasks.

Papers

Showing 501-550 of 551 papers

Title | Status | Hype
The 2018 DAVIS Challenge on Video Object Segmentation | | 0
The 2019 DAVIS Challenge on VOS: Unsupervised Multi-Object Segmentation | | 0
The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation | | 0
The Instance-centric Transformer for the RVOS Track of LSVOS Challenge: 3rd Place Solution | | 0
The Second Place Solution for The 4th Large-scale Video Object Segmentation Challenge--Track 3: Referring Video Object Segmentation | | 0
THU-Warwick Submission for EPIC-KITCHEN Challenge 2025: Semi-Supervised Video Object Segmentation | | 0
DyStaB: Unsupervised Object Segmentation via Dynamic-Static Bootstrapping | | 0
TokenCut: Segmenting Objects in Images and Videos with Self-supervised Transformer and Normalized Cut | | 0
Towards Good Practices for Video Object Segmentation | | 0
Track Anything Behind Everything: Zero-Shot Amodal Video Object Segmentation | | 0
Training-Free Robust Interactive Video Object Segmentation | | 0
TrickVOS: A Bag of Tricks for Video Object Segmentation | | 0
Tsanet: Temporal and Scale Alignment for Unsupervised Video Object Segmentation | | 0
Two-Stream Networks for Object Segmentation in Videos | | 0
Understanding Video Transformers via Universal Concept Discovery | | 0
U-Net Based Multi-instance Video Object Segmentation | | 0
UNINEXT-Cutie: The 1st Solution for LSVOS Challenge RVOS Track | | 0
Unsupervised RGBD Video Object Segmentation Using GANs | | 0
Unsupervised Video Object Segmentation using Motion Saliency-Guided Spatio-Temporal Propagation | | 0
Unsupervised Video Object Segmentation with Distractor-Aware Online Adaptation | | 0
Unsupervised Video Object Segmentation with Motion-based Bilateral Networks | | 0
Unsupervised Video Object Segmentation with Joint Hotspot Tracking | | 0
Unsupervised Video Object Segmentation with Online Adversarial Self-Tuning | | 0
Unsupervised Video Segmentation via Spatio-Temporally Nonlocal Appearance Learning | | 0
Value of Temporal Dynamics Information in Driving Scene Segmentation | | 0
VideoClick: Video Object Segmentation with a Single Click | | 0
Video Decomposition Prior: A Methodology to Decompose Videos into Layers | | 0
Video Human Segmentation using Fuzzy Object Models and its Application to Body Pose Estimation of Toddlers for Behavior Studies | | 0
VideoMatch: Matching based Video Object Segmentation | | 0
Video Object of Interest Segmentation | | 0
Video Object Segmentation and Tracking: A Survey | | 0
Video Object Segmentation by Learning Location-Sensitive Embeddings | | 0
Video Object Segmentation through Spatially Accurate and Temporally Dense Extraction of Primary Object Regions | | 0
Video Object Segmentation Using Global and Instance Embedding Learning | | 0
Video Object Segmentation using Tracked Object Proposals | | 0
Video Object Segmentation via SAM 2: The 4th Solution for LSVOS Challenge VOS Track | | 0
Video Object Segmentation with Joint Re-identification and Attention-Aware Mask Propagation | | 0
Video Object Segmentation with Language Referring Expressions | | 0
Video Object Segmentation Without Temporal Information | | 0
Video Propagation Networks | | 0
Video Salient Object Detection Using Spatiotemporal Deep Features | | 0
Video Salient Object Detection via Contrastive Features and Attention Modules | | 0
Video Segmentation via Object Flow | | 0
VIDiff: Translating Videos via Multi-Modal Instructions with Diffusion Models | | 0
Visual Semantic Segmentation Based on Few/Zero-Shot Learning: An Overview | | 0
Adversarial Framework for Unsupervised Learning of Motion Dynamics in Videos | | 0
YouMVOS: An Actor-Centric Multi-Shot Video Object Segmentation Dataset | | 0
YouTube-VOS: A Large-Scale Video Object Segmentation Benchmark | | 0
Zero-Shot 4D Lidar Panoptic Segmentation | | 0
Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | AOC-MF (val) | F-Score | 94.7 | | Unverified
2 | ISVOS (BL30K, MS) | J&F | 93.4 | | Unverified
3 | XMem (BL30K, MS) | J&F | 93.3 | | Unverified
4 | BATMAN (val) | J&F | 92.5 | | Unverified
5 | STCN (val) | J&F | 91.6 | | Unverified
6 | XMem | J&F | 91.5 | | Unverified
7 | MobileVOS (val) | J&F | 91.4 | | Unverified
8 | AOT (val) | J&F | 91.1 | | Unverified
9 | LCM (val) | J&F | 90.7 | | Unverified
10 | RPCMVOS (val) | J&F | 90.6 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BL30K, MS) | Mean Jaccard & F-Measure | 89.5 | | Unverified
2 | LCM | F-measure | 86.5 | | Unverified
3 | XMem | Mean Jaccard & F-Measure | 86.2 | | Unverified
4 | BATMAN | Mean Jaccard & F-Measure | 86.2 | | Unverified
5 | STCN | Mean Jaccard & F-Measure | 85.4 | | Unverified
6 | AOT | Mean Jaccard & F-Measure | 84.9 | | Unverified
7 | STM | F-measure | 84.3 | | Unverified
8 | TransVOS | Mean Jaccard & F-Measure | 83.9 | | Unverified
9 | RPCMVOS | Mean Jaccard & F-Measure | 83.7 | | Unverified
10 | RMN | Mean Jaccard & F-Measure | 83.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BL30K, MS) | Mean Jaccard & F-Measure | 86.9 | | Unverified
2 | AOT | Mean Jaccard & F-Measure | 84.1 | | Unverified
3 | RPCMVOS | Mean Jaccard & F-Measure | 84 | | Unverified
4 | STCN | Mean Jaccard & F-Measure | 83 | | Unverified
5 | CFBI+ | Mean Jaccard & F-Measure | 82.8 | | Unverified
6 | RMN | Jaccard (Seen) | 82.1 | | Unverified
7 | LCM | Mean Jaccard & F-Measure | 82 | | Unverified
8 | TransVOS | Mean Jaccard & F-Measure | 81.8 | | Unverified
9 | SST | Mean Jaccard & F-Measure | 81.7 | | Unverified
10 | LWL | Mean Jaccard & F-Measure | 81.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BL30K, MS) | Mean Jaccard & F-Measure | 83.7 | | Unverified
2 | XMem | Mean Jaccard & F-Measure | 81 | | Unverified
3 | BATMAN | Jaccard | 78.4 | | Unverified
4 | AOT | Jaccard | 75.9 | | Unverified
5 | RPCMVOS | Jaccard | 75.8 | | Unverified
6 | LCM | Jaccard | 74.4 | | Unverified
7 | KMN | Jaccard | 74.1 | | Unverified
8 | TransVOS | Jaccard | 73 | | Unverified
9 | STCN | Jaccard | 72.7 | | Unverified
10 | RMN | Jaccard | 71.9 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BL30K, MS) | Mean Jaccard & F-Measure | 86.8 | | Unverified
2 | XMem | Mean Jaccard & F-Measure | 85.5 | | Unverified
3 | BATMAN | Mean Jaccard & F-Measure | 85 | | Unverified
4 | AOT | Mean Jaccard & F-Measure | 84.1 | | Unverified
5 | RPCMVOS | Mean Jaccard & F-Measure | 83.9 | | Unverified
6 | MobileVOS | Mean Jaccard & F-Measure | 83.3 | | Unverified
7 | STCN | Mean Jaccard & F-Measure | 82.7 | | Unverified
8 | CFBI+ | Mean Jaccard & F-Measure | 82.6 | | Unverified
9 | SST | Mean Jaccard & F-Measure | 81.8 | | Unverified
10 | CFBI | Mean Jaccard & F-Measure | 81 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AOC-MF (val) | Jaccard (Mean) | 81.7 | | Unverified
2 | ViTAE-T-Stage | Jaccard (Mean) | 79.4 | | Unverified
3 | DINO (ViT-B/8, ImageNet retrain) | J&F | 71.4 | | Unverified
4 | VOSwL (Mask+Language) | mIoU | 59 | | Unverified
5 | UniTrack | mIoU | 58.4 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ReVOS | Average IOU | 75.6 | | Unverified
2 | Cutie-base | Average IOU | 74.6 | | Unverified
3 | XMem | Average IOU | 70.4 | | Unverified
4 | SAM 2 | Average IOU | 69.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DFNet | F-Score | 82.3 | | Unverified
2 | ours | Jaccard (Mean) | 76.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Ours | Average | 74.9 | | Unverified
2 | FEELVOS | mIoU | 0.82 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LOCATE | mIoU | 68.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Cutie | J&F | 68.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LOCATE | mIoU | 79.9 | | Unverified