SOTAVerified

Video Object Segmentation

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Papers

Showing 101150 of 551 papers

TitleStatusHype
Guided Interactive Video Object Segmentation Using Reliability-Based Attention MapsCode1
Differentiable Soft-Masked AttentionCode1
Directional Deep Embedding and Appearance Learning for Fast Video Object SegmentationCode1
CrOC: Cross-View Online Clustering for Dense Visual Representation LearningCode1
Reciprocal Transformations for Unsupervised Video Object SegmentationCode1
Learning Spatio-Appearance Memory Network for High-Performance Visual TrackingCode1
Hierarchical Memory Matching Network for Video Object SegmentationCode1
HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static ImagesCode1
Adaptive Multi-source Predictor for Zero-shot Video Object SegmentationCode1
Do Different Tracking Tasks Require Different Appearance Models?Code1
1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object SegmentationCode1
1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video SegmentationCode1
DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking TasksCode1
Event-Free Moving Object Segmentation from Moving Ego VehicleCode1
Contrastive Transformation for Self-supervised Correspondence LearningCode1
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object SegmentationCode1
Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual GroupingCode1
Modular Interactive Video Object Segmentation: Interaction-to-Mask, Propagation and Difference-Aware FusionCode1
Collaborative Video Object Segmentation by Multi-Scale Foreground-Background IntegrationCode1
Collaborative Video Object Segmentation by Foreground-Background IntegrationCode1
Efficient Regional Memory Network for Video Object SegmentationCode1
MATNet: Motion-Attentive Transition Network for Zero-Shot Video Object SegmentationCode1
Make One-Shot Video Object Segmentation Efficient AgainCode1
In-N-Out Generative Learning for Dense Unsupervised Video SegmentationCode1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object SegmentationCode1
Accelerating Video Object Segmentation with Compressed VideoCode1
Making a Case for 3D Convolutions for Object Segmentation in VideosCode1
LoSh: Long-Short Text Joint Prediction Network for Referring Video Object SegmentationCode1
1st Place Solution for 5th LSVOS Challenge: Referring Video Object SegmentationCode1
Local-Global Context Aware Transformer for Language-Guided Video SegmentationCode1
MAST: A Memory-Augmented Self-supervised TrackerCode1
Motion-Attentive Transition for Zero-Shot Video Object SegmentationCode1
Learning to Recommend Frame for Interactive Video Object Segmentation in the WildCode1
FAMINet: Learning Real-time Semi-supervised Video Object Segmentation with Steepest Optimized Optical FlowCode1
Learning Quality-aware Dynamic Memory for Video Object SegmentationCode1
Learning Video Object Segmentation from Unlabeled VideosCode1
Learning Motion and Temporal Cues for Unsupervised Video Object SegmentationCode1
Exploring the Semi-supervised Video Object Segmentation Problem from a Cyclic PerspectiveCode1
Learning Motion-Appearance Co-Attention for Zero-Shot Video Object SegmentationCode1
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object SegmentationCode1
Learning to Learn Better for Video Object SegmentationCode1
Learning Fast and Robust Target Models for Video Object SegmentationCode1
Learning Object Depth from Camera Motion and Video Object SegmentationCode1
LiVOS: Light Video Object Segmentation with Gated Linear MatchingCode1
Fast Template Matching and Update for Video Object Tracking and SegmentationCode1
Full-Duplex Strategy for Video Object SegmentationCode1
LVOS: A Benchmark for Long-term Video Object SegmentationCode1
Fast Video Object Segmentation using the Global Context ModuleCode1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object SegmentationCode1
Learning What to Learn for Video Object SegmentationCode1
Show:102550
← PrevPage 3 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1AOC-MF (val)F-Score94.7Unverified
2ISVOS (BL30K, MS)J&F93.4Unverified
3XMem (BL30K, MS)J&F93.3Unverified
4BATMAN (val)J&F92.5Unverified
5STCN (val)J&F91.6Unverified
6XMemJ&F91.5Unverified
7MobileVOS (val)J&F91.4Unverified
8AOT (val)J&F91.1Unverified
9LCM (val)J&F90.7Unverified
10RPCMVOS (val)J&F90.6Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BLK30K, MS)Mean Jaccard & F-Measure89.5Unverified
2LCMF-measure86.5Unverified
3XMemMean Jaccard & F-Measure86.2Unverified
4BATMANMean Jaccard & F-Measure86.2Unverified
5STCNMean Jaccard & F-Measure85.4Unverified
6AOTMean Jaccard & F-Measure84.9Unverified
7STMF-measure84.3Unverified
8TransVOSMean Jaccard & F-Measure83.9Unverified
9RPCMVOSMean Jaccard & F-Measure83.7Unverified
10RMNMean Jaccard & F-Measure83.5Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K, MS)Mean Jaccard & F-Measure86.9Unverified
2AOTMean Jaccard & F-Measure84.1Unverified
3RPCMVOSMean Jaccard & F-Measure84Unverified
4STCNMean Jaccard & F-Measure83Unverified
5CFBI+Mean Jaccard & F-Measure82.8Unverified
6RMNJaccard (Seen)82.1Unverified
7LCMMean Jaccard & F-Measure82Unverified
8TransVOSMean Jaccard & F-Measure81.8Unverified
9SSTMean Jaccard & F-Measure81.7Unverified
10LWLMean Jaccard & F-Measure81.5Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K, MS)Mean Jaccard & F-Measure83.7Unverified
2XMemMean Jaccard & F-Measure81Unverified
3BATMANJaccard78.4Unverified
4AOTJaccard75.9Unverified
5RPCMVOSJaccard75.8Unverified
6LCMJaccard74.4Unverified
7KMNJaccard74.1Unverified
8TransVOSJaccard73Unverified
9STCNJaccard72.7Unverified
10RMNJaccard71.9Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K,MS)Mean Jaccard & F-Measure86.8Unverified
2XMemMean Jaccard & F-Measure85.5Unverified
3BATMANMean Jaccard & F-Measure85Unverified
4AOTMean Jaccard & F-Measure84.1Unverified
5RPCMVOSMean Jaccard & F-Measure83.9Unverified
6MobileVOSMean Jaccard & F-Measure83.3Unverified
7STCNMean Jaccard & F-Measure82.7Unverified
8CFBI+Mean Jaccard & F-Measure82.6Unverified
9SSTMean Jaccard & F-Measure81.8Unverified
10CFBIMean Jaccard & F-Measure81Unverified
#ModelMetricClaimedVerifiedStatus
1AOC-MF (val)Jaccard (Mean)81.7Unverified
2ViTAE-T-StageJaccard (Mean)79.4Unverified
3DINO (ViT-B/8, ImageNet retrain)J&F71.4Unverified
4VOSwL (Mask+Language)mIoU59Unverified
5UniTrackmIoU58.4Unverified
#ModelMetricClaimedVerifiedStatus
1ReVOSAverage IOU75.6Unverified
2Cutie-baseAverage IOU74.6Unverified
3XMemAverage IOU70.4Unverified
4SAM 2Average IOU69.5Unverified
#ModelMetricClaimedVerifiedStatus
1DFNetF-Score82.3Unverified
2oursJaccard (Mean)76.7Unverified
#ModelMetricClaimedVerifiedStatus
1OursAverage74.9Unverified
2FEELVOSmIoU0.82Unverified
#ModelMetricClaimedVerifiedStatus
1LOCATEmIoU68.8Unverified
#ModelMetricClaimedVerifiedStatus
1CutieJ&F68.3Unverified
#ModelMetricClaimedVerifiedStatus
1LOCATEmIoU79.9Unverified