SOTAVerified

Video Object Segmentation

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Papers

Showing 201250 of 551 papers

TitleStatusHype
Isomer: Isomerous Transformer for Zero-shot Video Object SegmentationCode1
Autoencoder-based background reconstruction and foreground segmentation with background noise estimationCode1
Local-Global Context Aware Transformer for Language-Guided Video SegmentationCode1
BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting FramesCode0
Lucid Data Dreaming for Video Object SegmentationCode0
Efficient Video Object Segmentation via Network ModulationCode0
Temporal Transductive Inference for Few-Shot Video Object SegmentationCode0
LSMVOS: Long-Short-Term Similarity Matching for Video ObjectCode0
LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-trainingCode0
Box Supervised Video Segmentation Proposal NetworkCode0
Stable Mean Teacher for Semi-supervised Video Action DetectionCode0
Sub-token ViT Embedding via Stochastic Resonance TransformersCode0
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of ThoughtsCode0
Spatiotemporal CNN for Video Object SegmentationCode0
Adaptive Temporal Encoding Network for Video Instance-level Human ParsingCode0
Learning Video Object Segmentation from Static ImagesCode0
Learning Unsupervised Video Object Segmentation Through Visual AttentionCode0
Separable Structure Modeling for Semi-supervised Video Object SegmentationCode0
Boosting Video Object Segmentation based on Scale InconsistencyCode0
Shifting More Attention to Video Salient Object DetectionCode0
Adaptive ROI Generation for Video Object Segmentation Using Reinforcement LearningCode0
Siamese Network with Interactive Transformer for Video Object SegmentationCode0
DMM-Net: Differentiable Mask-Matching Network for Video Object SegmentationCode0
Self-supervised Amodal Video Object SegmentationCode0
Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical videoCode0
Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOSCode0
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited SamplesCode0
SegFlow: Joint Learning for Video Object Segmentation and Optical FlowCode0
Learning Correspondence from the Cycle-Consistency of TimeCode0
Semi-supervised Active Learning for Video Action DetectionCode0
Strike the Balance: On-the-Fly Uncertainty based User Interactions for Long-Term Video Object SegmentationCode0
Adaptive Memory Management for Video Object SegmentationCode0
Revisiting Click-based Interactive Video Object SegmentationCode0
DTOS: Dynamic Time Object Sensing with Large Multimodal ModelCode0
Revisiting Sequence-to-Sequence Video Object Segmentation with Multi-Task Loss and Skip-MemoryCode0
Reducing Annotation Burden: Exploiting Image Knowledge for Few-Shot Medical Video Object Segmentation via Spatiotemporal Consistency RelearningCode0
Adaptive Masked Proxies for Few-Shot SegmentationCode0
ReferDINO-Plus: 2nd Solution for 4th PVUW MeViS Challenge at CVPR 2025Code0
RVOS: End-to-End Recurrent Network for Video Object SegmentationCode0
Self-supervised Video Object SegmentationCode0
Implicit Motion-Compensated Network for Unsupervised Video Object SegmentationCode0
Deep Extreme Cut: From Extreme Points to Object SegmentationCode0
Illumination-Based Data Augmentation for Robust Background SubtractionCode0
Hybrid-S2S: Video Object Segmentation with Recurrent Networks and Correspondence MatchingCode0
RANet: Ranking Attention Network for Fast Video Object SegmentationCode0
Ground-truth or DAER: Selective Re-query of Secondary InformationCode0
Holistic Prototype Attention Network for Few-Shot VOSCode0
Proposal, Tracking and Segmentation (PTS): A Cascaded Network for Video Object SegmentationCode0
PReMVOS: Proposal-generation, Refinement and Merging for Video Object SegmentationCode0
A 3D Convolutional Approach to Spectral Object Segmentation in Space and TimeCode0
Show:102550
← PrevPage 5 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1AOC-MF (val)F-Score94.7Unverified
2ISVOS (BL30K, MS)J&F93.4Unverified
3XMem (BL30K, MS)J&F93.3Unverified
4BATMAN (val)J&F92.5Unverified
5STCN (val)J&F91.6Unverified
6XMemJ&F91.5Unverified
7MobileVOS (val)J&F91.4Unverified
8AOT (val)J&F91.1Unverified
9LCM (val)J&F90.7Unverified
10RPCMVOS (val)J&F90.6Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BLK30K, MS)Mean Jaccard & F-Measure89.5Unverified
2LCMF-measure86.5Unverified
3XMemMean Jaccard & F-Measure86.2Unverified
4BATMANMean Jaccard & F-Measure86.2Unverified
5STCNMean Jaccard & F-Measure85.4Unverified
6AOTMean Jaccard & F-Measure84.9Unverified
7STMF-measure84.3Unverified
8TransVOSMean Jaccard & F-Measure83.9Unverified
9RPCMVOSMean Jaccard & F-Measure83.7Unverified
10RMNMean Jaccard & F-Measure83.5Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K, MS)Mean Jaccard & F-Measure86.9Unverified
2AOTMean Jaccard & F-Measure84.1Unverified
3RPCMVOSMean Jaccard & F-Measure84Unverified
4STCNMean Jaccard & F-Measure83Unverified
5CFBI+Mean Jaccard & F-Measure82.8Unverified
6RMNJaccard (Seen)82.1Unverified
7LCMMean Jaccard & F-Measure82Unverified
8TransVOSMean Jaccard & F-Measure81.8Unverified
9SSTMean Jaccard & F-Measure81.7Unverified
10LWLMean Jaccard & F-Measure81.5Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K, MS)Mean Jaccard & F-Measure83.7Unverified
2XMemMean Jaccard & F-Measure81Unverified
3BATMANJaccard78.4Unverified
4AOTJaccard75.9Unverified
5RPCMVOSJaccard75.8Unverified
6LCMJaccard74.4Unverified
7KMNJaccard74.1Unverified
8TransVOSJaccard73Unverified
9STCNJaccard72.7Unverified
10RMNJaccard71.9Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K,MS)Mean Jaccard & F-Measure86.8Unverified
2XMemMean Jaccard & F-Measure85.5Unverified
3BATMANMean Jaccard & F-Measure85Unverified
4AOTMean Jaccard & F-Measure84.1Unverified
5RPCMVOSMean Jaccard & F-Measure83.9Unverified
6MobileVOSMean Jaccard & F-Measure83.3Unverified
7STCNMean Jaccard & F-Measure82.7Unverified
8CFBI+Mean Jaccard & F-Measure82.6Unverified
9SSTMean Jaccard & F-Measure81.8Unverified
10CFBIMean Jaccard & F-Measure81Unverified
#ModelMetricClaimedVerifiedStatus
1AOC-MF (val)Jaccard (Mean)81.7Unverified
2ViTAE-T-StageJaccard (Mean)79.4Unverified
3DINO (ViT-B/8, ImageNet retrain)J&F71.4Unverified
4VOSwL (Mask+Language)mIoU59Unverified
5UniTrackmIoU58.4Unverified
#ModelMetricClaimedVerifiedStatus
1ReVOSAverage IOU75.6Unverified
2Cutie-baseAverage IOU74.6Unverified
3XMemAverage IOU70.4Unverified
4SAM 2Average IOU69.5Unverified
#ModelMetricClaimedVerifiedStatus
1DFNetF-Score82.3Unverified
2oursJaccard (Mean)76.7Unverified
#ModelMetricClaimedVerifiedStatus
1OursAverage74.9Unverified
2FEELVOSmIoU0.82Unverified
#ModelMetricClaimedVerifiedStatus
1LOCATEmIoU68.8Unverified
#ModelMetricClaimedVerifiedStatus
1CutieJ&F68.3Unverified
#ModelMetricClaimedVerifiedStatus
1LOCATEmIoU79.9Unverified