SOTAVerified

Video Object Segmentation

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Papers

Showing 301350 of 551 papers

TitleStatusHype
Iteratively Selecting an Easy Reference Frame Makes Unsupervised Video Object Segmentation Easier0
A Discriminative Single-Shot Segmentation Network for Visual Object Tracking0
HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static ImagesCode1
Autoencoder-based background reconstruction and foreground segmentation with background noise estimationCode1
Reliable Propagation-Correction Modulation for Video Object SegmentationCode1
MUNet: Motion Uncertainty-aware Semi-supervised Video Object Segmentation0
End-to-End Referring Video Object Segmentation with Multimodal TransformersCode1
Learning To Segment Dominant Object Motion From Watching Videos0
Hierarchical interaction network for video object segmentation from referring expressions0
FlowVOS: Weakly-Supervised Visual Warping for Detail-Preserving and Temporally Consistent Single-Shot Video Object Segmentation0
FAMINet: Learning Real-time Semi-supervised Video Object Segmentation with Steepest Optimized Optical FlowCode1
D2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in VideosCode1
D^2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in VideosCode1
Dense Unsupervised Learning for Video SegmentationCode1
Video Salient Object Detection via Contrastive Features and Attention Modules0
Exploring the Semi-supervised Video Object Segmentation Problem from a Cyclic PerspectiveCode1
SiamPolar: Semi-supervised Realtime Video Object Segmentation with Polar Representation0
Multi-Object Tracking and Segmentation with a Space-Time Memory Network0
Pixel-Level Bijective Matching for Video Object SegmentationCode1
Hierarchical Memory Matching Network for Video Object SegmentationCode1
Space Time Recurrent Memory Network0
Shifted Chunk Transformer for Spatio-Temporal Representational Learning0
VIL-100: A New Dataset and A Baseline Model for Video Instance Lane DetectionCode1
Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object SegmentationCode1
Joint Inductive and Transductive Learning for Video Object SegmentationCode1
Full-Duplex Strategy for Video Object SegmentationCode1
Self-Supervised Video Object Segmentation by Motion-Aware Mask PropagationCode1
Accelerating Video Object Segmentation with Compressed VideoCode1
MeNToS: Tracklets Association with a Space-Time Memory Network0
Fast Pixel-Matching for Video Object SegmentationCode0
Do Different Tracking Tasks Require Different Appearance Models?Code1
Reciprocal Transformations for Unsupervised Video Object SegmentationCode1
Delving Deep Into Many-to-Many Attention for Few-Shot Video Object SegmentationCode1
Video Object Segmentation Using Global and Instance Embedding Learning0
MSN: Efficient Online Mask Selection Network for Video Instance SegmentationCode0
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object SegmentationCode1
SynthRef: Generation of Synthetic Referring Expressions for Object SegmentationCode1
ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive BiasCode1
Associating Objects with Transformers for Video Object SegmentationCode1
Rethinking Cross-modal Interaction from a Top-down Perspective for Referring Video Object Segmentation0
TransVOS: Video Object Segmentation with TransformersCode1
Polygonal Point Set TrackingCode1
Attention-guided Temporally Coherent Video Object MattingCode1
DAVOS: Semi-Supervised Video Object Segmentation via Adversarial Domain Adaptation0
Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence LearningCode1
Emerging Properties in Self-Supervised Vision TransformersCode1
Guided Interactive Video Object Segmentation Using Reliability-Based Attention MapsCode1
Self-supervised Video Object Segmentation by Motion Grouping0
Target-Aware Object Discovery and Association for Unsupervised Video Multi-Object Segmentation0
Learning Position and Target Consistency for Memory-based Video Object Segmentation0
Show:102550
← PrevPage 7 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1AOC-MF (val)F-Score94.7Unverified
2ISVOS (BL30K, MS)J&F93.4Unverified
3XMem (BL30K, MS)J&F93.3Unverified
4BATMAN (val)J&F92.5Unverified
5STCN (val)J&F91.6Unverified
6XMemJ&F91.5Unverified
7MobileVOS (val)J&F91.4Unverified
8AOT (val)J&F91.1Unverified
9LCM (val)J&F90.7Unverified
10RPCMVOS (val)J&F90.6Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BLK30K, MS)Mean Jaccard & F-Measure89.5Unverified
2LCMF-measure86.5Unverified
3XMemMean Jaccard & F-Measure86.2Unverified
4BATMANMean Jaccard & F-Measure86.2Unverified
5STCNMean Jaccard & F-Measure85.4Unverified
6AOTMean Jaccard & F-Measure84.9Unverified
7STMF-measure84.3Unverified
8TransVOSMean Jaccard & F-Measure83.9Unverified
9RPCMVOSMean Jaccard & F-Measure83.7Unverified
10RMNMean Jaccard & F-Measure83.5Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K, MS)Mean Jaccard & F-Measure86.9Unverified
2AOTMean Jaccard & F-Measure84.1Unverified
3RPCMVOSMean Jaccard & F-Measure84Unverified
4STCNMean Jaccard & F-Measure83Unverified
5CFBI+Mean Jaccard & F-Measure82.8Unverified
6RMNJaccard (Seen)82.1Unverified
7LCMMean Jaccard & F-Measure82Unverified
8TransVOSMean Jaccard & F-Measure81.8Unverified
9SSTMean Jaccard & F-Measure81.7Unverified
10LWLMean Jaccard & F-Measure81.5Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K, MS)Mean Jaccard & F-Measure83.7Unverified
2XMemMean Jaccard & F-Measure81Unverified
3BATMANJaccard78.4Unverified
4AOTJaccard75.9Unverified
5RPCMVOSJaccard75.8Unverified
6LCMJaccard74.4Unverified
7KMNJaccard74.1Unverified
8TransVOSJaccard73Unverified
9STCNJaccard72.7Unverified
10RMNJaccard71.9Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K,MS)Mean Jaccard & F-Measure86.8Unverified
2XMemMean Jaccard & F-Measure85.5Unverified
3BATMANMean Jaccard & F-Measure85Unverified
4AOTMean Jaccard & F-Measure84.1Unverified
5RPCMVOSMean Jaccard & F-Measure83.9Unverified
6MobileVOSMean Jaccard & F-Measure83.3Unverified
7STCNMean Jaccard & F-Measure82.7Unverified
8CFBI+Mean Jaccard & F-Measure82.6Unverified
9SSTMean Jaccard & F-Measure81.8Unverified
10CFBIMean Jaccard & F-Measure81Unverified
#ModelMetricClaimedVerifiedStatus
1AOC-MF (val)Jaccard (Mean)81.7Unverified
2ViTAE-T-StageJaccard (Mean)79.4Unverified
3DINO (ViT-B/8, ImageNet retrain)J&F71.4Unverified
4VOSwL (Mask+Language)mIoU59Unverified
5UniTrackmIoU58.4Unverified
#ModelMetricClaimedVerifiedStatus
1ReVOSAverage IOU75.6Unverified
2Cutie-baseAverage IOU74.6Unverified
3XMemAverage IOU70.4Unverified
4SAM 2Average IOU69.5Unverified
#ModelMetricClaimedVerifiedStatus
1DFNetF-Score82.3Unverified
2oursJaccard (Mean)76.7Unverified
#ModelMetricClaimedVerifiedStatus
1OursAverage74.9Unverified
2FEELVOSmIoU0.82Unverified
#ModelMetricClaimedVerifiedStatus
1LOCATEmIoU68.8Unverified
#ModelMetricClaimedVerifiedStatus
1CutieJ&F68.3Unverified
#ModelMetricClaimedVerifiedStatus
1LOCATEmIoU79.9Unverified