SOTAVerified

Video Object Segmentation

Video object segmentation is a binary labeling problem aiming to separate foreground object(s) from the background region of a video.

For leaderboards please refer to the different subtasks.

Papers

Showing 101150 of 551 papers

TitleStatusHype
Dual Prototype Attention for Unsupervised Video Object SegmentationCode1
LVOS: A Benchmark for Long-term Video Object SegmentationCode1
Global Spectral Filter Memory Network for Video Object SegmentationCode1
Self-supervised Video Representation Learning with Motion-Aware Masked AutoencodersCode1
EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object RelationsCode1
BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in VideoCode1
A Simple and Powerful Global Optimization for Unsupervised Video Object SegmentationCode1
Unsupervised Video Object Segmentation via Prototype Memory NetworkCode1
Treating Motion as Option to Reduce Motion Dependency in Unsupervised Video Object SegmentationCode1
SWEM: Towards Real-Time Video Object Segmentation with Sequential Weighted Expectation-MaximizationCode1
Per-Clip Video Object SegmentationCode1
Multi-Attention Network for Compressed Video Referring Object SegmentationCode1
Semantic-Aware Fine-Grained CorrespondenceCode1
Hierarchical Feature Alignment Network for Unsupervised Video Object SegmentationCode1
Learning Quality-aware Dynamic Memory for Video Object SegmentationCode1
Tackling Background Distraction in Video Object SegmentationCode1
Towards Robust Referring Video Object Segmentation with Cyclic Relational ConsensusCode1
Towards Robust Video Object Segmentation with Adaptive Object CalibrationCode1
Language-Bridged Spatial-Temporal Interaction for Referring Video Object SegmentationCode1
A Deeper Dive Into What Deep Spatiotemporal Networks Encode: Quantifying Static vs. Dynamic InformationCode1
Differentiable Soft-Masked AttentionCode1
Recurrent Dynamic Embedding for Video Object SegmentationCode1
In-N-Out Generative Learning for Dense Unsupervised Video SegmentationCode1
Robust Visual Tracking by SegmentationCode1
Local-Global Context Aware Transformer for Language-Guided Video SegmentationCode1
End-to-End Semi-Supervised Learning for Video Action DetectionCode1
Wnet: Audio-Guided Video Object Segmentation via Wavelet-Based Cross-Modal Denoising NetworksCode1
HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static ImagesCode1
Autoencoder-based background reconstruction and foreground segmentation with background noise estimationCode1
Reliable Propagation-Correction Modulation for Video Object SegmentationCode1
End-to-End Referring Video Object Segmentation with Multimodal TransformersCode1
FAMINet: Learning Real-time Semi-supervised Video Object Segmentation with Steepest Optimized Optical FlowCode1
D2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in VideosCode1
D^2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in VideosCode1
Dense Unsupervised Learning for Video SegmentationCode1
Exploring the Semi-supervised Video Object Segmentation Problem from a Cyclic PerspectiveCode1
Pixel-Level Bijective Matching for Video Object SegmentationCode1
Hierarchical Memory Matching Network for Video Object SegmentationCode1
VIL-100: A New Dataset and A Baseline Model for Video Instance Lane DetectionCode1
Multi-Source Fusion and Automatic Predictor Selection for Zero-Shot Video Object SegmentationCode1
Joint Inductive and Transductive Learning for Video Object SegmentationCode1
Full-Duplex Strategy for Video Object SegmentationCode1
Self-Supervised Video Object Segmentation by Motion-Aware Mask PropagationCode1
Accelerating Video Object Segmentation with Compressed VideoCode1
Do Different Tracking Tasks Require Different Appearance Models?Code1
Delving Deep Into Many-to-Many Attention for Few-Shot Video Object SegmentationCode1
Reciprocal Transformations for Unsupervised Video Object SegmentationCode1
Rethinking Space-Time Networks with Improved Memory Coverage for Efficient Video Object SegmentationCode1
SynthRef: Generation of Synthetic Referring Expressions for Object SegmentationCode1
ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive BiasCode1
Show:102550
← PrevPage 3 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1AOC-MF (val)F-Score94.7Unverified
2ISVOS (BL30K, MS)J&F93.4Unverified
3XMem (BL30K, MS)J&F93.3Unverified
4BATMAN (val)J&F92.5Unverified
5STCN (val)J&F91.6Unverified
6XMemJ&F91.5Unverified
7MobileVOS (val)J&F91.4Unverified
8AOT (val)J&F91.1Unverified
9LCM (val)J&F90.7Unverified
10RPCMVOS (val)J&F90.6Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BLK30K, MS)Mean Jaccard & F-Measure89.5Unverified
2LCMF-measure86.5Unverified
3XMemMean Jaccard & F-Measure86.2Unverified
4BATMANMean Jaccard & F-Measure86.2Unverified
5STCNMean Jaccard & F-Measure85.4Unverified
6AOTMean Jaccard & F-Measure84.9Unverified
7STMF-measure84.3Unverified
8TransVOSMean Jaccard & F-Measure83.9Unverified
9RPCMVOSMean Jaccard & F-Measure83.7Unverified
10RMNMean Jaccard & F-Measure83.5Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K, MS)Mean Jaccard & F-Measure86.9Unverified
2AOTMean Jaccard & F-Measure84.1Unverified
3RPCMVOSMean Jaccard & F-Measure84Unverified
4STCNMean Jaccard & F-Measure83Unverified
5CFBI+Mean Jaccard & F-Measure82.8Unverified
6RMNJaccard (Seen)82.1Unverified
7LCMMean Jaccard & F-Measure82Unverified
8TransVOSMean Jaccard & F-Measure81.8Unverified
9SSTMean Jaccard & F-Measure81.7Unverified
10LWLMean Jaccard & F-Measure81.5Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K, MS)Mean Jaccard & F-Measure83.7Unverified
2XMemMean Jaccard & F-Measure81Unverified
3BATMANJaccard78.4Unverified
4AOTJaccard75.9Unverified
5RPCMVOSJaccard75.8Unverified
6LCMJaccard74.4Unverified
7KMNJaccard74.1Unverified
8TransVOSJaccard73Unverified
9STCNJaccard72.7Unverified
10RMNJaccard71.9Unverified
#ModelMetricClaimedVerifiedStatus
1XMem (BL30K,MS)Mean Jaccard & F-Measure86.8Unverified
2XMemMean Jaccard & F-Measure85.5Unverified
3BATMANMean Jaccard & F-Measure85Unverified
4AOTMean Jaccard & F-Measure84.1Unverified
5RPCMVOSMean Jaccard & F-Measure83.9Unverified
6MobileVOSMean Jaccard & F-Measure83.3Unverified
7STCNMean Jaccard & F-Measure82.7Unverified
8CFBI+Mean Jaccard & F-Measure82.6Unverified
9SSTMean Jaccard & F-Measure81.8Unverified
10CFBIMean Jaccard & F-Measure81Unverified
#ModelMetricClaimedVerifiedStatus
1AOC-MF (val)Jaccard (Mean)81.7Unverified
2ViTAE-T-StageJaccard (Mean)79.4Unverified
3DINO (ViT-B/8, ImageNet retrain)J&F71.4Unverified
4VOSwL (Mask+Language)mIoU59Unverified
5UniTrackmIoU58.4Unverified
#ModelMetricClaimedVerifiedStatus
1ReVOSAverage IOU75.6Unverified
2Cutie-baseAverage IOU74.6Unverified
3XMemAverage IOU70.4Unverified
4SAM 2Average IOU69.5Unverified
#ModelMetricClaimedVerifiedStatus
1DFNetF-Score82.3Unverified
2oursJaccard (Mean)76.7Unverified
#ModelMetricClaimedVerifiedStatus
1OursAverage74.9Unverified
2FEELVOSmIoU0.82Unverified
#ModelMetricClaimedVerifiedStatus
1LOCATEmIoU68.8Unverified
#ModelMetricClaimedVerifiedStatus
1CutieJ&F68.3Unverified
#ModelMetricClaimedVerifiedStatus
1LOCATEmIoU79.9Unverified