SOTAVerified

Video Object Segmentation

Video object segmentation is a binary labeling problem that aims to separate the foreground object(s) from the background region of a video.
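As a rough illustration of the binary-labeling view, each frame can be represented as a mask of 0s (background) and 1s (foreground), and a predicted mask can be compared with a ground-truth mask using the Jaccard index (intersection over union), one of the metrics named in the benchmark tables on this page. The sketch below is a minimal, hypothetical helper and not the evaluation code of any listed benchmark; the names `jaccard` and `mean_jaccard`, the nested-list mask layout, and the convention that two empty masks score 1.0 are all assumptions made for the example.

```python
from typing import List, Sequence

# One frame: rows of 0 (background) / 1 (foreground). Layout is an assumption.
Mask = List[List[int]]


def jaccard(pred: Mask, truth: Mask) -> float:
    """Intersection over union of two binary masks of equal shape."""
    inter = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                inter += 1
            if p or t:
                union += 1
    # Assumed convention: two all-background masks agree perfectly.
    return 1.0 if union == 0 else inter / union


def mean_jaccard(pred_video: Sequence[Mask], truth_video: Sequence[Mask]) -> float:
    """Average the per-frame Jaccard index over a video."""
    if len(pred_video) == 0 or len(pred_video) != len(truth_video):
        raise ValueError("videos must be non-empty and have the same length")
    scores = [jaccard(p, t) for p, t in zip(pred_video, truth_video)]
    return sum(scores) / len(scores)


# Two 2x2 frames: the first prediction over-segments by one pixel,
# the second matches the ground truth exactly.
pred = [[[1, 1], [0, 0]], [[0, 1], [0, 1]]]
truth = [[[1, 0], [0, 0]], [[0, 1], [0, 1]]]
print(mean_jaccard(pred, truth))  # 0.75
```

Published benchmarks typically add details this sketch leaves out, such as per-object averaging and a separate boundary F-measure, so scores computed this way are not directly comparable to leaderboard numbers.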

For leaderboards, please refer to the different subtasks.

Papers

Showing 51-100 of 551 papers

Title | Status | Hype
DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Code | 1
MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation | Code | 1
Learning Motion and Temporal Cues for Unsupervised Video Object Segmentation | Code | 1
M^3-VOS: Multi-Phase, Multi-Transition, and Multi-Scenery Video Object Segmentation | Code | 1
Referring Video Object Segmentation via Language-aligned Track Selection | Code | 1
Multi-Granularity Video Object Segmentation | Code | 1
LiVOS: Light Video Object Segmentation with Gated Linear Matching | Code | 1
X-Prompt: Multi-modal Visual Prompt for Video Object Segmentation | Code | 1
ActionVOS: Actions as Prompts for Video Object Segmentation | Code | 1
Video Inpainting Localization with Contrastive Learning | Code | 1
1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Code | 1
Event-assisted Low-Light Video Object Segmentation | Code | 1
Temporally Consistent Referring Video Object Segmentation with Hybrid Memory | Code | 1
Exploring Pre-trained Text-to-Video Diffusion Models for Referring Video Object Segmentation | Code | 1
Video Object Segmentation with Dynamic Query Modulation | Code | 1
Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything | Code | 1
Depth-aware Test-Time Training for Zero-shot Video Object Segmentation | Code | 1
VideoMAC: Video Masked Autoencoders Meet ConvNets | Code | 1
Lester: rotoscope animation through video object segmentation and tracking | Code | 1
1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation | Code | 1
Tracking with Human-Intent Reasoning | Code | 1
Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation | Code | 1
SEGIC: Unleashing the Emergent Correspondence for In-Context Segmentation | Code | 1
Treating Motion as Option with Output Selection for Unsupervised Video Object Segmentation | Code | 1
PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation | Code | 1
Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation | Code | 1
Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation | Code | 1
Spectrum-guided Multi-granularity Referring Video Object Segmentation | Code | 1
OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation | Code | 1
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation | Code | 1
LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation | Code | 1
SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation | Code | 1
Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation | Code | 1
UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model | Code | 1
Event-Free Moving Object Segmentation from Moving Ego Vehicle | Code | 1
Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping | Code | 1
Segment Everything Everywhere All at Once | Code | 1
Boosting Video Object Segmentation via Space-time Correspondence Learning | Code | 1
DropMAE: Masked Autoencoders with Spatial-Attention Dropout for Tracking Tasks | Code | 1
Reliability-Hierarchical Memory Network for Scribble-Supervised Video Object Segmentation | Code | 1
CrOC: Cross-View Online Clustering for Dense Visual Representation Learning | Code | 1
Two-shot Video Object Segmentation | Code | 1
Adaptive Multi-source Predictor for Zero-shot Video Object Segmentation | Code | 1
Guided Slot Attention for Unsupervised Video Object Segmentation | Code | 1
Self-Supervised Unseen Object Instance Segmentation via Long-Term Robot Interaction | Code | 1
TarViS: A Unified Approach for Target-based Video Segmentation | Code | 1
End-to-End Video Matting With Trimap Propagation | Code | 1
Video Object Segmentation-aware Video Frame Interpolation | Code | 1
1st Place Solution for YouTubeVOS Challenge 2022: Referring Video Object Segmentation | Code | 1
Learning to Learn Better for Video Object Segmentation | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | AOC-MF (val) | F-Score | 94.7 |  | Unverified
2 | ISVOS (BL30K, MS) | J&F | 93.4 |  | Unverified
3 | XMem (BL30K, MS) | J&F | 93.3 |  | Unverified
4 | BATMAN (val) | J&F | 92.5 |  | Unverified
5 | STCN (val) | J&F | 91.6 |  | Unverified
6 | XMem | J&F | 91.5 |  | Unverified
7 | MobileVOS (val) | J&F | 91.4 |  | Unverified
8 | AOT (val) | J&F | 91.1 |  | Unverified
9 | LCM (val) | J&F | 90.7 |  | Unverified
10 | RPCMVOS (val) | J&F | 90.6 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BLK30K, MS) | Mean Jaccard & F-Measure | 89.5 |  | Unverified
2 | LCM | F-measure | 86.5 |  | Unverified
3 | XMem | Mean Jaccard & F-Measure | 86.2 |  | Unverified
4 | BATMAN | Mean Jaccard & F-Measure | 86.2 |  | Unverified
5 | STCN | Mean Jaccard & F-Measure | 85.4 |  | Unverified
6 | AOT | Mean Jaccard & F-Measure | 84.9 |  | Unverified
7 | STM | F-measure | 84.3 |  | Unverified
8 | TransVOS | Mean Jaccard & F-Measure | 83.9 |  | Unverified
9 | RPCMVOS | Mean Jaccard & F-Measure | 83.7 |  | Unverified
10 | RMN | Mean Jaccard & F-Measure | 83.5 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BL30K, MS) | Mean Jaccard & F-Measure | 86.9 |  | Unverified
2 | AOT | Mean Jaccard & F-Measure | 84.1 |  | Unverified
3 | RPCMVOS | Mean Jaccard & F-Measure | 84 |  | Unverified
4 | STCN | Mean Jaccard & F-Measure | 83 |  | Unverified
5 | CFBI+ | Mean Jaccard & F-Measure | 82.8 |  | Unverified
6 | RMN | Jaccard (Seen) | 82.1 |  | Unverified
7 | LCM | Mean Jaccard & F-Measure | 82 |  | Unverified
8 | TransVOS | Mean Jaccard & F-Measure | 81.8 |  | Unverified
9 | SST | Mean Jaccard & F-Measure | 81.7 |  | Unverified
10 | LWL | Mean Jaccard & F-Measure | 81.5 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BL30K, MS) | Mean Jaccard & F-Measure | 83.7 |  | Unverified
2 | XMem | Mean Jaccard & F-Measure | 81 |  | Unverified
3 | BATMAN | Jaccard | 78.4 |  | Unverified
4 | AOT | Jaccard | 75.9 |  | Unverified
5 | RPCMVOS | Jaccard | 75.8 |  | Unverified
6 | LCM | Jaccard | 74.4 |  | Unverified
7 | KMN | Jaccard | 74.1 |  | Unverified
8 | TransVOS | Jaccard | 73 |  | Unverified
9 | STCN | Jaccard | 72.7 |  | Unverified
10 | RMN | Jaccard | 71.9 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BL30K,MS) | Mean Jaccard & F-Measure | 86.8 |  | Unverified
2 | XMem | Mean Jaccard & F-Measure | 85.5 |  | Unverified
3 | BATMAN | Mean Jaccard & F-Measure | 85 |  | Unverified
4 | AOT | Mean Jaccard & F-Measure | 84.1 |  | Unverified
5 | RPCMVOS | Mean Jaccard & F-Measure | 83.9 |  | Unverified
6 | MobileVOS | Mean Jaccard & F-Measure | 83.3 |  | Unverified
7 | STCN | Mean Jaccard & F-Measure | 82.7 |  | Unverified
8 | CFBI+ | Mean Jaccard & F-Measure | 82.6 |  | Unverified
9 | SST | Mean Jaccard & F-Measure | 81.8 |  | Unverified
10 | CFBI | Mean Jaccard & F-Measure | 81 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AOC-MF (val) | Jaccard (Mean) | 81.7 |  | Unverified
2 | ViTAE-T-Stage | Jaccard (Mean) | 79.4 |  | Unverified
3 | DINO (ViT-B/8, ImageNet retrain) | J&F | 71.4 |  | Unverified
4 | VOSwL (Mask+Language) | mIoU | 59 |  | Unverified
5 | UniTrack | mIoU | 58.4 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ReVOS | Average IOU | 75.6 |  | Unverified
2 | Cutie-base | Average IOU | 74.6 |  | Unverified
3 | XMem | Average IOU | 70.4 |  | Unverified
4 | SAM 2 | Average IOU | 69.5 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DFNet | F-Score | 82.3 |  | Unverified
2 | ours | Jaccard (Mean) | 76.7 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Ours | Average | 74.9 |  | Unverified
2 | FEELVOS | mIoU | 0.82 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LOCATE | mIoU | 68.8 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Cutie | J&F | 68.3 |  | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LOCATE | mIoU | 79.9 |  | Unverified