SOTAVerified

Video Object Segmentation

Video object segmentation is a binary labeling problem that aims to separate the foreground object(s) from the background region of a video.
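The benchmark tables on this page report Jaccard and J&F scores. As a minimal sketch of the region-similarity (Jaccard) part only, assuming each mask is a nested list of 0/1 values for a single frame, the overlap between a predicted and a ground-truth mask could be computed as follows. The function name `jaccard` and the toy masks are illustrative assumptions, not taken from any benchmark toolkit.

```python
def jaccard(pred, truth):
    """Intersection over union of two binary masks of equal shape."""
    inter = 0
    union = 0
    for pred_row, truth_row in zip(pred, truth):
        for p, t in zip(pred_row, truth_row):
            if p and t:
                inter += 1  # pixel is foreground in both masks
            if p or t:
                union += 1  # pixel is foreground in at least one mask
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if union == 0 else inter / union


# Toy 3x3 frame: 1 = foreground, 0 = background (hypothetical data).
truth = [[0, 1, 1],
         [0, 1, 1],
         [0, 0, 0]]
pred = [[0, 1, 1],
        [0, 0, 1],
        [0, 0, 1]]

score = jaccard(pred, truth)  # 3 shared pixels out of 5 in the union
```

Benchmark scores are typically averaged over frames and objects and shown as percentages; this sketch covers a single frame and a single object.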

For leaderboards please refer to the different subtasks.

Papers

Showing 251-300 of 551 papers

Title | Status | Hype
Temporal Transductive Inference for Few-Shot Video Object Segmentation | Code | 0
BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames | Code | 0
Continuous Spatio-Temporal Memory Networks for 4D Cardiac Cine MRI Segmentation | Code | 0
Fast Interactive Video Object Segmentation with Graph Neural Networks | Code | 0
Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation | Code | 0
AGSS-VOS: Attention Guided Single-Shot Video Object Segmentation | Code | 0
Video Object Segmentation With Dynamic Memory Networks and Adaptive Object Alignment | Code | 0
MSN: Efficient Online Mask Selection Network for Video Instance Segmentation | Code | 0
ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts | Code | 0
CRVOS: Clue Refining Network for Video Object Segmentation | Code | 0
Efficient Video Object Segmentation via Network Modulation | Code | 0
ALBA : Reinforcement Learning for Video Object Segmentation | Code | 0
Adaptive Memory Management for Video Object Segmentation | Code | 0
Fast and Accurate Online Video Object Segmentation via Tracking Parts | Code | 0
A 3D Convolutional Approach to Spectral Object Segmentation in Space and Time | Code | 0
Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering | Code | 0
Box Supervised Video Segmentation Proposal Network | Code | 0
Revisiting Sequence-to-Sequence Video Object Segmentation with Multi-Task Loss and Skip-Memory | Code | 0
Video Object Segmentation with Re-identification | Code | 0
Curriculum Learning for Recurrent Video Object Segmentation | Code | 0
Reducing Annotation Burden: Exploiting Image Knowledge for Few-Shot Medical Video Object Segmentation via Spatiotemporal Consistency Relearning | Code | 0
Mask Selection and Propagation for Unsupervised Video Object Segmentation | Code | 0
MASSeg : 2nd Technical Report for 4th PVUW MOSE Track | Code | 0
Multi-Context Temporal Consistent Modeling for Referring Video Object Segmentation | Code | 0
LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training | Code | 0
Trusted Video Inpainting Localization via Deep Attentive Noise Learning | Code | 0
Learning Correspondence from the Cycle-Consistency of Time | Code | 0
Video State-Changing Object Segmentation | Code | 0
Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation | Code | 0
Two-Level Temporal Relation Model for Online Video Instance Segmentation | Code | 0
ViDSOD-100: A New Dataset and a Baseline Model for RGB-D Video Salient Object Detection | Code | 0
Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples | Code | 0
Discriminative Spatial-Semantic VOS Solution: 1st Place Solution for 6th LSVOS | Code | 0
READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation | Code | 0
Unified Mask Embedding and Correspondence Learning for Self-Supervised Video Segmentation | Code | 0
Annolid: Annotate, Segment, and Track Anything You Need | Code | 0
Revisiting Click-based Interactive Video Object Segmentation | Code | 0
Flexible visual prompts for in-context learning in computer vision | Code | 0
Few-Shot Referring Video Single- and Multi-Object Segmentation via Cross-Modal Affinity with Instance Sequence Matching | Code | 0
Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video | Code | 0
Unsupervised Online Video Object Segmentation with Motion Property Understanding | Code | 0
RANet: Ranking Attention Network for Fast Video Object Segmentation | Code | 0
Unsupervised Video Object Segmentation for Deep Reinforcement Learning | Code | 0
Fast Video Object Segmentation by Reference-Guided Mask Propagation | Code | 0
DMM-Net: Differentiable Mask-Matching Network for Video Object Segmentation | Code | 0
D3S -- A Discriminative Single Shot Segmentation Tracker | Code | 0
Biomedical SAM 2: Segment Anything in Biomedical Images and Videos | Code | 0
Wandering around: A bioinspired approach to visual attention through object motion sensitivity | Code | 0
Ground-truth or DAER: Selective Re-query of Secondary Information | Code | 0
Anchor Diffusion for Unsupervised Video Object Segmentation | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | AOC-MF (val) | F-Score | 94.7 | | Unverified
2 | ISVOS (BL30K, MS) | J&F | 93.4 | | Unverified
3 | XMem (BL30K, MS) | J&F | 93.3 | | Unverified
4 | BATMAN (val) | J&F | 92.5 | | Unverified
5 | STCN (val) | J&F | 91.6 | | Unverified
6 | XMem | J&F | 91.5 | | Unverified
7 | MobileVOS (val) | J&F | 91.4 | | Unverified
8 | AOT (val) | J&F | 91.1 | | Unverified
9 | LCM (val) | J&F | 90.7 | | Unverified
10 | RPCMVOS (val) | J&F | 90.6 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BLK30K, MS) | Mean Jaccard & F-Measure | 89.5 | | Unverified
2 | LCM | F-measure | 86.5 | | Unverified
3 | XMem | Mean Jaccard & F-Measure | 86.2 | | Unverified
4 | BATMAN | Mean Jaccard & F-Measure | 86.2 | | Unverified
5 | STCN | Mean Jaccard & F-Measure | 85.4 | | Unverified
6 | AOT | Mean Jaccard & F-Measure | 84.9 | | Unverified
7 | STM | F-measure | 84.3 | | Unverified
8 | TransVOS | Mean Jaccard & F-Measure | 83.9 | | Unverified
9 | RPCMVOS | Mean Jaccard & F-Measure | 83.7 | | Unverified
10 | RMN | Mean Jaccard & F-Measure | 83.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BL30K, MS) | Mean Jaccard & F-Measure | 86.9 | | Unverified
2 | AOT | Mean Jaccard & F-Measure | 84.1 | | Unverified
3 | RPCMVOS | Mean Jaccard & F-Measure | 84 | | Unverified
4 | STCN | Mean Jaccard & F-Measure | 83 | | Unverified
5 | CFBI+ | Mean Jaccard & F-Measure | 82.8 | | Unverified
6 | RMN | Jaccard (Seen) | 82.1 | | Unverified
7 | LCM | Mean Jaccard & F-Measure | 82 | | Unverified
8 | TransVOS | Mean Jaccard & F-Measure | 81.8 | | Unverified
9 | SST | Mean Jaccard & F-Measure | 81.7 | | Unverified
10 | LWL | Mean Jaccard & F-Measure | 81.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BL30K, MS) | Mean Jaccard & F-Measure | 83.7 | | Unverified
2 | XMem | Mean Jaccard & F-Measure | 81 | | Unverified
3 | BATMAN | Jaccard | 78.4 | | Unverified
4 | AOT | Jaccard | 75.9 | | Unverified
5 | RPCMVOS | Jaccard | 75.8 | | Unverified
6 | LCM | Jaccard | 74.4 | | Unverified
7 | KMN | Jaccard | 74.1 | | Unverified
8 | TransVOS | Jaccard | 73 | | Unverified
9 | STCN | Jaccard | 72.7 | | Unverified
10 | RMN | Jaccard | 71.9 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | XMem (BL30K,MS) | Mean Jaccard & F-Measure | 86.8 | | Unverified
2 | XMem | Mean Jaccard & F-Measure | 85.5 | | Unverified
3 | BATMAN | Mean Jaccard & F-Measure | 85 | | Unverified
4 | AOT | Mean Jaccard & F-Measure | 84.1 | | Unverified
5 | RPCMVOS | Mean Jaccard & F-Measure | 83.9 | | Unverified
6 | MobileVOS | Mean Jaccard & F-Measure | 83.3 | | Unverified
7 | STCN | Mean Jaccard & F-Measure | 82.7 | | Unverified
8 | CFBI+ | Mean Jaccard & F-Measure | 82.6 | | Unverified
9 | SST | Mean Jaccard & F-Measure | 81.8 | | Unverified
10 | CFBI | Mean Jaccard & F-Measure | 81 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | AOC-MF (val) | Jaccard (Mean) | 81.7 | | Unverified
2 | ViTAE-T-Stage | Jaccard (Mean) | 79.4 | | Unverified
3 | DINO (ViT-B/8, ImageNet retrain) | J&F | 71.4 | | Unverified
4 | VOSwL (Mask+Language) | mIoU | 59 | | Unverified
5 | UniTrack | mIoU | 58.4 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ReVOS | Average IOU | 75.6 | | Unverified
2 | Cutie-base | Average IOU | 74.6 | | Unverified
3 | XMem | Average IOU | 70.4 | | Unverified
4 | SAM 2 | Average IOU | 69.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DFNet | F-Score | 82.3 | | Unverified
2 | ours | Jaccard (Mean) | 76.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Ours | Average | 74.9 | | Unverified
2 | FEELVOS | mIoU | 0.82 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LOCATE | mIoU | 68.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Cutie | J&F | 68.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | LOCATE | mIoU | 79.9 | | Unverified