SOTAVerified

Video Instance Segmentation

The goal of video instance segmentation is the simultaneous detection, segmentation, and tracking of instances in videos. In other words, it is the first time that the image instance segmentation problem has been extended to the video domain.

To facilitate research on this new task, a large-scale benchmark called YouTube-VIS was built. It consists of 2,883 high-resolution YouTube videos, a 40-category label set, and 131k high-quality instance masks.
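
Because the task couples segmentation with tracking, a predicted instance is compared with a ground-truth instance over the whole video rather than frame by frame. The sketch below illustrates one common way to do this, a spatio-temporal IoU that accumulates intersection and union across frames. It is a minimal illustration under stated assumptions: masks are plain nested lists of 0/1 values, `None` marks a frame where the instance is absent, and the names `Mask`, `Track`, and `spatiotemporal_iou` are hypothetical, not taken from any benchmark toolkit.

```python
from typing import List, Optional

# Illustrative types (assumptions, not from any library):
# a mask is a list of rows of 0/1 values; None means "instance absent in this frame".
Mask = List[List[int]]
Track = List[Optional[Mask]]


def _area(mask: Optional[Mask]) -> int:
    """Number of foreground pixels; an absent mask has area 0."""
    if mask is None:
        return 0
    return sum(sum(row) for row in mask)


def _intersection(a: Optional[Mask], b: Optional[Mask]) -> int:
    """Number of pixels that are foreground in both masks."""
    if a is None or b is None:
        return 0
    return sum(x & y for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def spatiotemporal_iou(pred: Track, gt: Track) -> float:
    """IoU between two mask sequences, accumulated over all frames."""
    if len(pred) != len(gt):
        raise ValueError("tracks must cover the same number of frames")
    inter = sum(_intersection(p, g) for p, g in zip(pred, gt))
    union = sum(_area(p) + _area(g) for p, g in zip(pred, gt)) - inter
    return inter / union if union else 0.0


# Two-frame toy example on a 2x2 image.
gt_track: Track = [[[1, 1], [0, 0]], [[1, 0], [0, 0]]]
pred_track: Track = [[[1, 0], [0, 0]], None]  # prediction loses the object in frame 2
print(spatiotemporal_iou(pred_track, gt_track))
```

In the toy example the prediction is penalised both for the missing pixel in the first frame and for dropping the instance in the second, which is why a per-frame score alone would not capture tracking failures.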

Papers

Showing 101-148 of 148 papers

Title | Status | Hype
What is Point Supervision Worth in Video Instance Segmentation? | | 0
CML-MOTS: Collaborative Multi-task Learning for Multi-Object Tracking and Segmentation | | 0
Deep Learning Techniques for Video Instance Segmentation: A Survey | | 0
TCOVIS: Temporally Consistent Online Video Instance Segmentation | Code | 0
NOVIS: A Case for End-to-End Near-Online Video Instance Segmentation | | 0
1st Place Solution for CVPR2023 BURST Long Tail and Open World Challenges | | 0
3rd Place Solution for PVUW Challenge 2023: Video Panoptic Segmentation | | 0
RefineVIS: Video Instance Segmentation with Temporal Attention Refinement | | 0
GRAtt-VIS: Gated Residual Attention for Auto Rectifying Video Instance Segmentation | Code | 0
Video Instance Segmentation in an Open-World | Code | 0
MobileInst: Video Instance Segmentation on the Mobile | | 0
Offline-to-Online Knowledge Distillation for Video Instance Segmentation | | 0
Maximal Cliques on Multi-Frame Proposal Graph for Unsupervised Video Object Segmentation | | 0
Towards Robust Video Instance Segmentation with Temporal-Aware Transformer | | 0
InsPro: Propagating Instance Query and Proposal for Online Video Instance Segmentation | | 0
Object Segmentation with Audio Context | | 0
The Runner-up Solution for YouTube-VIS Long Video Challenge 2022 | | 0
Robust Online Video Instance Segmentation with Track Queries | Code | 0
Quantifying and Learning Static vs. Dynamic Information in Deep Spatiotemporal Networks | | 0
Two-Level Temporal Relation Model for Online Video Instance Segmentation | Code | 0
Online Video Instance Segmentation via Robust Context Fusion | | 0
Consistent Video Instance Segmentation with Inter-Frame Recurrent Attention | | 0
Tag-Based Attention Guided Bottom-Up Approach for Video Instance Segmentation | | 0
Less than Few: Self-Shot Video Instance Segmentation | | 0
Human Instance Segmentation and Tracking via Data Association and Single-stage Detector | | 0
Deformable VisTR: Spatio temporal deformable attention for video instance segmentation | Code | 0
One-stage Video Instance Segmentation: From Frame-in Frame-out to Clip-in Clip-out | Code | 0
End-to-end video instance segmentation via spatial-temporal graph neural networks | Code | 0
Efficient Video Instance Segmentation via Tracklet Query and Proposal | | 0
Efficient Video Segmentation Models with Per-frame Inference | | 0
STC: Spatio-Temporal Contrastive Learning for Video Instance Segmentation | | 0
A Graph Matching Perspective With Transformers on Video Instance Segmentation | | 0
Hybrid Instance-aware Temporal Fusion for Online Video Instance Segmentation | | 0
Object Propagation via Inter-Frame Attentions for Temporally Stable Video Instance Segmentation | Code | 0
Occluded Video Instance Segmentation: Dataset and ICCV 2021 Challenge | | 0
Video Instance Segmentation by Instance Flow Assembly | | 0
Temporal RoI Align for Video Object Recognition | | 0
False Negative Reduction in Video Instance Segmentation using Uncertainty Estimates | Code | 0
MSN: Efficient Online Mask Selection Network for Video Instance Segmentation | Code | 0
1st Place Solution for YouTubeVOS Challenge 2021:Video Instance Segmentation | | 0
Contextual Guided Segmentation Framework for Semi-supervised Video Instance Segmentation | | 0
A Reinforcement-Learning-Based Energy-Efficient Framework for Multi-Task Video Analytics Pipeline | | 0
Improving Video Instance Segmentation by Light-weight Temporal Uncertainty Estimates | Code | 0
Learning Video Instance Segmentation with Recurrent Graph Neural Networks | | 0
Video Instance Segmentation Tracking With a Modified VAE Architecture | | 0
Learning a Spatio-Temporal Embedding for Video Instance Segmentation | Code | 0
Classifying, Segmenting, and Tracking Object Instances in Video with Mask Propagation | | 0
Efficient Video Object Segmentation via Network Modulation | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | DVIS-DAQ(VIT-L, Offline) | mask AP | 57.1 | | Unverified
2 | CAVIS(VIT-L, Offline) | mask AP | 57.1 | | Unverified
3 | DVIS++(VIT-L,Offline) | mask AP | 53.4 | | Unverified
4 | GLEE-Pro | mask AP | 50.4 | | Unverified
5 | DVIS(Swin-L, Offline) | mask AP | 49.9 | | Unverified
6 | DVIS++(VIT-L, Online) | mask AP | 49.6 | | Unverified
7 | UNINEXT (ViT-H, Online) | mask AP | 49 | | Unverified
8 | DVIS(Swin-L, Online) | mask AP | 47.1 | | Unverified
9 | CTVIS (Swin-L) | mask AP | 46.9 | | Unverified
10 | RefineVIS (Swin-L, offline) | mask AP | 46 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CAVIS(ViT-L, Online) | mask AP | 68.9 | | Unverified
2 | DVIS++(ViT-L, Online) | mask AP | 67.7 | | Unverified
3 | DVIS | mask AP | 64.9 | | Unverified
4 | Tube-Link | mask AP | 64.6 | | Unverified
5 | MinVIS (Swin-L) | mask AP | 61.6 | | Unverified
6 | Mask2Former (Swin-L) | mask AP | 60.4 | | Unverified
7 | UniVS(Swin-L) | mask AP | 60 | | Unverified
8 | MDQE(Swin-L) | mask AP | 59.9 | | Unverified
9 | SeqFormer (Swin-L) | mask AP | 59.3 | | Unverified
10 | DeVIS (Swin-L) | mask AP | 57.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | CAVIS(VIT-L, Offline) | mask AP | 65.3 | | Unverified
2 | DVIS-DAQ(VIT-L, Offline) | mask AP | 64.5 | | Unverified
3 | DVIS++(VIT-L, Offline) | mask AP | 63.9 | | Unverified
4 | DVIS++(VIT-L, Online) | mask AP | 62.3 | | Unverified
5 | RefineVIS (Swin-L, online) | mask AP | 61.4 | | Unverified
6 | GRAtt-VIS (Swin-L) | mask AP | 60.3 | | Unverified
7 | TarViS (Swin-L) | mask AP | 60.2 | | Unverified
8 | GenVIS (Swin-L) | mask AP | 60.1 | | Unverified
9 | DVIS(Swin-L) | mask AP | 60.1 | | Unverified
10 | NOVIS (Swin-L) | mask AP | 59.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | DVIS++(VIT-L) | mAP_L | 50.9 | | Unverified
2 | CAVIS (VIT-L) | mAP_L | 48.6 | | Unverified
3 | CTVIS (Swin-L) | mAP_L | 46.4 | | Unverified
4 | DVIS(Swin-L) | mAP_L | 45.9 | | Unverified
5 | CTVIS (ResNet-50) | mAP_L | 39.4 | | Unverified
6 | InstanceFormer (Swin) | mAP_L | 26.3 | | Unverified
7 | InstanceFormer (Resnet-50) | mAP_L | 24.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | PCAN | mMOTSA | 27.4 | | Unverified
2 | QDTrack-mots-fix | mMOTSA | 23.5 | | Unverified
3 | QDTrack-mots | mMOTSA | 22.5 | | Unverified
4 | MaskTrackRCNN | mMOTSA | 12.3 | | Unverified
5 | STEm-Seg | mMOTSA | 12.2 | | Unverified
6 | SortIoU | mMOTSA | 10.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VMT (Swin-L) | Tube-Boundary AP | 44.8 | | Unverified
2 | SeqFormer (Swin-L) | Tube-Boundary AP | 43.3 | | Unverified
3 | VMT (R101) | Tube-Boundary AP | 32.5 | | Unverified
4 | VMT (R50) | Tube-Boundary AP | 30.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | Temporal ROI Align | mask AP | 38 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MaskFreeVIS | AP | 55.3 | | Unverified