| Learning the What and How of Annotation in Video Object Segmentation | Nov 8, 2023 | SegmentationSemantic Segmentation | —Unverified | 0 |
| ISAR: A Benchmark for Single- and Few-Shot Object Instance Segmentation and Re-Identification | Nov 5, 2023 | Instance SegmentationMulti-Object Tracking | —Unverified | 0 |
| SpVOS: Efficient Video Object Segmentation with Triple Sparse Convolution | Oct 23, 2023 | ObjectSemantic Segmentation | —Unverified | 0 |
| Putting the Object Back into Video Object Segmentation | Oct 19, 2023 | ObjectSegmentation | CodeCode Available | 3 |
| Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models | Oct 10, 2023 | ObjectObject Tracking | —Unverified | 0 |
| Sub-token ViT Embedding via Stochastic Resonance Transformers | Oct 6, 2023 | Depth EstimationDepth Prediction | CodeCode Available | 0 |
| CoralVOS: Dataset and Benchmark for Coral Video Segmentation | Oct 3, 2023 | SegmentationSemantic Segmentation | —Unverified | 0 |
| Memory-Efficient Continual Learning Object Segmentation for Long Video | Sep 26, 2023 | Continual LearningObject | —Unverified | 0 |
| Treating Motion as Option with Output Selection for Unsupervised Video Object Segmentation | Sep 26, 2023 | ObjectOptical Flow Estimation | CodeCode Available | 1 |
| Adversarial Attacks on Video Object Segmentation with Hard Region Discovery | Sep 25, 2023 | Autonomous DrivingObject | —Unverified | 0 |
| Efficient Long-Short Temporal Attention Network for Unsupervised Video Object Segmentation | Sep 21, 2023 | Semantic SegmentationUnsupervised Video Object Segmentation | —Unverified | 0 |
| PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation | Sep 21, 2023 | Autonomous DrivingSegmentation | CodeCode Available | 1 |
| Fully Transformer-Equipped Architecture for End-to-End Referring Video Object Segmentation | Sep 21, 2023 | ObjectReferring Video Object Segmentation | —Unverified | 0 |
| Multi-grained Temporal Prototype Learning for Few-shot Video Object Segmentation | Sep 20, 2023 | Image SegmentationSegmentation | CodeCode Available | 0 |
| Temporal Collection and Distribution for Referring Video Object Segmentation | Sep 7, 2023 | ObjectReferring Video Object Segmentation | —Unverified | 0 |
| Tracking Anything with Decoupled Video Segmentation | Sep 7, 2023 | Open-Vocabulary Video SegmentationOpen-World Video Segmentation | CodeCode Available | 3 |
| Robust Visual Tracking by Motion Analyzing | Sep 6, 2023 | Object TrackingSegmentation | —Unverified | 0 |
| Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples | Sep 5, 2023 | Referring Video Object SegmentationSemantic Segmentation | CodeCode Available | 0 |
| Joint Modeling of Feature, Correspondence, and a Compressed Memory for Video Object Segmentation | Aug 25, 2023 | Semantic SegmentationVideo Object Segmentation | —Unverified | 0 |
| Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation | Aug 25, 2023 | ObjectObject Tracking | CodeCode Available | 1 |
| LOCATE: Self-supervised Object Discovery via Flow-guided Graph-cut and Bootstrapped Self-training | Aug 22, 2023 | ObjectObject Discovery | CodeCode Available | 0 |
| Scalable Video Object Segmentation with Simplified Framework | Aug 19, 2023 | ObjectSemantic Segmentation | —Unverified | 0 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Aug 16, 2023 | Motion Expressions Guided Video SegmentationObject | CodeCode Available | 2 |
| Isomer: Isomerous Transformer for Zero-shot Video Object Segmentation | Aug 13, 2023 | Semantic SegmentationVideo Object Segmentation | CodeCode Available | 1 |
| Expression Prompt Collaboration Transformer for Universal Referring Video Object Segmentation | Aug 8, 2023 | Contrastive LearningObject | CodeCode Available | 0 |
| Learning Referring Video Object Segmentation from Weak Annotation | Aug 4, 2023 | Contrastive LearningObject | —Unverified | 0 |
| XMem++: Production-level Video Segmentation From Few Annotated Frames | Jul 29, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| Tracking Anything in High Quality | Jul 26, 2023 | ObjectObject Tracking | CodeCode Available | 2 |
| Spectrum-guided Multi-granularity Referring Video Object Segmentation | Jul 25, 2023 | ObjectReferring Expression Segmentation | CodeCode Available | 1 |
| OnlineRefer: A Simple Online Baseline for Referring Video Object Segmentation | Jul 18, 2023 | Referring Expression SegmentationReferring Video Object Segmentation | CodeCode Available | 1 |
| Hierarchical Spatiotemporal Transformers for Video Object Segmentation | Jul 17, 2023 | Inductive BiasObject | —Unverified | 0 |
| Holistic Prototype Attention Network for Few-Shot VOS | Jul 16, 2023 | Graph AttentionSemantic Segmentation | CodeCode Available | 0 |
| Multiscale Memory Comparator Transformer for Few-Shot Video Segmentation | Jul 15, 2023 | DecoderSegmentation | CodeCode Available | 0 |
| FODVid: Flow-guided Object Discovery in Videos | Jul 10, 2023 | ObjectObject Discovery | —Unverified | 0 |
| ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: TREK-150 Single Object Tracking | Jul 5, 2023 | ObjectObject Tracking | —Unverified | 0 |
| ZJU ReLER Submission for EPIC-KITCHEN Challenge 2023: Semi-Supervised Video Object Segmentation | Jul 5, 2023 | ObjectPosition | —Unverified | 0 |
| RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Object Segmentation | Jul 3, 2023 | Image SegmentationReferring Expression | CodeCode Available | 1 |
| Segment Anything Meets Point Tracking | Jul 3, 2023 | Interactive Video Object SegmentationObject | CodeCode Available | 3 |
| Bidirectional Correlation-Driven Inter-Frame Interaction Transformer for Referring Video Object Segmentation | Jul 2, 2023 | ObjectReferring Video Object Segmentation | —Unverified | 0 |
| TrickVOS: A Bag of Tricks for Video Object Segmentation | Jun 27, 2023 | DecoderObject | —Unverified | 0 |
| Online Unsupervised Video Object Segmentation via Contrastive Motion Clustering | Jun 21, 2023 | ClusteringContrastive Learning | CodeCode Available | 0 |
| LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation | Jun 14, 2023 | Referring Expression SegmentationReferring Video Object Segmentation | CodeCode Available | 1 |
| SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation | May 26, 2023 | cross-modal alignmentObject | CodeCode Available | 1 |
| Referred by Multi-Modality: A Unified Temporal Transformer for Video Object Segmentation | May 25, 2023 | ObjectReferring Expression Segmentation | CodeCode Available | 1 |
| AutoDepthNet: High Frame Rate Depth Map Reconstruction using Commodity Depth and RGB Cameras | May 24, 2023 | Depth EstimationGPU | —Unverified | 0 |
| Siamese Masked Autoencoders | May 23, 2023 | Data AugmentationDecoder | —Unverified | 0 |
| UVOSAM: A Mask-free Paradigm for Unsupervised Video Object Segmentation via Segment Anything Model | May 22, 2023 | Image SegmentationObject | CodeCode Available | 1 |
| READMem: Robust Embedding Association for a Diverse Memory in Unconstrained Video Object Segmentation | May 22, 2023 | Semantic SegmentationSemi-Supervised Video Object Segmentation | CodeCode Available | 0 |
| Video Object Segmentation in Panoptic Wild Scenes | May 8, 2023 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| Personalize Segment Anything Model with One Shot | May 4, 2023 | Image Generationmodel | CodeCode Available | 3 |