| ViLLa: Video Reasoning Segmentation with Large Language Model | Jul 18, 2024 | Image Segmentation, Language Modeling | Code Available | 1 |
| General and Task-Oriented Video Segmentation | Jul 9, 2024 | Disentanglement, Diversity | Code Available | 1 |
| Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation | Jul 1, 2024 | Autonomous Driving, Decoder | Code Available | 1 |
| SALI: Short-term Alignment and Long-term Interaction Network for Colonoscopy Video Polyp Segmentation | Jun 19, 2024 | Segmentation, Video Polyp Segmentation | Code Available | 1 |
| 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Jun 11, 2024 | Referring Video Object Segmentation, Segmentation | Code Available | 1 |
| Temporally Consistent Referring Video Object Segmentation with Hybrid Memory | Mar 28, 2024 | HTR, Object | Code Available | 1 |
| We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline | Feb 1, 2024 | Benchmarking, Domain Adaptation | Code Available | 1 |
| DVIS++: Improved Decoupled Framework for Universal Video Segmentation | Dec 20, 2023 | Contrastive Learning, Denoising | Code Available | 1 |
| AutoVisual Fusion Suite: A Comprehensive Evaluation of Image Segmentation and Voice Conversion Tools on HuggingFace Platform | Dec 17, 2023 | Image Segmentation, Segmentation | Code Available | 1 |
| A Simple Video Segmenter by Tracking Objects Along Axial Trajectories | Nov 30, 2023 | GPU, Object | Code Available | 1 |
| Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation | Nov 29, 2023 | Clustering, Object | Code Available | 1 |
| Concatenated Masked Autoencoders as Spatial-Temporal Learner | Nov 2, 2023 | Action Recognition, Data Augmentation | Code Available | 1 |
| MediViSTA: Medical Video Segmentation via Temporal Fusion SAM Adaptation for Echocardiography | Sep 24, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 1 |
| PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation | Sep 21, 2023 | Autonomous Driving, Segmentation | Code Available | 1 |
| GraphEcho: Graph-Driven Unsupervised Domain Adaptation for Echocardiogram Video Segmentation | Sep 20, 2023 | Domain Adaptation, Graph Matching | Code Available | 1 |
| CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation | Sep 18, 2023 | Video Segmentation, Video Semantic Segmentation | Code Available | 1 |
| Stochastic positional embeddings improve masked image modeling | Jul 31, 2023 | Language Modelling, Masked Language Modeling | Code Available | 1 |
| Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation | Mar 22, 2023 | Contrastive Learning, Segmentation | Code Available | 1 |
| Global Knowledge Calibration for Fast Open-Vocabulary Segmentation | Mar 16, 2023 | Knowledge Distillation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| Video-SwinUNet: Spatio-temporal Deep Learning Framework for VFSS Instance Segmentation | Feb 22, 2023 | Decoder, Image Segmentation | Code Available | 1 |
| PolyFormer: Referring Image Segmentation as Sequential Polygon Generation | Feb 14, 2023 | Decoder, Image Segmentation | Code Available | 1 |
| TarViS: A Unified Approach for Target-based Video Segmentation | Jan 6, 2023 | Instance Segmentation, Panoptic Segmentation | Code Available | 1 |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Jan 1, 2023 | Instance Segmentation, Multi-Object Tracking | Code Available | 1 |
| EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations | Sep 26, 2022 | Object, Segmentation | Code Available | 1 |
| Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward | Sep 25, 2022 | Decoder, Video Editing | Code Available | 1 |
| Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations | Jul 18, 2022 | object-detection, Object Detection | Code Available | 1 |
| Domain Adaptive Video Segmentation via Temporal Pseudo Supervision | Jul 6, 2022 | Segmentation, Semantic Segmentation | Code Available | 1 |
| Segmenting Moving Objects via an Object-Centric Layered Representation | Jul 5, 2022 | Instance Segmentation, Motion Segmentation | Code Available | 1 |
| Towards Robust Video Object Segmentation with Adaptive Object Calibration | Jul 2, 2022 | Object, Segmentation | Code Available | 1 |
| Differentiable Soft-Masked Attention | Jun 1, 2022 | Object, Segmentation | Code Available | 1 |
| Video K-Net: A Simple, Strong, and Unified Baseline for Video Segmentation | Apr 10, 2022 | Image Segmentation, Instance Segmentation | Code Available | 1 |
| Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation | Apr 6, 2022 | Optical Flow Estimation, Referring Expression Segmentation | Code Available | 1 |
| In-N-Out Generative Learning for Dense Unsupervised Video Segmentation | Mar 29, 2022 | Contrastive Learning, Semantic Segmentation | Code Available | 1 |
| Local-Global Context Aware Transformer for Language-Guided Video Segmentation | Mar 18, 2022 | Referring Expression Segmentation, Referring Video Object Segmentation | Code Available | 1 |
| RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation | Mar 8, 2022 | Classification, Instance Segmentation | Code Available | 1 |
| D2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos | Nov 15, 2021 | Multi-Object Tracking and Segmentation, Segmentation | Code Available | 1 |
| D^2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos | Nov 15, 2021 | Segmentation, Semantic Segmentation | Code Available | 1 |
| Dense Unsupervised Learning for Video Segmentation | Nov 11, 2021 | Segmentation, Semantic Segmentation | Code Available | 1 |
| AuxAdapt: Stable and Efficient Test-Time Adaptation for Temporally Consistent Video Semantic Segmentation | Oct 24, 2021 | Optical Flow Estimation, Segmentation | Code Available | 1 |
| A Survey on Deep Learning Technique for Video Segmentation | Jul 2, 2021 | Autonomous Driving, Deep Learning | Code Available | 1 |
| Coarse to Fine Multi-Resolution Temporal Convolutional Network | May 23, 2021 | Action Segmentation, Decoder | Code Available | 1 |
| Cross-Modal Progressive Comprehension for Referring Segmentation | May 15, 2021 | Attribute, Image Segmentation | Code Available | 1 |
| Flow-based Video Segmentation for Human Head and Shoulders | Apr 20, 2021 | Decoder, Image Matting | Code Available | 1 |
| Generic Event Boundary Detection: A Benchmark for Event Segmentation | Jan 26, 2021 | Action Detection, Boundary Detection | Code Available | 1 |
| Making a Case for 3D Convolutions for Object Segmentation in Videos | Aug 26, 2020 | Decoder, Segmentation | Code Available | 1 |
| Robust Semantic Segmentation in Adverse Weather Conditions by means of Fast Video-Sequence Segmentation | Jul 1, 2020 | Image Segmentation, Segmentation | Code Available | 1 |
| Video Panoptic Segmentation | Jun 19, 2020 | Instance Segmentation, Panoptic Segmentation | Code Available | 1 |
| Video Semantic Segmentation with Distortion-Aware Feature Correction | Jun 18, 2020 | Image Segmentation, Optical Flow Estimation | Code Available | 1 |
| Real-Time Video Inference on Edge Devices via Adaptive Model Streaming | Jun 11, 2020 | Knowledge Distillation, Semantic Segmentation | Code Available | 1 |
| Temporal Aggregate Representations for Long-Range Video Understanding | Jun 1, 2020 | Action Anticipation, Action Recognition | Code Available | 1 |