| ViLLa: Video Reasoning Segmentation with Large Language Model | Jul 18, 2024 | Image Segmentation, Language Modeling | Code Available | 1 |
| General and Task-Oriented Video Segmentation | Jul 9, 2024 | Disentanglement, Diversity | Code Available | 1 |
| Uni-DVPS: Unified Model for Depth-Aware Video Panoptic Segmentation | Jul 1, 2024 | Autonomous Driving, Decoder | Code Available | 1 |
| SALI: Short-term Alignment and Long-term Interaction Network for Colonoscopy Video Polyp Segmentation | Jun 19, 2024 | Segmentation, Video Polyp Segmentation | Code Available | 1 |
| 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Jun 11, 2024 | Referring Video Object Segmentation, Segmentation | Code Available | 1 |
| Temporally Consistent Referring Video Object Segmentation with Hybrid Memory | Mar 28, 2024 | HTR, Object | Code Available | 1 |
| We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation Baseline | Feb 1, 2024 | Benchmarking, Domain Adaptation | Code Available | 1 |
| DVIS++: Improved Decoupled Framework for Universal Video Segmentation | Dec 20, 2023 | Contrastive Learning, Denoising | Code Available | 1 |
| AutoVisual Fusion Suite: A Comprehensive Evaluation of Image Segmentation and Voice Conversion Tools on HuggingFace Platform | Dec 17, 2023 | Image Segmentation, Segmentation | Code Available | 1 |
| A Simple Video Segmenter by Tracking Objects Along Axial Trajectories | Nov 30, 2023 | GPU, Object | Code Available | 1 |
| Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation | Nov 29, 2023 | Clustering, Object | Code Available | 1 |
| Concatenated Masked Autoencoders as Spatial-Temporal Learner | Nov 2, 2023 | Action Recognition, Data Augmentation | Code Available | 1 |
| MediViSTA: Medical Video Segmentation via Temporal Fusion SAM Adaptation for Echocardiography | Sep 24, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 1 |
| PanoVOS: Bridging Non-panoramic and Panoramic Views with Transformer for Video Segmentation | Sep 21, 2023 | Autonomous Driving, Segmentation | Code Available | 1 |
| GraphEcho: Graph-Driven Unsupervised Domain Adaptation for Echocardiogram Video Segmentation | Sep 20, 2023 | Domain Adaptation, Graph Matching | Code Available | 1 |
| CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation | Sep 18, 2023 | Video Segmentation, Video Semantic Segmentation | Code Available | 1 |
| Stochastic positional embeddings improve masked image modeling | Jul 31, 2023 | Language Modelling, Masked Language Modeling | Code Available | 1 |
| Tube-Link: A Flexible Cross Tube Framework for Universal Video Segmentation | Mar 22, 2023 | Contrastive Learning, Segmentation | Code Available | 1 |
| Global Knowledge Calibration for Fast Open-Vocabulary Segmentation | Mar 16, 2023 | Knowledge Distillation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| Video-SwinUNet: Spatio-temporal Deep Learning Framework for VFSS Instance Segmentation | Feb 22, 2023 | Decoder, Image Segmentation | Code Available | 1 |
| PolyFormer: Referring Image Segmentation as Sequential Polygon Generation | Feb 14, 2023 | Decoder, Image Segmentation | Code Available | 1 |
| TarViS: A Unified Approach for Target-based Video Segmentation | Jan 6, 2023 | Instance Segmentation, Panoptic Segmentation | Code Available | 1 |
| Context-Aware Relative Object Queries To Unify Video Instance and Panoptic Segmentation | Jan 1, 2023 | Instance Segmentation, Multi-Object Tracking | Code Available | 1 |
| EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations | Sep 26, 2022 | Object, Segmentation | Code Available | 1 |
| Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward | Sep 25, 2022 | Decoder, Video Editing | Code Available | 1 |