| CATR: Combinatorial-Dependence Audio-Queried Transformer for Audio-Visual Video Segmentation | Sep 18, 2023 | Video Segmentation, Video Semantic Segmentation | Code Available | 1 |
| Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation | Apr 6, 2022 | Optical Flow Estimation, Referring Expression Segmentation | Code Available | 1 |
| Multi-Granularity Video Object Segmentation | Dec 2, 2024 | Object, Segmentation | Code Available | 1 |
| Multi-modal Segment Assemblage Network for Ad Video Editing with Importance-Coherence Reward | Sep 25, 2022 | Decoder, Video Editing | Code Available | 1 |
| BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports | Feb 28, 2025 | Action Recognition, Line Detection | Code Available | 1 |
| Dense Unsupervised Learning for Video Segmentation | Nov 11, 2021 | Segmentation, Semantic Segmentation | Code Available | 1 |
| RankSeg: Adaptive Pixel Classification with Image Category Ranking for Segmentation | Mar 8, 2022 | Classification, Instance Segmentation | Code Available | 1 |
| Physarum Powered Differentiable Linear Programming Layers and Applications | Apr 30, 2020 | Few-Shot Learning, Meta-Learning | Code Available | 1 |
| Robust Semantic Segmentation in Adverse Weather Conditions by means of Fast Video-Sequence Segmentation | Jul 1, 2020 | Image Segmentation, Segmentation | Code Available | 1 |
| SASVi - Segment Any Surgical Video | Feb 12, 2025 | Segmentation, Video Segmentation | Code Available | 1 |
| Decoupled Seg Tokens Make Stronger Reasoning Video Segmenter and Grounder | Jun 28, 2025 | Image Segmentation, Large Language Model | Code Available | 1 |
| A Survey on Deep Learning Technique for Video Segmentation | Jul 2, 2021 | Autonomous Driving, Deep Learning | Code Available | 1 |
| A Simple Video Segmenter by Tracking Objects Along Axial Trajectories | Nov 30, 2023 | GPU, Object | Code Available | 1 |
| Local-Global Context Aware Transformer for Language-Guided Video Segmentation | Mar 18, 2022 | Referring Expression Segmentation, Referring Video Object Segmentation | Code Available | 1 |
| Making a Case for 3D Convolutions for Object Segmentation in Videos | Aug 26, 2020 | Decoder, Segmentation | Code Available | 1 |
| MediViSTA: Medical Video Segmentation via Temporal Fusion SAM Adaptation for Echocardiography | Sep 24, 2023 | Image Segmentation, Medical Image Segmentation | Code Available | 1 |
| Betrayed by Attention: A Simple yet Effective Approach for Self-supervised Video Object Segmentation | Nov 29, 2023 | Clustering, Object | Code Available | 1 |
| AuxAdapt: Stable and Efficient Test-Time Adaptation for Temporally Consistent Video Semantic Segmentation | Oct 24, 2021 | Optical Flow Estimation, Segmentation | Code Available | 1 |
| Adversarial Pixel Restoration as a Pretext Task for Transferable Perturbations | Jul 18, 2022 | object-detection, Object Detection | Code Available | 1 |
| AutoVisual Fusion Suite: A Comprehensive Evaluation of Image Segmentation and Voice Conversion Tools on HuggingFace Platform | Dec 17, 2023 | Image Segmentation, Segmentation | Code Available | 1 |
| GraphEcho: Graph-Driven Unsupervised Domain Adaptation for Echocardiogram Video Segmentation | Sep 20, 2023 | Domain Adaptation, Graph Matching | Code Available | 1 |
| Generic Event Boundary Detection: A Benchmark for Event Segmentation | Jan 26, 2021 | Action Detection, Boundary Detection | Code Available | 1 |
| DC-SAM: In-Context Segment Anything in Images and Videos via Dual Consistency | Apr 16, 2025 | Few-Shot Learning, Interactive Segmentation | Code Available | 1 |
| Global Knowledge Calibration for Fast Open-Vocabulary Segmentation | Mar 16, 2023 | Knowledge Distillation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| D2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos | Nov 15, 2021 | Multi-Object Tracking and Segmentation, Segmentation | Code Available | 1 |