| InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models | Dec 18, 2024 | Reasoning SegmentationSegmentation | CodeCode Available | 2 | 5 |
| DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries | Mar 29, 2024 | ObjectVideo Instance Segmentation | CodeCode Available | 2 | 5 |
| Det-SAM2:Technical Report on the Self-Prompting Segmentation Framework Based on Segment Anything Model 2 | Nov 28, 2024 | Video SegmentationVideo Semantic Segmentation | CodeCode Available | 2 | 5 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 | 5 |
| Surgical SAM 2: Real-time Segment Anything in Surgical Video by Efficient Frame Pruning | Aug 15, 2024 | SegmentationVideo Segmentation | CodeCode Available | 2 | 5 |
| TSPNet: Hierarchical Feature Learning via Temporal Semantic Pyramid for Sign Language Translation | Oct 12, 2020 | Sign Language RecognitionSign Language Translation | CodeCode Available | 2 | 5 |
| XMem++: Production-level Video Segmentation From Few Annotated Frames | Jul 29, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 | 5 |
| Self-Prompting Polyp Segmentation in Colonoscopy using Hybrid Yolo-SAM 2 Model | Sep 14, 2024 | Medical Image SegmentationPolyp Segmentation | CodeCode Available | 2 | 5 |
| Simplifying Object Segmentation with PixelLib Library | Jan 20, 2021 | Image ClassificationInstance Segmentation | CodeCode Available | 2 | 5 |
| MeViS: A Large-scale Benchmark for Video Segmentation with Motion Expressions | Aug 16, 2023 | Motion Expressions Guided Video SegmentationObject | CodeCode Available | 2 | 5 |
| MemSAM: Taming Segment Anything Model for Echocardiography Video Segmentation | Jan 1, 2024 | SegmentationVideo Segmentation | CodeCode Available | 2 | 5 |
| Holmes-VAU: Towards Long-term Video Anomaly Understanding at Any Granularity | Dec 9, 2024 | Anomaly Detectiontext annotation | CodeCode Available | 2 | 5 |
| HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver | Jan 1, 2025 | Reasoning SegmentationSegmentation | CodeCode Available | 2 | 5 |
| A Survey on Deep Learning Technique for Video Segmentation | Jul 2, 2021 | Autonomous DrivingDeep Learning | CodeCode Available | 1 | 5 |
| Local-Global Context Aware Transformer for Language-Guided Video Segmentation | Mar 18, 2022 | Referring Expression SegmentationReferring Video Object Segmentation | CodeCode Available | 1 | 5 |
| Making a Case for 3D Convolutions for Object Segmentation in Videos | Aug 26, 2020 | DecoderSegmentation | CodeCode Available | 1 | 5 |
| EPIC-KITCHENS VISOR Benchmark: VIdeo Segmentations and Object Relations | Sep 26, 2022 | ObjectSegmentation | CodeCode Available | 1 | 5 |
| In-N-Out Generative Learning for Dense Unsupervised Video Segmentation | Mar 29, 2022 | Contrastive LearningSemantic Segmentation | CodeCode Available | 1 | 5 |
| Efficient Semantic Video Segmentation with Per-frame Inference | Feb 26, 2020 | Knowledge DistillationOptical Flow Estimation | CodeCode Available | 1 | 5 |
| 1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation | Jun 11, 2024 | Referring Video Object SegmentationSegmentation | CodeCode Available | 1 | 5 |
| BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports | Feb 28, 2025 | Action RecognitionLine Detection | CodeCode Available | 1 | 5 |
| A Simple Video Segmenter by Tracking Objects Along Axial Trajectories | Nov 30, 2023 | GPUObject | CodeCode Available | 1 | 5 |
| Differentiable Soft-Masked Attention | Jun 1, 2022 | ObjectSegmentation | CodeCode Available | 1 | 5 |
| CamSAM2: Segment Anything Accurately in Camouflaged Videos | Mar 25, 2025 | Camouflaged Object SegmentationObject | CodeCode Available | 1 | 5 |
| Global Knowledge Calibration for Fast Open-Vocabulary Segmentation | Mar 16, 2023 | Knowledge DistillationOpen Vocabulary Semantic Segmentation | CodeCode Available | 1 | 5 |