| Open-World Skill Discovery from Unsegmented Demonstrations | Mar 11, 2025 | Boundary Detection, Event Segmentation | Unverified | 0 |
| OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation | Mar 10, 2025 | Pseudo Label, Semantic Segmentation | Unverified | 0 |
| Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching | Mar 5, 2025 | Data Augmentation, Few-Shot Learning | Unverified | 0 |
| Parameter-free Video Segmentation for Vision and Language Understanding | Mar 3, 2025 | Question Answering, Video Question Answering | Unverified | 0 |
| BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports | Feb 28, 2025 | Action Recognition, Line Detection | Code Available | 1 |
| An Analysis of Data Transformation Effects on Segment Anything 2 | Feb 25, 2025 | Semantic Segmentation, Video Object Segmentation | Unverified | 0 |
| Deep learning approaches to surgical video segmentation and object detection: A Scoping Review | Feb 23, 2025 | object-detection, Object Detection | Unverified | 0 |
| Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field | Feb 22, 2025 | 2D Panoptic Segmentation, 3D Scene Reconstruction | Unverified | 0 |
| Role of the Pretraining and the Adaptation data sizes for low-resource real-time MRI video segmentation | Feb 20, 2025 | Video Segmentation, Video Semantic Segmentation | Unverified | 0 |
| SASVi - Segment Any Surgical Video | Feb 12, 2025 | Segmentation, Video Segmentation | Code Available | 1 |
| Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors | Jan 27, 2025 | Image Matting, Video Segmentation | Unverified | 0 |
| MPG-SAM 2: Adapting SAM 2 with Mask Priors and Global Context for Referring Video Object Segmentation | Jan 23, 2025 | Referring Expression Segmentation, Referring Video Object Segmentation | Code Available | 1 |
| Efficient Frame Extraction: A Novel Approach Through Frame Similarity and Surgical Tool Tracking for Video Segmentation | Jan 19, 2025 | Video Segmentation, Video Semantic Segmentation | Code Available | 0 |
| Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks | Jan 17, 2025 | Few-Shot Semantic Segmentation, Segmentation | Code Available | 1 |
| EdgeTAM: On-Device Track Anything Model | Jan 13, 2025 | model, Video Segmentation | Code Available | 4 |
| VidChain: Chain-of-Tasks with Metric-based Direct Preference Optimization for Dense Video Captioning | Jan 12, 2025 | Dense Video Captioning, Video Captioning | Code Available | 1 |
| Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation | Jan 12, 2025 | Image Retrieval, Image Segmentation | Unverified | 0 |
| Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos | Jan 7, 2025 | 2k, Language Modeling | Code Available | 5 |
| Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy | Jan 6, 2025 | Video Segmentation, Video Semantic Segmentation | Code Available | 0 |
| EntitySAM: Segment Everything in Video | Jan 1, 2025 | Decoder, Object | Unverified | 0 |
| Decoupled Motion Expression Video Segmentation | Jan 1, 2025 | Instance Segmentation, Referring Video Object Segmentation | Unverified | 0 |
| VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Jan 1, 2025 | Large Language Model, Video Segmentation | Unverified | 0 |
| HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver | Jan 1, 2025 | Reasoning Segmentation, Segmentation | Code Available | 2 |
| Is Segment Anything Model 2 All You Need for Surgery Video Segmentation? A Systematic Evaluation | Dec 31, 2024 | All, Segmentation | Unverified | 0 |
| Generative Video Propagation | Dec 27, 2024 | Image to Video Generation, Video Generation | Unverified | 0 |