| Open-World Skill Discovery from Unsegmented Demonstrations | Mar 11, 2025 | Boundary Detection, Event Segmentation | Unverified | 0 |
| OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation | Mar 10, 2025 | Pseudo Label, Semantic Segmentation | Unverified | 0 |
| Rethinking Few-Shot Medical Image Segmentation by SAM2: A Training-Free Framework with Augmentative Prompting and Dynamic Matching | Mar 5, 2025 | Data Augmentation, Few-Shot Learning | Unverified | 0 |
| Parameter-free Video Segmentation for Vision and Language Understanding | Mar 3, 2025 | Question Answering, Video Question Answering | Unverified | 0 |
| An Analysis of Data Transformation Effects on Segment Anything 2 | Feb 25, 2025 | Semantic Segmentation, Video Object Segmentation | Unverified | 0 |
| Deep learning approaches to surgical video segmentation and object detection: A Scoping Review | Feb 23, 2025 | object-detection, Object Detection | Unverified | 0 |
| Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field | Feb 22, 2025 | 2D Panoptic Segmentation, 3D Scene Reconstruction | Unverified | 0 |
| Role of the Pretraining and the Adaptation data sizes for low-resource real-time MRI video segmentation | Feb 20, 2025 | Video Segmentation, Video Semantic Segmentation | Unverified | 0 |
| Efficient Portrait Matte Creation With Layer Diffusion and Connectivity Priors | Jan 27, 2025 | Image Matting, Video Segmentation | Unverified | 0 |
| Efficient Frame Extraction: A Novel Approach Through Frame Similarity and Surgical Tool Tracking for Video Segmentation | Jan 19, 2025 | Video Segmentation, Video Semantic Segmentation | Code Available | 0 |
| Static Segmentation by Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation | Jan 12, 2025 | Image Retrieval, Image Segmentation | Unverified | 0 |
| Segment Anything Model for Zero-shot Single Particle Tracking in Liquid Phase Transmission Electron Microscopy | Jan 6, 2025 | Video Segmentation, Video Semantic Segmentation | Code Available | 0 |
| EntitySAM: Segment Everything in Video | Jan 1, 2025 | Decoder, Object | Unverified | 0 |
| VideoGLaMM: A Large Multimodal Model for Pixel-Level Visual Grounding in Videos | Jan 1, 2025 | Large Language Model, Video Segmentation | Unverified | 0 |
| Decoupled Motion Expression Video Segmentation | Jan 1, 2025 | Instance Segmentation, Referring Video Object Segmentation | Unverified | 0 |
| Is Segment Anything Model 2 All You Need for Surgery Video Segmentation? A Systematic Evaluation | Dec 31, 2024 | All, Segmentation | Unverified | 0 |
| Generative Video Propagation | Dec 27, 2024 | Image to Video Generation, Video Generation | Unverified | 0 |
| When SAM2 Meets Video Shadow and Mirror Detection | Dec 26, 2024 | Image Segmentation, Mirror Detection | Code Available | 0 |
| Collaborative Hybrid Propagator for Temporal Misalignment in Audio-Visual Segmentation | Dec 11, 2024 | Video Segmentation, Video Semantic Segmentation | Unverified | 0 |
| RoMo: Robust Motion Segmentation Improves Structure from Motion | Nov 27, 2024 | Camera Calibration, Motion Segmentation | Unverified | 0 |
| Geometric Algebra Planes: Convex Implicit Neural Volumes | Nov 20, 2024 | Decoder, Video Segmentation | Unverified | 0 |
| Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level | Nov 15, 2024 | Benchmarking, counterfactual | Unverified | 0 |
| Zero-shot capability of SAM-family models for bone segmentation in CT scans | Nov 13, 2024 | Image Segmentation, Medical Image Segmentation | Unverified | 0 |
| MSEG-VCUQ: Multimodal SEGmentation with Enhanced Vision Foundation Models, Convolutional Neural Networks, and Uncertainty Quantification for High-Speed Video Phase Detection Data | Nov 12, 2024 | Segmentation, Uncertainty Quantification | Code Available | 0 |
| GaussianCut: Interactive segmentation via graph cut for 3D Gaussian Splatting | Nov 12, 2024 | 3DGS, graph construction | Unverified | 0 |