| Unleashing the Potential of SAM2 for Biomedical Images and Videos: A Survey | Aug 23, 2024 | Image SegmentationSegmentation | CodeCode Available | 5 |
| Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection | Mar 9, 2023 | DecoderObject Detection | CodeCode Available | 5 |
| Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement | Mar 9, 2025 | Domain GeneralizationObject Detection | CodeCode Available | 4 |
| ZIM: Zero-Shot Image Matting for Anything | Nov 1, 2024 | Image InpaintingImage Matting | CodeCode Available | 3 |
| Zero-Shot Surgical Tool Segmentation in Monocular Video Using Segment Anything Model 2 | Aug 3, 2024 | DiversitySegmentation | CodeCode Available | 3 |
| RobustSAM: Segment Anything Robustly on Degraded Images | Jun 13, 2024 | DeblurringImage Dehazing | CodeCode Available | 3 |
| A Simple Framework for Open-Vocabulary Segmentation and Detection | Mar 14, 2023 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 3 |
| Universal Instance Perception as Object Discovery and Retrieval | Mar 12, 2023 | Described Object DetectionGeneralized Referring Expression Comprehension | CodeCode Available | 3 |
| Generalized Decoding for Pixel, Image, and Language | Dec 21, 2022 | DecoderImage Segmentation | CodeCode Available | 3 |
| CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models | Jan 9, 2025 | Cell SegmentationDataset Generation | CodeCode Available | 2 |
| 3DGS-CD: 3D Gaussian Splatting-based Change Detection for Physical Object Rearrangement | Nov 6, 2024 | 3DGSChange Detection | CodeCode Available | 2 |
| Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations | Oct 3, 2024 | Zero Shot Segmentation | CodeCode Available | 2 |
| MedCLIP-SAMv2: Towards Universal Text-Driven Medical Image Segmentation | Sep 28, 2024 | Image SegmentationMedical Image Analysis | CodeCode Available | 2 |
| VCP-CLIP: A visual context prompting model for zero-shot anomaly segmentation | Jul 17, 2024 | Anomaly DetectionAnomaly Segmentation | CodeCode Available | 2 |
| DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut | Jun 5, 2024 | Image SegmentationSegmentation | CodeCode Available | 2 |
| Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation | Apr 9, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation | Mar 29, 2024 | Image SegmentationMedical Image Analysis | CodeCode Available | 2 |
| Diffuse, Attend, and Segment: Unsupervised Zero-Shot Segmentation using Stable Diffusion | Aug 23, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models | Aug 11, 2023 | Dataset GenerationDecoder | CodeCode Available | 2 |
| Hierarchical Open-vocabulary Universal Image Segmentation | Jul 3, 2023 | Image ComprehensionImage Segmentation | CodeCode Available | 2 |
| Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Mar 8, 2023 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Side Adapter Network for Open-Vocabulary Semantic Segmentation | Feb 23, 2023 | Language ModellingOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Language-driven Semantic Segmentation | Jan 10, 2022 | DescriptiveFew-Shot Semantic Segmentation | CodeCode Available | 2 |
| Compress Any Segment Anything Model (SAM) | Jul 11, 2025 | modelQuantization | CodeCode Available | 1 |
| Zero-Shot Tree Detection and Segmentation from Aerial Forest Imagery | Jun 3, 2025 | Image SegmentationSegmentation | CodeCode Available | 1 |
| COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training | Dec 2, 2024 | Self-Supervised LearningSemantic Segmentation | CodeCode Available | 1 |
| Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation | Sep 4, 2024 | Dichotomous Image SegmentationImage Segmentation | CodeCode Available | 1 |
| SAM-UNet:Enhancing Zero-Shot Segmentation of SAM for Universal Medical Images | Aug 19, 2024 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| MeshSegmenter: Zero-Shot Mesh Semantic Segmentation via Texture Synthesis | Jul 18, 2024 | 3D Semantic SegmentationSegmentation | CodeCode Available | 1 |
| Frenet-Serret Frame-based Decomposition for Part Segmentation of 3D Curvilinear Structures | Apr 19, 2024 | ARCSegmentation | CodeCode Available | 1 |
| TV-SAM: Increasing Zero-Shot Segmentation Performance on Multimodal Medical Images Using GPT-4 Generated Descriptive Prompts Without Human Annotation | Feb 24, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 1 |
| MatSAM: Efficient Extraction of Microstructures of Materials via Visual Large Model | Jan 11, 2024 | Image SegmentationPrompt Engineering | CodeCode Available | 1 |
| Spectral Prompt Tuning:Unveiling Unseen Classes for Zero-Shot Semantic Segmentation | Dec 20, 2023 | DecoderSemantic Segmentation | CodeCode Available | 1 |
| Grounding Everything: Emerging Localization Properties in Vision-Language Transformers | Dec 1, 2023 | Image RetrievalObject Localization | CodeCode Available | 1 |
| GeoSAM: Fine-tuning SAM with Multi-Modal Prompts for Mobility Infrastructure Segmentation | Nov 19, 2023 | Image SegmentationLarge Language Model | CodeCode Available | 1 |
| Learning Mask-aware CLIP Representations for Zero-Shot Segmentation | Sep 30, 2023 | Open Vocabulary Semantic SegmentationZero Shot Segmentation | CodeCode Available | 1 |
| MediViSTA: Medical Video Segmentation via Temporal Fusion SAM Adaptation for Echocardiography | Sep 24, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| Zero-Shot Edge Detection with SCESAME: Spectral Clustering-based Ensemble for Segment Anything Model Estimation | Aug 26, 2023 | ClusteringEdge Detection | CodeCode Available | 1 |
| TongueSAM: An Universal Tongue Segmentation Model Based on SAM with Zero-Shot | Aug 12, 2023 | DiagnosticInteractive Segmentation | CodeCode Available | 1 |
| Training-free Object Counting with Prompts | Jun 30, 2023 | ObjectObject Counting | CodeCode Available | 1 |
| How to Efficiently Adapt Large Segmentation Model(SAM) to Medical Images | Jun 23, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| Primitive Generation and Semantic-related Alignment for Universal Zero-Shot Segmentation | Jun 19, 2023 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 1 |
| TomoSAM: a 3D Slicer extension using SAM for tomography segmentation | Jun 14, 2023 | 3D Part SegmentationImage Segmentation | CodeCode Available | 1 |
| PaintSeg: Training-free Segmentation via Painting | May 30, 2023 | Referring Image Matting (Prompt-based)Segmentation | CodeCode Available | 1 |
| One-Prompt to Segment All Medical Images | May 17, 2023 | AllImage Segmentation | CodeCode Available | 1 |
| Segment Anything Model for Medical Images? | Apr 28, 2023 | Image SegmentationMedical Image Segmentation | CodeCode Available | 1 |
| Generalist Vision Foundation Models for Medical Imaging: A Case Study of Segment Anything Model on Zero-Shot Medical Segmentation | Apr 25, 2023 | Computed Tomography (CT)Image Segmentation | CodeCode Available | 1 |
| Segment Anything Model for Medical Image Analysis: an Experimental Study | Apr 20, 2023 | Image SegmentationInteractive Segmentation | CodeCode Available | 1 |
| A Closer Look at the Explainability of Contrastive Language-Image Pre-training | Apr 12, 2023 | Interactive SegmentationLanguage Modelling | CodeCode Available | 1 |
| ZegOT: Zero-shot Segmentation Through Optimal Transport of Text Prompts | Jan 28, 2023 | SegmentationSemantic Segmentation | CodeCode Available | 1 |