| RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models | Jan 12, 2025 | Image Segmentation, Segmentation | Code Available | 2 |
| FOCUS: Towards Universal Foreground Segmentation | Jan 9, 2025 | Camouflaged Object Segmentation, Defocus Blur Detection | Code Available | 2 |
| CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models | Jan 9, 2025 | Cell Segmentation, Dataset Generation | Code Available | 2 |
| LesionLocator: Zero-Shot Universal Tumor Segmentation and Tracking in 3D Whole-Body Imaging | Jan 1, 2025 | Lesion Segmentation, Segmentation | Code Available | 2 |
| nnWNet: Rethinking the Use of Transformers in Biomedical Image Segmentation and Calling for a Unified Evaluation Benchmark | Jan 1, 2025 | Benchmarking, Image Segmentation | Code Available | 2 |
| HyperSeg: Hybrid Segmentation Assistant with Fine-grained Visual Perceiver | Jan 1, 2025 | Reasoning Segmentation, Segmentation | Code Available | 2 |
| CGCOD: Class-Guided Camouflaged Object Detection | Dec 25, 2024 | Object, object-detection | Code Available | 2 |
| Learnable Prompting SAM-induced Knowledge Distillation for Semi-supervised Medical Image Segmentation | Dec 18, 2024 | Image Segmentation, Knowledge Distillation | Code Available | 2 |
| InstructSeg: Unifying Instructed Visual Segmentation with Multi-modal Large Language Models | Dec 18, 2024 | Reasoning Segmentation, Segmentation | Code Available | 2 |
| FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation | Dec 12, 2024 | Cross-Domain Few-Shot, Domain Generalization | Code Available | 2 |
| Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation | Dec 5, 2024 | Image Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| Talking to DINO: Bridging Self-Supervised Vision Backbones with Language for Open-Vocabulary Segmentation | Nov 28, 2024 | Segmentation | Code Available | 2 |
| vesselFM: A Foundation Model for Universal 3D Blood Vessel Segmentation | Nov 26, 2024 | Image Segmentation, Medical Image Analysis | Code Available | 2 |
| HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Nov 26, 2024 | Language Modeling, Large Language Model | Code Available | 2 |
| UltraSam: A Foundation Model for Ultrasound using Large Open-Access Segmentation Datasets | Nov 25, 2024 | Segmentation | Code Available | 2 |
| Open-Vocabulary Online Semantic Mapping for SLAM | Nov 22, 2024 | Segmentation, Semantic SLAM | Code Available | 2 |
| Find Any Part in 3D | Nov 20, 2024 | 3D Part Segmentation, Diversity | Code Available | 2 |
| CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Nov 15, 2024 | Open Vocabulary Semantic Segmentation, Open-Vocabulary Semantic Segmentation | Code Available | 2 |
| Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation | Nov 14, 2024 | Segmentation, Semantic Segmentation | Code Available | 2 |
| On Deep Learning for Geometric and Semantic Scene Understanding Using On-Vehicle 3D LiDAR | Nov 1, 2024 | 3D Semantic Segmentation, Autonomous Driving | Code Available | 2 |
| CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation | Oct 30, 2024 | Domain Adaptation, Domain Generalization | Code Available | 2 |
| DI-MaskDINO: A Joint Object Detection and Instance Segmentation Model | Oct 22, 2024 | Decoder, Instance Segmentation | Code Available | 2 |
| High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity | Oct 14, 2024 | Denoising, Dichotomous Image Segmentation | Code Available | 2 |
| Text4Seg: Reimagining Image Segmentation as Text Generation | Oct 13, 2024 | Image Segmentation, Referring Expression | Code Available | 2 |
| Interactive4D: Interactive 4D LiDAR Segmentation | Oct 10, 2024 | Interactive Segmentation, Segmentation | Code Available | 2 |