| CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Nov 15, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Caltech Aerial RGB-Thermal Dataset in the Wild | Mar 13, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| SimpleClick: Interactive Image Segmentation with Simple Vision Transformers | Oct 20, 2022 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| Simplifying Object Segmentation with PixelLib Library | Jan 20, 2021 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| Context-Aware Video Instance Segmentation | Jul 3, 2024 | Instance SegmentationPanoptic Segmentation | CodeCode Available | 2 |
| MIST: A Simple and Scalable End-To-End 3D Medical Imaging Segmentation Framework | Jul 31, 2024 | 3D Medical Imaging SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Cascade-CLIP: Cascaded Vision-Language Embeddings Alignment for Zero-Shot Semantic Segmentation | Jun 2, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| SoftGroup for 3D Instance Segmentation on Point Clouds | Mar 3, 2022 | 3D Instance Segmentation3D Object Detection | CodeCode Available | 2 |
| Sparse Instance Activation for Real-Time Instance Segmentation | Mar 24, 2022 | Instance SegmentationObject | CodeCode Available | 2 |
| Context Encoding for Semantic Segmentation | Mar 23, 2018 | image-classificationImage Classification | CodeCode Available | 2 |
| SRFormer: Text Detection Transformer with Incorporated Segmentation and Regression | Aug 21, 2023 | Decoderregression | CodeCode Available | 2 |
| SSSegmenation: An Open Source Supervised Semantic Segmentation Toolbox Based on PyTorch | May 26, 2023 | Image SegmentationSegmentation | CodeCode Available | 2 |
| Swin3D: A Pretrained Transformer Backbone for 3D Indoor Scene Understanding | Apr 14, 2023 | 3D Object DetectionScene Understanding | CodeCode Available | 2 |
| Swin UNETR: Swin Transformers for Semantic Segmentation of Brain Tumors in MRI Images | Jan 4, 2022 | 3D Semantic SegmentationBrain Tumor Segmentation | CodeCode Available | 2 |
| Teeth3DS+: An Extended Benchmark for Intraoral 3D Scans Analysis | Oct 12, 2022 | 3D Part SegmentationSegmentation | CodeCode Available | 2 |
| Temporal Action Segmentation: An Analysis of Modern Techniques | Oct 19, 2022 | Action SegmentationSegmentation | CodeCode Available | 2 |
| Cell Detection with Star-convex Polygons | Jun 9, 2018 | Cell DetectionCell Segmentation | CodeCode Available | 2 |
| CellViT: Vision Transformers for Precise Cell Segmentation and Classification | Jun 27, 2023 | Cell DetectionCell Segmentation | CodeCode Available | 2 |
| CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models | Jan 9, 2025 | Cell SegmentationDataset Generation | CodeCode Available | 2 |
| CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation | Oct 30, 2024 | Domain AdaptationDomain Generalization | CodeCode Available | 2 |
| The Missing Point in Vision Transformers for Universal Image Segmentation | May 26, 2025 | Image SegmentationInstance Segmentation | CodeCode Available | 2 |
| TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction | May 27, 2024 | MambaPrediction | CodeCode Available | 2 |
| 3D UX-Net: A Large Kernel Volumetric ConvNet Modernizing Hierarchical Transformer for Medical Image Segmentation | Sep 29, 2022 | Image SegmentationMedical Image Segmentation | CodeCode Available | 2 |
| Deep Covariance Alignment for Domain Adaptive Remote Sensing Image Segmentation | Jan 9, 2024 | Image SegmentationSegmentation | CodeCode Available | 2 |
| OBSeg: Accurate and Fast Instance Segmentation Framework Using Segmentation Foundation Models with Oriented Bounding Box Prompts | Jan 16, 2024 | Amodal Instance SegmentationInstance Segmentation | CodeCode Available | 2 |