| Open-Set Domain Adaptation for Semantic Segmentation | May 30, 2024 | Domain AdaptationSemantic Segmentation | CodeCode Available | 2 |
| Open-Vocabulary Attention Maps with Token Optimization for Semantic Segmentation in Diffusion Models | Mar 21, 2024 | Image GenerationSemantic Segmentation | CodeCode Available | 2 |
| DAMamba: Vision State Space Model with Dynamic Adaptive Scan | Feb 18, 2025 | image-classificationImage Classification | CodeCode Available | 2 |
| DatasetDM: Synthesizing Data with Perception Annotations Using Diffusion Models | Aug 11, 2023 | Dataset GenerationDecoder | CodeCode Available | 2 |
| Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes | Jul 16, 2024 | Human Instance SegmentationInstance Segmentation | CodeCode Available | 2 |
| 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification | Dec 1, 2024 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| Customized Segment Anything Model for Medical Image Segmentation | Apr 26, 2023 | DecoderImage Segmentation | CodeCode Available | 2 |
| Dataset Quantization | Aug 21, 2023 | Dataset Distillationobject-detection | CodeCode Available | 2 |
| Parameter-Inverted Image Pyramid Networks | Jun 6, 2024 | Computational Efficiencyimage-classification | CodeCode Available | 2 |
| PartSTAD: 2D-to-3D Part Segmentation Task Adaptation | Jan 11, 2024 | 3D Part SegmentationForeground Segmentation | CodeCode Available | 2 |
| PEM: Prototype-based Efficient MaskFormer for Image Segmentation | Feb 29, 2024 | Image SegmentationPanoptic Segmentation | CodeCode Available | 2 |
| Perceive Anything: Recognize, Explain, Caption, and Segment Anything in Images and Videos | Jun 5, 2025 | GPUSemantic Segmentation | CodeCode Available | 2 |
| A Multi-objective Optimization Benchmark Test Suite for Real-time Semantic Segmentation | Apr 25, 2024 | Autonomous DrivingEvolutionary Algorithms | CodeCode Available | 2 |
| Cross-Image Relational Knowledge Distillation for Semantic Segmentation | Apr 14, 2022 | Knowledge DistillationSegmentation | CodeCode Available | 2 |
| Cross Language Image Matching for Weakly Supervised Semantic Segmentation | Mar 5, 2022 | ObjectSemantic Segmentation | CodeCode Available | 2 |
| CrossEarth: Geospatial Vision Foundation Model for Domain Generalizable Remote Sensing Semantic Segmentation | Oct 30, 2024 | Domain AdaptationDomain Generalization | CodeCode Available | 2 |
| PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images | Sep 20, 2024 | Image SegmentationSemantic Segmentation | CodeCode Available | 2 |
| A Data-scalable Transformer for Medical Image Segmentation: Architecture, Model Efficiency, and Benchmark | Feb 28, 2022 | Image SegmentationInductive Bias | CodeCode Available | 2 |
| CrossFormer++: A Versatile Vision Transformer Hinging on Cross-scale Attention | Mar 13, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| CAS-ViT: Convolutional Additive Self-attention Vision Transformers for Efficient Mobile Applications | Aug 7, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images | Mar 21, 2025 | Image SegmentationMamba | CodeCode Available | 2 |
| ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Aug 9, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| DAT++: Spatially Dynamic Vision Transformer with Deformable Attention | Sep 4, 2023 | Image ClassificationInstance Segmentation | CodeCode Available | 2 |
| Deep Spectral Methods: A Surprisingly Strong Baseline for Unsupervised Semantic Segmentation and Localization | May 16, 2022 | graph partitioningSegmentation | CodeCode Available | 2 |
| Distribution-Free, Risk-Controlling Prediction Sets | Jan 7, 2021 | BIG-bench Machine LearningClassification | CodeCode Available | 2 |
| PyMIC: A deep learning toolkit for annotation-efficient medical image segmentation | Aug 19, 2022 | Deep LearningImage Segmentation | CodeCode Available | 2 |
| CAT-SAM: Conditional Tuning for Few-Shot Adaptation of Segment Anything Model | Feb 6, 2024 | DecoderImage Segmentation | CodeCode Available | 2 |
| CDDFuse: Correlation-Driven Dual-Branch Feature Decomposition for Multi-Modality Image Fusion | Nov 26, 2022 | object-detectionObject Detection | CodeCode Available | 2 |
| FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation | Mar 8, 2024 | Federated LearningImage Segmentation | CodeCode Available | 2 |
| CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation | Mar 21, 2023 | Image SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| LVOS: A Benchmark for Large-scale Long-term Video Object Segmentation | Apr 30, 2024 | AttributeSemantic Segmentation | CodeCode Available | 2 |
| ReCLIP++: Learn to Rectify the Bias of CLIP for Unsupervised Semantic Segmentation | Aug 13, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 2 |
| RefMask3D: Language-Guided Transformer for 3D Referring Segmentation | Jul 25, 2024 | 3D visual groundingImage Segmentation | CodeCode Available | 2 |
| RegionPLC: Regional Point-Language Contrastive Learning for Open-World 3D Scene Understanding | Apr 3, 2023 | Contrastive LearningInstance Segmentation | CodeCode Available | 2 |
| Alleviating Textual Reliance in Medical Language-guided Segmentation via Prototype-driven Semantic Approximation | Jul 15, 2025 | Image SegmentationSegmentation | CodeCode Available | 2 |
| ResT V2: Simpler, Faster and Stronger | Apr 15, 2022 | Semantic Segmentation | CodeCode Available | 2 |
| Rethinking Data Augmentation for Robust LiDAR Semantic Segmentation in Adverse Weather | Jul 2, 2024 | Data AugmentationLIDAR Semantic Segmentation | CodeCode Available | 2 |
| Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting | Mar 18, 2025 | Instance SegmentationObject | CodeCode Available | 2 |
| Rethinking Interactive Image Segmentation with Low Latency High Quality and Diverse Prompts | Jan 1, 2024 | Image SegmentationInteractive Segmentation | CodeCode Available | 2 |
| CellViT: Vision Transformers for Precise Cell Segmentation and Classification | Jun 27, 2023 | Cell DetectionCell Segmentation | CodeCode Available | 2 |
| SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model | May 3, 2023 | Instance SegmentationObject | CodeCode Available | 2 |
| COVID-CT-Mask-Net: Prediction of COVID-19 from CT Scans Using Regional Features | Oct 14, 2020 | COVID-19 DiagnosisCOVID-19 Image Segmentation | CodeCode Available | 1 |
| CoTr: Efficiently Bridging CNN and Transformer for 3D Medical Image Segmentation | Mar 4, 2021 | Image SegmentationInductive Bias | CodeCode Available | 1 |
| CP2: Copy-Paste Contrastive Pretraining for Semantic Segmentation | Mar 22, 2022 | Contrastive LearningRepresentation Learning | CodeCode Available | 1 |
| CosPGD: an efficient white-box adversarial attack for pixel-wise prediction tasks | Feb 4, 2023 | Adversarial AttackAdversarial Robustness | CodeCode Available | 1 |
| COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes | Oct 31, 2024 | SegmentationSemantic Segmentation | CodeCode Available | 1 |
| Co-training with High-Confidence Pseudo Labels for Semi-supervised Medical Image Segmentation | Jan 11, 2023 | Image SegmentationLeft Atrium Segmentation | CodeCode Available | 1 |
| CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation | Jul 19, 2023 | Representation LearningScene Understanding | CodeCode Available | 1 |
| Co-segmentation Inspired Attention Module for Video-based Computer Vision Tasks | Nov 14, 2021 | Action ClassificationObject | CodeCode Available | 1 |
| Co-Scale Conv-Attentional Image Transformers | Apr 13, 2021 | Instance Segmentationobject-detection | CodeCode Available | 1 |