| Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement | Mar 9, 2025 | Domain GeneralizationObject Detection | CodeCode Available | 4 |
| SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images | Oct 2, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 3 |
| SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery | Dec 15, 2023 | Contrastive LearningEarth Observation | CodeCode Available | 3 |
| Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | Aug 4, 2023 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| PosSAM: Panoptic Open-vocabulary Segment Anything | Mar 14, 2024 | DecoderOpen Vocabulary Panoptic Segmentation | CodeCode Available | 2 |
| ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Aug 9, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Aug 1, 2024 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP | Oct 9, 2022 | Image CaptioningOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Nov 15, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation | Mar 21, 2023 | Image SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Oct 2, 2023 | image-classificationImage Classification | CodeCode Available | 2 |
| SAD: Segment Any RGBD | May 23, 2023 | 3D Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| A Unified Framework for 3D Scene Understanding | Jul 3, 2024 | Contrastive LearningKnowledge Distillation | CodeCode Available | 2 |
| Understanding Multi-Granularity for Open-Vocabulary Part Segmentation | Jun 17, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Side Adapter Network for Open-Vocabulary Semantic Segmentation | Feb 23, 2023 | Language ModellingOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation | Apr 12, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation | Dec 5, 2024 | Image SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation | Aug 9, 2024 | Image to textObject | CodeCode Available | 2 |
| Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Mar 8, 2023 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation | Dec 19, 2023 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 2 |
| HyperSeg: Towards Universal Visual Segmentation with Large Language Model | Nov 26, 2024 | Language ModelingLarge Language Model | CodeCode Available | 2 |
| Efficient Redundancy Reduction for Open-Vocabulary Semantic Segmentation | Jan 29, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation | Nov 21, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation | Nov 27, 2023 | DecoderOpen Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models | Nov 28, 2023 | Image CaptioningImage-text matching | CodeCode Available | 1 |
| CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Sep 25, 2023 | Image SegmentationObject Localization | CodeCode Available | 1 |
| Learning Mask-aware CLIP Representations for Zero-Shot Segmentation | Sep 30, 2023 | Open Vocabulary Semantic SegmentationZero Shot Segmentation | CodeCode Available | 1 |
| ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation | Jun 26, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements | Nov 18, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation | Jul 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation | Nov 27, 2022 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning | Dec 9, 2022 | Contrastive Learningimage-classification | CodeCode Available | 1 |
| Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter | Sep 6, 2023 | Contrastive LearningDenoising | CodeCode Available | 1 |
| Open-Vocabulary Semantic Segmentation with Image Embedding Balancing | Jun 14, 2024 | DecoderOpen Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Decoupling Zero-Shot Semantic Segmentation | Dec 15, 2021 | Open Vocabulary Semantic SegmentationSegmentation | CodeCode Available | 1 |
| OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning | May 22, 2025 | Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Open-Vocabulary Universal Image Segmentation with MaskCLIP | Aug 18, 2022 | Image SegmentationInstance Segmentation | CodeCode Available | 1 |
| OV-PARTS: Towards Open-Vocabulary Part Segmentation | Oct 8, 2023 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation | Jan 16, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Global Knowledge Calibration for Fast Open-Vocabulary Segmentation | Mar 16, 2023 | Knowledge DistillationOpen Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields | Apr 1, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation | Mar 25, 2025 | cross-modal alignmentOpen Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation | Apr 14, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Open-Vocabulary Segmentation with Semantic-Assisted Calibration | Dec 7, 2023 | AttributeOpen Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation | Jan 1, 2025 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| Open-vocabulary Semantic Segmentation with Frozen Vision-Language Models | Oct 27, 2022 | Image SegmentationLanguage Modelling | CodeCode Available | 1 |
| Exploring Simple Open-Vocabulary Semantic Segmentation | Jan 22, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model | Dec 29, 2021 | image-classificationImage Classification | CodeCode Available | 1 |
| Learning Open-vocabulary Semantic Segmentation Models From Natural Language Supervision | Jan 22, 2023 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 |
| A Closer Look at the Explainability of Contrastive Language-Image Pre-training | Apr 12, 2023 | Interactive SegmentationLanguage Modelling | CodeCode Available | 1 |