| SegPoint: Segment Any Point Cloud via Large Language Model | Jul 18, 2024 | 3D Semantic Segmentation, Language Modeling | Unverified | 0 |
| ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference | Jul 17, 2024 | Open Vocabulary Semantic Segmentation | Unverified | 0 |
| Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation | Jul 11, 2024 | Language Modeling | Code Available | 1 |
| Test-time Contrastive Concepts for Open-world Semantic Segmentation | Jul 6, 2024 | Open Vocabulary Semantic Segmentation | Unverified | 0 |
| A Unified Framework for 3D Scene Understanding | Jul 3, 2024 | Contrastive Learning, Knowledge Distillation | Code Available | 2 |
| Understanding Multi-Granularity for Open-Vocabulary Part Segmentation | Jun 17, 2024 | Open Vocabulary Semantic Segmentation | Code Available | 2 |
| Open-Vocabulary Semantic Segmentation with Image Embedding Balancing | Jun 14, 2024 | Decoder, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding | Jun 3, 2024 | Domain Adaptation, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| OpenDAS: Open-Vocabulary Domain Adaptation for 2D and 3D Segmentation | May 30, 2024 | 3D Instance Segmentation, 3D Open-Vocabulary Instance Segmentation | Unverified | 0 |
| Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation | May 29, 2024 | Open Vocabulary Semantic Segmentation | Unverified | 0 |
| Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation | Apr 12, 2024 | Open Vocabulary Semantic Segmentation | Code Available | 2 |
| Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation | Apr 9, 2024 | Open Vocabulary Semantic Segmentation | Unverified | 0 |
| GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields | Apr 1, 2024 | Open Vocabulary Semantic Segmentation | Code Available | 1 |
| TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias | Mar 30, 2024 | Multi-Label Text Classification, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation | Mar 30, 2024 | Attribute, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation | Mar 17, 2024 | Open Vocabulary Semantic Segmentation | Code Available | 0 |
| TAG: Guidance-free Open-Vocabulary Semantic Segmentation | Mar 17, 2024 | Open Vocabulary Semantic Segmentation | Code Available | 1 |
| PosSAM: Panoptic Open-vocabulary Segment Anything | Mar 14, 2024 | Decoder, Open Vocabulary Panoptic Segmentation | Code Available | 2 |
| Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision | Mar 6, 2024 | Contrastive Learning, Cross-Modal Alignment | Unverified | 0 |
| Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation | Feb 21, 2024 | Open Vocabulary Semantic Segmentation | Unverified | 0 |
| Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors | Jan 29, 2024 | Decoder, Image Generation | Unverified | 0 |
| Exploring Simple Open-Vocabulary Semantic Segmentation | Jan 22, 2024 | Open Vocabulary Semantic Segmentation | Code Available | 1 |
| UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding | Jan 12, 2024 | Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| Open-Vocabulary 3D Semantic Segmentation with Foundation Models | Jan 1, 2024 | 3D Semantic Segmentation, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification | Dec 21, 2023 | Attribute, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation | Dec 19, 2023 | Open Vocabulary Semantic Segmentation | Code Available | 2 |
| SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery | Dec 15, 2023 | Contrastive Learning, Earth Observation | Code Available | 3 |
| Open-Vocabulary Segmentation with Semantic-Assisted Calibration | Dec 7, 2023 | Attribute, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| Auto-Vocabulary Semantic Segmentation | Dec 7, 2023 | Language Modeling | Code Available | 1 |
| Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models | Nov 28, 2023 | Image Captioning, Image-Text Matching | Code Available | 1 |
| SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation | Nov 27, 2023 | Decoder, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation | Oct 29, 2023 | Open Vocabulary Semantic Segmentation | Code Available | 1 |
| SILC: Improving Vision Language Pretraining with Self-Distillation | Oct 20, 2023 | Classification, Contrastive Learning | Unverified | 0 |
| OV-PARTS: Towards Open-Vocabulary Part Segmentation | Oct 8, 2023 | Open Vocabulary Semantic Segmentation | Code Available | 1 |
| CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Oct 2, 2023 | Image Classification | Code Available | 2 |
| Learning Mask-aware CLIP Representations for Zero-Shot Segmentation | Sep 30, 2023 | Open Vocabulary Semantic Segmentation, Zero Shot Segmentation | Code Available | 1 |
| CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Sep 25, 2023 | Image Segmentation, Object Localization | Code Available | 1 |
| Panoptic Vision-Language Feature Fields | Sep 11, 2023 | Contrastive Learning, Instance Segmentation | Code Available | 1 |
| Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter | Sep 6, 2023 | Contrastive Learning, Denoising | Code Available | 1 |
| AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation | Aug 31, 2023 | Attribute, Open Vocabulary Semantic Segmentation | Code Available | 0 |
| Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | Aug 4, 2023 | Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| Exploring Open-Vocabulary Semantic Segmentation without Human Labels | Jun 1, 2023 | Open Vocabulary Semantic Segmentation | Unverified | 0 |
| SAD: Segment Any RGBD | May 23, 2023 | 3D Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| TagCLIP: Improving Discrimination Ability of Open-Vocabulary Semantic Segmentation | Apr 15, 2023 | Language Modeling | Code Available | 1 |
| MVP-SEG: Multi-View Prompt Learning for Open-Vocabulary Semantic Segmentation | Apr 14, 2023 | GPR, Open Vocabulary Semantic Segmentation | Unverified | 0 |
| A Closer Look at the Explainability of Contrastive Language-Image Pre-training | Apr 12, 2023 | Interactive Segmentation, Language Modeling | Code Available | 1 |
| Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network | Apr 3, 2023 | Classification, Language Modeling | Code Available | 1 |
| CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation | Mar 21, 2023 | Image Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| Global Knowledge Calibration for Fast Open-Vocabulary Segmentation | Mar 16, 2023 | Knowledge Distillation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Mar 8, 2023 | Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |