| OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning | May 22, 2025 | Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| DreamMask: Boosting Open-vocabulary Panoptic Segmentation with Synthetic Data | Jan 3, 2025 | Open Vocabulary Panoptic Segmentation, Panoptic Segmentation | Unverified | 0 |
| Open-Vocabulary Panoptic Segmentation Using BERT Pre-Training of Vision-Language Multiway Transformer Model | Dec 25, 2024 | Open Vocabulary Panoptic Segmentation, Panoptic Segmentation | Code Available | 1 |
| EOV-Seg: Efficient Open-Vocabulary Panoptic Segmentation | Dec 11, 2024 | Decoder, GPU | Code Available | 1 |
| Adapting Vision-Language Model with Fine-grained Semantics for Open-Vocabulary Segmentation | Sep 24, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation | Aug 1, 2024 | Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 2 |
| PosSAM: Panoptic Open-vocabulary Segment Anything | Mar 14, 2024 | Decoder, Open Vocabulary Panoptic Segmentation | Code Available | 2 |
| UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding | Jan 12, 2024 | Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation | Code Available | 1 |
| 3D Open-Vocabulary Panoptic Segmentation with 2D-3D Vision-Language Distillation | Jan 4, 2024 | 3D Panoptic Segmentation, Autonomous Driving | Unverified | 0 |
| CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Oct 2, 2023 | image-classification, Image Classification | Code Available | 2 |