SOTAVerified

Open Vocabulary Semantic Segmentation

Papers

Showing 26–50 of 113 papers

| Title | Status | Hype |
|---|---|---|
| CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Code | 1 |
| Learning Mask-aware CLIP Representations for Zero-Shot Segmentation | Code | 1 |
| ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation | Code | 1 |
| ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements | Code | 1 |
| Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation | Code | 1 |
| SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation | Code | 1 |
| Open Vocabulary Semantic Segmentation with Patch Aligned Contrastive Learning | Code | 1 |
| Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter | Code | 1 |
| Open-Vocabulary Semantic Segmentation with Image Embedding Balancing | Code | 1 |
| Decoupling Zero-Shot Semantic Segmentation | Code | 1 |
| OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning | Code | 1 |
| Open-Vocabulary Universal Image Segmentation with MaskCLIP | Code | 1 |
| OV-PARTS: Towards Open-Vocabulary Part Segmentation | Code | 1 |
| Fine-Grained Image-Text Correspondence with Cost Aggregation for Open-Vocabulary Part Segmentation | Code | 1 |
| Global Knowledge Calibration for Fast Open-Vocabulary Segmentation | Code | 1 |
| GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields | Code | 1 |
| LPOSS: Label Propagation Over Patches and Pixels for Open-vocabulary Semantic Segmentation | Code | 1 |
| FLOSS: Free Lunch in Open-vocabulary Semantic Segmentation | Code | 1 |
| Open-Vocabulary Segmentation with Semantic-Assisted Calibration | Code | 1 |
| FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation | Code | 1 |
| Open-vocabulary Semantic Segmentation with Frozen Vision-Language Models | Code | 1 |
| Exploring Simple Open-Vocabulary Semantic Segmentation | Code | 1 |
| A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model | Code | 1 |
| Learning Open-vocabulary Semantic Segmentation Models From Natural Language Supervision | Code | 1 |
| A Closer Look at the Explainability of Contrastive Language-Image Pre-training | Code | 1 |

Benchmark Results

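| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HyperSeg | mIoU | 64.6 |  | Unverified |
| 2 | SILC | mIoU | 63.5 |  | Unverified |
| 3 | CAT-Seg | mIoU | 63.3 |  | Unverified |
| 4 | MaskCLIP++ | mIoU | 62.5 |  | Unverified |
| 5 | CLIPSelf | mIoU | 62.3 |  | Unverified |
| 6 | UMG-CLIP-L/14 | mIoU | 61 |  | Unverified |
| 7 | SED | mIoU | 60.6 |  | Unverified |
| 8 | Mask-Adapter | mIoU | 60.4 |  | Unverified |
| 9 | EBSeg-L | mIoU | 60.2 |  | Unverified |
| 10 | MAFT+ | mIoU | 59.4 |  | Unverified |
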
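| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UMG-CLIP-E/14 | mIoU | 38.2 |  | Unverified |
| 2 | Mask-Adapter | mIoU | 38.2 |  | Unverified |
| 3 | MaskCLIP++ | mIoU | 38.2 |  | Unverified |
| 4 | CAT-Seg | mIoU | 37.9 |  | Unverified |
| 5 | SILC | mIoU | 37.7 |  | Unverified |
| 6 | MAFT+ | mIoU | 36.1 |  | Unverified |
| 7 | UMG-CLIP-L/14 | mIoU | 36.1 |  | Unverified |
| 8 | OVSeg + OpenDAS | mIoU | 35.8 |  | Unverified |
| 9 | SED | mIoU | 35.2 |  | Unverified |
| 10 | CLIPSelf | mIoU | 34.5 |  | Unverified |
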
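| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UMG-CLIP-E/14 | mIoU | 17.3 |  | Unverified |
| 2 | MaskCLIP++ | mIoU | 16.8 |  | Unverified |
| 3 | Mask-Adapter | mIoU | 16.2 |  | Unverified |
| 4 | CAT-Seg | mIoU | 16 |  | Unverified |
| 5 | UMG-CLIP-L/14 | mIoU | 15.4 |  | Unverified |
| 6 | MAFT+ | mIoU | 15.1 |  | Unverified |
| 7 | SILC | mIoU | 15 |  | Unverified |
| 8 | PosSAM | mIoU | 14.9 |  | Unverified |
| 9 | FC-CLIP | mIoU | 14.8 |  | Unverified |
| 10 | SCAN | mIoU | 14 |  | Unverified |
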
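| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UMG-CLIP-L/14 | mIoU | 97.9 |  | Unverified |
| 2 | SILC | mIoU | 97.6 |  | Unverified |
| 3 | SCAN | mIoU | 97.2 |  | Unverified |
| 4 | CAT-Seg | mIoU | 97 |  | Unverified |
| 5 | MaskCLIP++ | mIoU | 96.8 |  | Unverified |
| 6 | MAFT+ | mIoU | 96.5 |  | Unverified |
| 7 | EBSeg-L | mIoU | 96.4 |  | Unverified |
| 8 | FC-CLIP | mIoU | 95.4 |  | Unverified |
| 9 | OVSeg Swin-B | mIoU | 94.5 |  | Unverified |
| 10 | MAFT-ViTL | mIoU | 92.1 |  | Unverified |
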
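| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SILC | mIoU | 25.8 |  | Unverified |
| 2 | UMG-CLIP-E/14 | mIoU | 25.2 |  | Unverified |
| 3 | MaskCLIP++ | mIoU | 23.9 |  | Unverified |
| 4 | CAT-Seg | mIoU | 23.8 |  | Unverified |
| 5 | UMG-CLIP-L/14 | mIoU | 23.2 |  | Unverified |
| 6 | Mask-Adapter | mIoU | 22.7 |  | Unverified |
| 7 | SED | mIoU | 22.6 |  | Unverified |
| 8 | MAFT+ | mIoU | 21.6 |  | Unverified |
| 9 | EBSeg-L | mIoU | 21 |  | Unverified |
| 10 | FC-CLIP | mIoU | 18.2 |  | Unverified |
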
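| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | POMP | HIoU | 39.1 |  | Unverified |
| 2 | ZSSeg | HIoU | 37.8 |  | Unverified |
| 3 | ZegFormer | HIoU | 34.8 |  | Unverified |
| 4 | TTD (TCL) | mIoU | 23.7 |  | Unverified |
| 5 | LaVG | mIoU | 23.2 |  | Unverified |
| 6 | CLIP Surgery (original CLIP without any fine-tuning) | mIoU | 21.9 |  | Unverified |
| 7 | TTD (MaskCLIP) | mIoU | 19.4 |  | Unverified |
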
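| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | FC-CLIP | mIoU | 56.2 |  | Unverified |
| 2 | SimSeg | mIoU | 34.5 |  | Unverified |
| 3 | TTD (TCL) | mIoU | 32 |  | Unverified |
| 4 | CLIP Surgery (CLIP without any fine-tuning) | mIoU | 31.4 |  | Unverified |
| 5 | TTD (MaskCLIP) | mIoU | 27 |  | Unverified |
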
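| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | UMG-CLIP-E/14 | mIoU | 85.4 |  | Unverified |
| 2 | CAT-Seg | mIoU | 82.5 |  | Unverified |
| 3 | SILC | mIoU | 82.5 |  | Unverified |
| 4 | FC-CLIP | mIoU | 81.8 |  | Unverified |
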
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SkySense-O | mIoU | -43.9 |  | Unverified |
| 2 | SegEarth-OV | mIoU | -21.7 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PACL | mIoU | 38.8 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SkySense-O | mIoU | 8.3 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SkySense-O | mIoU | 54.1 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SkySense-O | mIoU | 30.89 |  | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SkySense-O | mIoU | 32.12 |  | Unverified |