Open Vocabulary Semantic Segmentation

Papers


Showing 1–25 of 113 papers

Title	Date	Tasks	Status	Hype
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement	Mar 9, 2025	Domain Generalization, Object Detection	Code Available	4
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images	Oct 2, 2024	Open Vocabulary Semantic Segmentation	Code Available	3
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery	Dec 15, 2023	Contrastive Learning, Earth Observation	Code Available	3
Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation	Dec 5, 2024	Image Segmentation, Open Vocabulary Semantic Segmentation	Code Available	2
HyperSeg: Towards Universal Visual Segmentation with Large Language Model	Nov 26, 2024	Language Modeling, Large Language Model	Code Available	2
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	Nov 15, 2024	Open Vocabulary Semantic Segmentation	Code Available	2
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation	Aug 9, 2024	Image to text, Object	Code Available	2
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation	Aug 9, 2024	Open Vocabulary Semantic Segmentation	Code Available	2
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	Aug 1, 2024	Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation	Code Available	2
A Unified Framework for 3D Scene Understanding	Jul 3, 2024	Contrastive Learning, Knowledge Distillation	Code Available	2
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation	Jun 17, 2024	Open Vocabulary Semantic Segmentation	Code Available	2
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation	Apr 12, 2024	Open Vocabulary Semantic Segmentation	Code Available	2
PosSAM: Panoptic Open-vocabulary Segment Anything	Mar 14, 2024	Decoder, Open Vocabulary Panoptic Segmentation	Code Available	2
CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation	Dec 19, 2023	Open Vocabulary Semantic Segmentation	Code Available	2
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction	Oct 2, 2023	Image Classification	Code Available	2
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP	Aug 4, 2023	Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation	Code Available	2
SAD: Segment Any RGBD	May 23, 2023	3D Panoptic Segmentation, Open Vocabulary Semantic Segmentation	Code Available	2
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation	Mar 21, 2023	Image Segmentation, Open Vocabulary Semantic Segmentation	Code Available	2
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models	Mar 8, 2023	Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation	Code Available	2
Side Adapter Network for Open-Vocabulary Semantic Segmentation	Feb 23, 2023	Language Modelling, Open Vocabulary Semantic Segmentation	Code Available	2
Open-Vocabulary Semantic Segmentation with Mask-adapted CLIP	Oct 9, 2022	Image Captioning, Open Vocabulary Semantic Segmentation	Code Available	2
ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation	Jun 26, 2025	Open Vocabulary Semantic Segmentation	Code Available	1
Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation	Jun 11, 2025	Autonomous Driving, Domain Generalization	Code Available	1
Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation	May 28, 2025	Image Classification	Code Available	1
OpenSeg-R: Improving Open-Vocabulary Segmentation via Step-by-Step Visual Reasoning	May 22, 2025	Open Vocabulary Panoptic Segmentation, Open Vocabulary Semantic Segmentation	Code Available	1


Datasets: PASCAL Context-59, ADE20K-150, ADE20K-847, PascalVOC-20, PASCAL Context-459, COCO-Stuff-171, Cityscapes, PascalVOC-20b, iSAID, Cityscape-171, FAST, ISPRS Potsdam

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	HyperSeg	mIoU	64.6	—	Unverified
2	SILC	mIoU	63.5	—	Unverified
3	CAT-Seg	mIoU	63.3	—	Unverified
4	MaskCLIP++	mIoU	62.5	—	Unverified
5	CLIPSelf	mIoU	62.3	—	Unverified
6	UMG-CLIP-L/14	mIoU	61	—	Unverified
7	SED	mIoU	60.6	—	Unverified
8	Mask-Adapter	mIoU	60.4	—	Unverified
9	EBSeg-L	mIoU	60.2	—	Unverified
10	MAFT+	mIoU	59.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-E/14	mIoU	38.2	—	Unverified
2	MaskCLIP++	mIoU	38.2	—	Unverified
3	Mask-Adapter	mIoU	38.2	—	Unverified
4	CAT-Seg	mIoU	37.9	—	Unverified
5	SILC	mIoU	37.7	—	Unverified
6	UMG-CLIP-L/14	mIoU	36.1	—	Unverified
7	MAFT+	mIoU	36.1	—	Unverified
8	OVSeg + OpenDAS	mIoU	35.8	—	Unverified
9	SED	mIoU	35.2	—	Unverified
10	CLIPSelf	mIoU	34.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-E/14	mIoU	17.3	—	Unverified
2	MaskCLIP++	mIoU	16.8	—	Unverified
3	Mask-Adapter	mIoU	16.2	—	Unverified
4	CAT-Seg	mIoU	16	—	Unverified
5	UMG-CLIP-L/14	mIoU	15.4	—	Unverified
6	MAFT+	mIoU	15.1	—	Unverified
7	SILC	mIoU	15	—	Unverified
8	PosSAM	mIoU	14.9	—	Unverified
9	FC-CLIP	mIoU	14.8	—	Unverified
10	SCAN	mIoU	14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-L/14	mIoU	97.9	—	Unverified
2	SILC	mIoU	97.6	—	Unverified
3	SCAN	mIoU	97.2	—	Unverified
4	CAT-Seg	mIoU	97	—	Unverified
5	MaskCLIP++	mIoU	96.8	—	Unverified
6	MAFT+	mIoU	96.5	—	Unverified
7	EBSeg-L	mIoU	96.4	—	Unverified
8	FC-CLIP	mIoU	95.4	—	Unverified
9	OVSeg Swin-B	mIoU	94.5	—	Unverified
10	HyperSeg	mIoU	92.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SILC	mIoU	25.8	—	Unverified
2	UMG-CLIP-E/14	mIoU	25.2	—	Unverified
3	MaskCLIP++	mIoU	23.9	—	Unverified
4	CAT-Seg	mIoU	23.8	—	Unverified
5	UMG-CLIP-L/14	mIoU	23.2	—	Unverified
6	Mask-Adapter	mIoU	22.7	—	Unverified
7	SED	mIoU	22.6	—	Unverified
8	MAFT+	mIoU	21.6	—	Unverified
9	EBSeg-L	mIoU	21	—	Unverified
10	FC-CLIP	mIoU	18.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	POMP	HIoU	39.1	—	Unverified
2	ZSSeg	HIoU	37.8	—	Unverified
3	ZegFormer	HIoU	34.8	—	Unverified
4	TTD (TCL)	mIoU	23.7	—	Unverified
5	LaVG	mIoU	23.2	—	Unverified
6	CLIP Surgery (original CLIP without any fine-tuning)	mIoU	21.9	—	Unverified
7	TTD (MaskCLIP)	mIoU	19.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FC-CLIP	mIoU	56.2	—	Unverified
2	SimSeg	mIoU	34.5	—	Unverified
3	TTD (TCL)	mIoU	32	—	Unverified
4	CLIP Surgery (CLIP without any fine-tuning)	mIoU	31.4	—	Unverified
5	TTD (MaskCLIP)	mIoU	27	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-E/14	mIoU	85.4	—	Unverified
2	CAT-Seg	mIoU	82.5	—	Unverified
3	SILC	mIoU	82.5	—	Unverified
4	FC-CLIP	mIoU	81.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU-	43.9	—	Unverified
2	SegEarth-OV	mIoU-	21.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PACL	mIoU	38.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	8.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	54.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	30.89	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	32.12	—	Unverified