Open Vocabulary Semantic Segmentation

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 113 papers

Title	Date	Tasks	Status	Hype	Score
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models	Nov 28, 2023	Image CaptioningImage-text matching	CodeCode Available	1	5
ReME: A Data-Centric Framework for Training-Free Open-Vocabulary Segmentation	Jun 26, 2025	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	1	5
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation	Nov 27, 2023	DecoderOpen Vocabulary Semantic Segmentation	CodeCode Available	1	5
SegCLIP: Patch Aggregation with Learnable Centers for Open-Vocabulary Semantic Segmentation	Nov 27, 2022	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	1	5
Auto-Vocabulary Semantic Segmentation	Dec 7, 2023	Language ModelingLanguage Modelling	CodeCode Available	1	5
Semantic Library Adaptation: LoRA Retrieval and Fusion for Open-Vocabulary Semantic Segmentation	Mar 27, 2025	Domain AdaptationOpen Vocabulary Semantic Segmentation	CodeCode Available	1	5
Show or Tell? A Benchmark To Evaluate Visual and Textual Prompts in Semantic Segmentation	May 6, 2025	Open Vocabulary Semantic SegmentationPrompt Engineering	CodeCode Available	1	5
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification	Dec 21, 2023	AttributeOpen Vocabulary Semantic Segmentation	CodeCode Available	1	5
TagCLIP: Improving Discrimination Ability of Open-Vocabulary Semantic Segmentation	Apr 15, 2023	Language ModelingLanguage Modelling	CodeCode Available	1	5
TAG: Guidance-free Open-Vocabulary Semantic Segmentation	Mar 17, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	1	5
Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation	May 28, 2025	image-classificationImage Classification	CodeCode Available	1	5
TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias	Mar 30, 2024	Multi-Label Text ClassificationOpen Vocabulary Semantic Segmentation	CodeCode Available	1	5
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding	Jan 12, 2024	Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation	CodeCode Available	1	5
Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation	Oct 29, 2023	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	1	5
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Nov 20, 2024	3D geometry3D Semantic Segmentation	CodeCode Available	1	5
A Simple Baseline for Open-Vocabulary Semantic Segmentation with Pre-trained Vision-language Model	Dec 29, 2021	image-classificationImage Classification	CodeCode Available	1	5
OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras	Aug 18, 2024	Autonomous DrivingDomain Adaptation	CodeCode Available	0	5
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation	Aug 31, 2023	AttributeOpen Vocabulary Semantic Segmentation	CodeCode Available	0	5
Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation Only	Jan 1, 2023	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	0	5
Test-Time Optimization for Domain Adaptive Open Vocabulary Segmentation	Jan 8, 2025	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	0	5
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment	Dec 20, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	0	5
A Language-Guided Benchmark for Weakly Supervised Open Vocabulary Semantic Segmentation	Feb 27, 2023	Few-Shot Semantic SegmentationLanguage Modelling	CodeCode Available	0	5
MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation	Mar 17, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	0	5
OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies	Dec 31, 2024	3DGS3D Semantic Segmentation	CodeCode Available	0	5
Test-time Contrastive Concepts for Open-world Semantic Segmentation	Jul 6, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
SILC: Improving Vision Language Pretraining with Self-Distillation	Oct 20, 2023	ClassificationContrastive Learning	—Unverified	0	0
Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation	May 29, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
CUS3D :CLIP-based Unsupervised 3D Segmentation via Object-level Denoise	Sep 21, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
From Open-Vocabulary to Vocabulary-Free Semantic Segmentation	Feb 17, 2025	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation	Feb 21, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
Rethinking the Global Knowledge of CLIP in Training-Free Open-Vocabulary Semantic Segmentation	Feb 5, 2025	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
Personalized OVSS: Understanding Personal Concept in Open-Vocabulary Semantic Segmentation	Jul 15, 2025	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding	Jun 3, 2024	Domain AdaptationOpen Vocabulary Semantic Segmentation	—Unverified	0	0
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation	Mar 30, 2024	AttributeOpen Vocabulary Semantic Segmentation	—Unverified	0	0
Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation	Dec 18, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
Dual Semantic Guidance for Open Vocabulary Semantic Segmentation	Jan 1, 2025	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation	Oct 15, 2024	Image GenerationOpen Vocabulary Semantic Segmentation	—Unverified	0	0
ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference	Jul 17, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
Class Enhancement Losses with Pseudo Labels for Zero-shot Semantic Segmentation	Jan 18, 2023	Language ModelingLanguage Modelling	—Unverified	0	0
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability	Apr 10, 2025	Contrastive LearningOpen Vocabulary Semantic Segmentation	—Unverified	0	0
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation	Apr 9, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors	Jan 29, 2024	DecoderImage Generation	—Unverified	0	0
LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation	Nov 30, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space	Jan 1, 2025	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0	0
DPSeg: Dual-Prompt Cost Volume Learning for Open-Vocabulary Semantic Segmentation	May 16, 2025	DecoderOpen Vocabulary Semantic Segmentation	—Unverified	0	0
3D Vision-Language Gaussian Splatting	Oct 10, 2024	3D ReconstructionAutonomous Driving	—Unverified	0	0
MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Image Segmentation	Aug 27, 2024	Image SegmentationOpen Vocabulary Semantic Segmentation	—Unverified	0	0
Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision	Mar 6, 2024	Contrastive Learningcross-modal alignment	—Unverified	0	0
Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance	Dec 3, 2024	3DGS3D Reconstruction	—Unverified	0	0
MVP-SEG: Multi-View Prompt Learning for Open-Vocabulary Semantic Segmentation	Apr 14, 2023	GPROpen Vocabulary Semantic Segmentation	—Unverified	0	0

Show:10 25 50

← PrevPage 2 of 3Next →

All datasets PASCAL Context-59 ADE20K-150 ADE20K-847 PascalVOC-20 PASCAL Context-459 COCO-Stuff-171 Cityscapes PascalVOC-20b iSAID Cityscape-171 FAST ISPRS Potsdam

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	HyperSeg	mIoU	64.6	—	Unverified
2	SILC	mIoU	63.5	—	Unverified
3	CAT-Seg	mIoU	63.3	—	Unverified
4	MaskCLIP++	mIoU	62.5	—	Unverified
5	CLIPSelf	mIoU	62.3	—	Unverified
6	UMG-CLIP-L/14	mIoU	61	—	Unverified
7	SED	mIoU	60.6	—	Unverified
8	Mask-Adapter	mIoU	60.4	—	Unverified
9	EBSeg-L	mIoU	60.2	—	Unverified
10	MAFT+	mIoU	59.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-E/14	mIoU	38.2	—	Unverified
2	MaskCLIP++	mIoU	38.2	—	Unverified
3	Mask-Adapter	mIoU	38.2	—	Unverified
4	CAT-Seg	mIoU	37.9	—	Unverified
5	SILC	mIoU	37.7	—	Unverified
6	UMG-CLIP-L/14	mIoU	36.1	—	Unverified
7	MAFT+	mIoU	36.1	—	Unverified
8	OVSeg + OpenDAS	mIoU	35.8	—	Unverified
9	SED	mIoU	35.2	—	Unverified
10	CLIPSelf	mIoU	34.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-E/14	mIoU	17.3	—	Unverified
2	MaskCLIP++	mIoU	16.8	—	Unverified
3	Mask-Adapter	mIoU	16.2	—	Unverified
4	CAT-Seg	mIoU	16	—	Unverified
5	UMG-CLIP-L/14	mIoU	15.4	—	Unverified
6	MAFT+	mIoU	15.1	—	Unverified
7	SILC	mIoU	15	—	Unverified
8	PosSAM	mIoU	14.9	—	Unverified
9	FC-CLIP	mIoU	14.8	—	Unverified
10	SCAN	mIoU	14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-L/14	mIoU	97.9	—	Unverified
2	SILC	mIoU	97.6	—	Unverified
3	SCAN	mIoU	97.2	—	Unverified
4	CAT-Seg	mIoU	97	—	Unverified
5	MaskCLIP++	mIoU	96.8	—	Unverified
6	MAFT+	mIoU	96.5	—	Unverified
7	EBSeg-L	mIoU	96.4	—	Unverified
8	FC-CLIP	mIoU	95.4	—	Unverified
9	OVSeg Swin-B	mIoU	94.5	—	Unverified
10	HyperSeg	mIoU	92.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SILC	mIoU	25.8	—	Unverified
2	UMG-CLIP-E/14	mIoU	25.2	—	Unverified
3	MaskCLIP++	mIoU	23.9	—	Unverified
4	CAT-Seg	mIoU	23.8	—	Unverified
5	UMG-CLIP-L/14	mIoU	23.2	—	Unverified
6	Mask-Adapter	mIoU	22.7	—	Unverified
7	SED	mIoU	22.6	—	Unverified
8	MAFT+	mIoU	21.6	—	Unverified
9	EBSeg-L	mIoU	21	—	Unverified
10	FC-CLIP	mIoU	18.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	POMP	HIoU	39.1	—	Unverified
2	ZSSeg	HIoU	37.8	—	Unverified
3	ZegFormer	HIoU	34.8	—	Unverified
4	TTD (TCL)	mIoU	23.7	—	Unverified
5	LaVG	mIoU	23.2	—	Unverified
6	CLIP Surgery (original CLIP without any fine-tuning)	mIoU	21.9	—	Unverified
7	TTD (MaskCLIP)	mIoU	19.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FC-CLIP	mIoU	56.2	—	Unverified
2	SimSeg	mIoU	34.5	—	Unverified
3	TTD (TCL)	mIoU	32	—	Unverified
4	CLIP Surgery (CLIP without any fine-tuning)	mIoU	31.4	—	Unverified
5	TTD (MaskCLIP)	mIoU	27	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-E/14	mIoU	85.4	—	Unverified
2	CAT-Seg	mIoU	82.5	—	Unverified
3	SILC	mIoU	82.5	—	Unverified
4	FC-CLIP	mIoU	81.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU-	43.9	—	Unverified
2	SegEarth-OV	mIoU-	21.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PACL	mIoU	38.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	8.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	54.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	30.89	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	32.12	—	Unverified