SOTAVerified

Open Vocabulary Semantic Segmentation

Papers

Showing 51–100 of 113 papers

Title | Status | Hype
SegPoint: Segment Any Point Cloud via Large Language Model | — | 0
ClearCLIP: Decomposing CLIP Representations for Dense Vision-Language Inference | — | 0
Explore the Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation | Code | 1
Test-time Contrastive Concepts for Open-world Semantic Segmentation | — | 0
A Unified Framework for 3D Scene Understanding | Code | 2
Understanding Multi-Granularity for Open-Vocabulary Part Segmentation | Code | 2
Open-Vocabulary Semantic Segmentation with Image Embedding Balancing | Code | 1
EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding | — | 0
OpenDAS: Open-Vocabulary Domain Adaptation for 2D and 3D Segmentation | — | 0
Parameter-efficient Fine-tuning in Hyperspherical Space for Open-vocabulary Semantic Segmentation | — | 0
Pay Attention to Your Neighbours: Training-Free Open-Vocabulary Semantic Segmentation | Code | 2
Training-Free Open-Vocabulary Segmentation with Offline Diffusion-Augmented Prototype Generation | — | 0
GOV-NeSF: Generalizable Open-Vocabulary Neural Semantic Fields | Code | 1
TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias | Code | 1
Image-to-Image Matching via Foundation Models: A New Perspective for Open-Vocabulary Semantic Segmentation | — | 0
MaskDiffusion: Exploiting Pre-trained Diffusion Models for Semantic Segmentation | Code | 0
TAG: Guidance-free Open-Vocabulary Semantic Segmentation | Code | 1
PosSAM: Panoptic Open-vocabulary Segment Anything | Code | 2
Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision | — | 0
Generalizable Semantic Vision Query Generation for Zero-shot Panoptic and Semantic Segmentation | — | 0
Bridging Generative and Discriminative Models for Unified Visual Perception with Diffusion Priors | — | 0
Exploring Simple Open-Vocabulary Semantic Segmentation | Code | 1
UMG-CLIP: A Unified Multi-Granularity Vision Generalist for Open-World Understanding | Code | 1
Open-Vocabulary 3D Semantic Segmentation with Foundation Models | — | 0
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification | Code | 1
CLIP-DINOiser: Teaching CLIP a few DINO tricks for open-vocabulary semantic segmentation | Code | 2
SkySense: A Multi-Modal Remote Sensing Foundation Model Towards Universal Interpretation for Earth Observation Imagery | Code | 3
Open-Vocabulary Segmentation with Semantic-Assisted Calibration | Code | 1
Auto-Vocabulary Semantic Segmentation | Code | 1
Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models | Code | 1
SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation | Code | 1
Uncovering Prototypical Knowledge for Weakly Open-Vocabulary Semantic Segmentation | Code | 1
SILC: Improving Vision Language Pretraining with Self-Distillation | — | 0
OV-PARTS: Towards Open-Vocabulary Part Segmentation | Code | 1
CLIPSelf: Vision Transformer Distills Itself for Open-Vocabulary Dense Prediction | Code | 2
Learning Mask-aware CLIP Representations for Zero-Shot Segmentation | Code | 1
CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Code | 1
Panoptic Vision-Language Feature Fields | Code | 1
Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter | Code | 1
AttrSeg: Open-Vocabulary Semantic Segmentation via Attribute Decomposition-Aggregation | Code | 0
Convolutions Die Hard: Open-Vocabulary Segmentation with Single Frozen Convolutional CLIP | Code | 2
Exploring Open-Vocabulary Semantic Segmentation without Human Labels | — | 0
SAD: Segment Any RGBD | Code | 2
TagCLIP: Improving Discrimination Ability of Open-Vocabulary Semantic Segmentation | Code | 1
MVP-SEG: Multi-View Prompt Learning for Open-Vocabulary Semantic Segmentation | — | 0
A Closer Look at the Explainability of Contrastive Language-Image Pre-training | Code | 1
Open-Vocabulary Semantic Segmentation with Decoupled One-Pass Network | Code | 1
CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation | Code | 2
Global Knowledge Calibration for Fast Open-Vocabulary Segmentation | Code | 1
Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models | Code | 2
Page 2 of 3

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | HyperSeg | mIoU | 64.6 | — | Unverified
2 | SILC | mIoU | 63.5 | — | Unverified
3 | CAT-Seg | mIoU | 63.3 | — | Unverified
4 | MaskCLIP++ | mIoU | 62.5 | — | Unverified
5 | CLIPSelf | mIoU | 62.3 | — | Unverified
6 | UMG-CLIP-L/14 | mIoU | 61.0 | — | Unverified
7 | SED | mIoU | 60.6 | — | Unverified
8 | Mask-Adapter | mIoU | 60.4 | — | Unverified
9 | EBSeg-L | mIoU | 60.2 | — | Unverified
10 | MAFT+ | mIoU | 59.4 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | UMG-CLIP-E/14 | mIoU | 38.2 | — | Unverified
2 | MaskCLIP++ | mIoU | 38.2 | — | Unverified
3 | Mask-Adapter | mIoU | 38.2 | — | Unverified
4 | CAT-Seg | mIoU | 37.9 | — | Unverified
5 | SILC | mIoU | 37.7 | — | Unverified
6 | UMG-CLIP-L/14 | mIoU | 36.1 | — | Unverified
7 | MAFT+ | mIoU | 36.1 | — | Unverified
8 | OVSeg + OpenDAS | mIoU | 35.8 | — | Unverified
9 | SED | mIoU | 35.2 | — | Unverified
10 | CLIPSelf | mIoU | 34.5 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | UMG-CLIP-E/14 | mIoU | 17.3 | — | Unverified
2 | MaskCLIP++ | mIoU | 16.8 | — | Unverified
3 | Mask-Adapter | mIoU | 16.2 | — | Unverified
4 | CAT-Seg | mIoU | 16.0 | — | Unverified
5 | UMG-CLIP-L/14 | mIoU | 15.4 | — | Unverified
6 | MAFT+ | mIoU | 15.1 | — | Unverified
7 | SILC | mIoU | 15.0 | — | Unverified
8 | PosSAM | mIoU | 14.9 | — | Unverified
9 | FC-CLIP | mIoU | 14.8 | — | Unverified
10 | SCAN | mIoU | 14.0 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | UMG-CLIP-L/14 | mIoU | 97.9 | — | Unverified
2 | SILC | mIoU | 97.6 | — | Unverified
3 | SCAN | mIoU | 97.2 | — | Unverified
4 | CAT-Seg | mIoU | 97.0 | — | Unverified
5 | MaskCLIP++ | mIoU | 96.8 | — | Unverified
6 | MAFT+ | mIoU | 96.5 | — | Unverified
7 | EBSeg-L | mIoU | 96.4 | — | Unverified
8 | FC-CLIP | mIoU | 95.4 | — | Unverified
9 | OVSeg Swin-B | mIoU | 94.5 | — | Unverified
10 | HyperSeg | mIoU | 92.1 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SILC | mIoU | 25.8 | — | Unverified
2 | UMG-CLIP-E/14 | mIoU | 25.2 | — | Unverified
3 | MaskCLIP++ | mIoU | 23.9 | — | Unverified
4 | CAT-Seg | mIoU | 23.8 | — | Unverified
5 | UMG-CLIP-L/14 | mIoU | 23.2 | — | Unverified
6 | Mask-Adapter | mIoU | 22.7 | — | Unverified
7 | SED | mIoU | 22.6 | — | Unverified
8 | MAFT+ | mIoU | 21.6 | — | Unverified
9 | EBSeg-L | mIoU | 21.0 | — | Unverified
10 | FC-CLIP | mIoU | 18.2 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | POMP | HIoU | 39.1 | — | Unverified
2 | ZSSeg | HIoU | 37.8 | — | Unverified
3 | ZegFormer | HIoU | 34.8 | — | Unverified
4 | TTD (TCL) | mIoU | 23.7 | — | Unverified
5 | LaVG | mIoU | 23.2 | — | Unverified
6 | CLIP Surgery (original CLIP without any fine-tuning) | mIoU | 21.9 | — | Unverified
7 | TTD (MaskCLIP) | mIoU | 19.4 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | FC-CLIP | mIoU | 56.2 | — | Unverified
2 | SimSeg | mIoU | 34.5 | — | Unverified
3 | TTD (TCL) | mIoU | 32.0 | — | Unverified
4 | CLIP Surgery (CLIP without any fine-tuning) | mIoU | 31.4 | — | Unverified
5 | TTD (MaskCLIP) | mIoU | 27.0 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | UMG-CLIP-E/14 | mIoU | 85.4 | — | Unverified
2 | CAT-Seg | mIoU | 82.5 | — | Unverified
3 | SILC | mIoU | 82.5 | — | Unverified
4 | FC-CLIP | mIoU | 81.8 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SkySense-O | mIoU | 43.9 | — | Unverified
2 | SegEarth-OV | mIoU | 21.7 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | PACL | mIoU | 38.8 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SkySense-O | mIoU | 8.3 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SkySense-O | mIoU | 54.1 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SkySense-O | mIoU | 30.89 | — | Unverified
# | Model | Metric | Claimed | Verified | Status
1 | SkySense-O | mIoU | 32.12 | — | Unverified
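Almost every table above reports mIoU (mean Intersection-over-Union, averaged over classes); a few rows report HIoU, commonly the harmonic mean of seen-class and unseen-class mIoU in zero-shot settings. As a quick reference, here is a minimal sketch of mIoU over integer label maps. This is illustrative only, not this site's or any benchmark's official evaluation code, which typically accumulates confusion matrices over a whole dataset rather than per image:

```python
import numpy as np

def mean_iou(pred, gt, num_classes):
    """Mean Intersection-over-Union across classes.

    pred, gt: integer label maps of identical shape.
    Classes absent from both prediction and ground truth are skipped,
    so they neither help nor hurt the average.
    """
    ious = []
    for c in range(num_classes):
        p = pred == c
        g = gt == c
        union = np.logical_or(p, g).sum()
        if union == 0:
            continue  # class appears nowhere; exclude from the mean
        inter = np.logical_and(p, g).sum()
        ious.append(inter / union)
    return float(np.mean(ious))

# Toy 2x3 label maps with three classes
pred = np.array([[0, 0, 1], [1, 1, 2]])
gt   = np.array([[0, 1, 1], [1, 1, 2]])
score = mean_iou(pred, gt, num_classes=3)  # (0.5 + 0.75 + 1.0) / 3 = 0.75
```

Note that mIoU weights every class equally regardless of pixel frequency, which is why scores on fine-grained vocabularies (the ~17 and ~25 mIoU tables above) sit far below those on small-vocabulary benchmarks (the ~97 mIoU table).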