Open Vocabulary Semantic Segmentation

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 26–50 of 113 papers

Title	Date	Tasks	Status	Hype
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment	Dec 20, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	0
Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation	Dec 18, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0
MaskCLIP++: A Mask-Based CLIP Fine-tuning Framework for Open-Vocabulary Image Segmentation	Dec 16, 2024	Image SegmentationOpen Vocabulary Semantic Segmentation	CodeCode Available	1
VLMs meet UDA: Boosting Transferability of Open Vocabulary Segmentation with Unsupervised Domain Adaptation	Dec 12, 2024	Domain AdaptationOpen Vocabulary Semantic Segmentation	—Unverified	0
Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation	Dec 5, 2024	Image SegmentationOpen Vocabulary Semantic Segmentation	CodeCode Available	2
Multi-robot autonomous 3D reconstruction using Gaussian splatting with Semantic guidance	Dec 3, 2024	3DGS3D Reconstruction	—Unverified	0
LMSeg: Unleashing the Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation	Nov 30, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0
Distilling Spectral Graph for Object-Context Aware Open-Vocabulary Semantic Segmentation	Nov 26, 2024	ObjectOpen Vocabulary Semantic Segmentation	—Unverified	0
HyperSeg: Towards Universal Visual Segmentation with Large Language Model	Nov 26, 2024	Language ModelingLarge Language Model	CodeCode Available	2
Effective SAM Combination for Open-Vocabulary Semantic Segmentation	Nov 22, 2024	DecoderLanguage Modeling	—Unverified	0
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation	Nov 21, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	1
XMask3D: Cross-modal Mask Reasoning for Open Vocabulary 3D Semantic Segmentation	Nov 20, 2024	3D geometry3D Semantic Segmentation	CodeCode Available	1
ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements	Nov 18, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	1
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	Nov 15, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	2
InvSeg: Test-Time Prompt Inversion for Semantic Segmentation	Oct 15, 2024	Image GenerationOpen Vocabulary Semantic Segmentation	—Unverified	0
3D Vision-Language Gaussian Splatting	Oct 10, 2024	3D ReconstructionAutonomous Driving	—Unverified	0
Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments	Oct 9, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images	Oct 2, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	3
Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels	Sep 30, 2024	Online ClusteringOpen Vocabulary Semantic Segmentation	—Unverified	0
CUS3D :CLIP-based Unsupervised 3D Segmentation via Object-level Denoise	Sep 21, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	—Unverified	0
MROVSeg: Breaking the Resolution Curse of Vision-Language Models in Open-Vocabulary Image Segmentation	Aug 27, 2024	Image SegmentationOpen Vocabulary Semantic Segmentation	—Unverified	0
OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras	Aug 18, 2024	Autonomous DrivingDomain Adaptation	CodeCode Available	0
In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation	Aug 9, 2024	Image to textObject	CodeCode Available	2
ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation	Aug 9, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	2
Collaborative Vision-Text Representation Optimizing for Open-Vocabulary Segmentation	Aug 1, 2024	Open Vocabulary Panoptic SegmentationOpen Vocabulary Semantic Segmentation	CodeCode Available	2

Show:10 25 50

← PrevPage 2 of 5Next →

All datasets PASCAL Context-59 ADE20K-150 ADE20K-847 PascalVOC-20 PASCAL Context-459 COCO-Stuff-171 Cityscapes PascalVOC-20b iSAID Cityscape-171 FAST ISPRS Potsdam

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	HyperSeg	mIoU	64.6	—	Unverified
2	SILC	mIoU	63.5	—	Unverified
3	CAT-Seg	mIoU	63.3	—	Unverified
4	MaskCLIP++	mIoU	62.5	—	Unverified
5	CLIPSelf	mIoU	62.3	—	Unverified
6	UMG-CLIP-L/14	mIoU	61	—	Unverified
7	SED	mIoU	60.6	—	Unverified
8	Mask-Adapter	mIoU	60.4	—	Unverified
9	EBSeg-L	mIoU	60.2	—	Unverified
10	MAFT+	mIoU	59.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-E/14	mIoU	38.2	—	Unverified
2	MaskCLIP++	mIoU	38.2	—	Unverified
3	Mask-Adapter	mIoU	38.2	—	Unverified
4	CAT-Seg	mIoU	37.9	—	Unverified
5	SILC	mIoU	37.7	—	Unverified
6	UMG-CLIP-L/14	mIoU	36.1	—	Unverified
7	MAFT+	mIoU	36.1	—	Unverified
8	OVSeg + OpenDAS	mIoU	35.8	—	Unverified
9	SED	mIoU	35.2	—	Unverified
10	CLIPSelf	mIoU	34.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-E/14	mIoU	17.3	—	Unverified
2	MaskCLIP++	mIoU	16.8	—	Unverified
3	Mask-Adapter	mIoU	16.2	—	Unverified
4	CAT-Seg	mIoU	16	—	Unverified
5	UMG-CLIP-L/14	mIoU	15.4	—	Unverified
6	MAFT+	mIoU	15.1	—	Unverified
7	SILC	mIoU	15	—	Unverified
8	PosSAM	mIoU	14.9	—	Unverified
9	FC-CLIP	mIoU	14.8	—	Unverified
10	SCAN	mIoU	14	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-L/14	mIoU	97.9	—	Unverified
2	SILC	mIoU	97.6	—	Unverified
3	SCAN	mIoU	97.2	—	Unverified
4	CAT-Seg	mIoU	97	—	Unverified
5	MaskCLIP++	mIoU	96.8	—	Unverified
6	MAFT+	mIoU	96.5	—	Unverified
7	EBSeg-L	mIoU	96.4	—	Unverified
8	FC-CLIP	mIoU	95.4	—	Unverified
9	OVSeg Swin-B	mIoU	94.5	—	Unverified
10	HyperSeg	mIoU	92.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SILC	mIoU	25.8	—	Unverified
2	UMG-CLIP-E/14	mIoU	25.2	—	Unverified
3	MaskCLIP++	mIoU	23.9	—	Unverified
4	CAT-Seg	mIoU	23.8	—	Unverified
5	UMG-CLIP-L/14	mIoU	23.2	—	Unverified
6	Mask-Adapter	mIoU	22.7	—	Unverified
7	SED	mIoU	22.6	—	Unverified
8	MAFT+	mIoU	21.6	—	Unverified
9	EBSeg-L	mIoU	21	—	Unverified
10	FC-CLIP	mIoU	18.2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	POMP	HIoU	39.1	—	Unverified
2	ZSSeg	HIoU	37.8	—	Unverified
3	ZegFormer	HIoU	34.8	—	Unverified
4	TTD (TCL)	mIoU	23.7	—	Unverified
5	LaVG	mIoU	23.2	—	Unverified
6	CLIP Surgery (original CLIP without any fine-tuning)	mIoU	21.9	—	Unverified
7	TTD (MaskCLIP)	mIoU	19.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FC-CLIP	mIoU	56.2	—	Unverified
2	SimSeg	mIoU	34.5	—	Unverified
3	TTD (TCL)	mIoU	32	—	Unverified
4	CLIP Surgery (CLIP without any fine-tuning)	mIoU	31.4	—	Unverified
5	TTD (MaskCLIP)	mIoU	27	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	UMG-CLIP-E/14	mIoU	85.4	—	Unverified
2	CAT-Seg	mIoU	82.5	—	Unverified
3	SILC	mIoU	82.5	—	Unverified
4	FC-CLIP	mIoU	81.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU-	43.9	—	Unverified
2	SegEarth-OV	mIoU-	21.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PACL	mIoU	38.8	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	8.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	54.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	30.89	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SkySense-O	mIoU	32.12	—	Unverified