SOTAVerified

Unsupervised Semantic Segmentation with Language-image Pre-training

A semantic segmentation task that uses no human supervision, except that the backbone is initialised with features pre-trained using image-level labels (e.g. image-text pairs, as in CLIP).
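Most entries below follow a common training-free recipe: a frozen image-text backbone produces per-patch image features, and each patch is assigned the class whose text embedding is most similar. The sketch below illustrates that assignment step only, with random placeholder features standing in for a real backbone (no CLIP model is loaded; shapes and names are illustrative assumptions).

```python
import numpy as np

# Sketch of the per-patch classification step in training-free
# open-vocabulary segmentation. Patch features and class text
# embeddings are random placeholders, not real CLIP outputs.
rng = np.random.default_rng(0)

num_patches, dim, num_classes = 14 * 14, 512, 3  # e.g. ViT-B/16 on a 224x224 image
patch_feats = rng.standard_normal((num_patches, dim))  # stand-in for frozen backbone output
text_embeds = rng.standard_normal((num_classes, dim))  # stand-in for class-name embeddings

def cosine_segment(patch_feats, text_embeds):
    """Assign each patch the class with the highest cosine similarity."""
    p = patch_feats / np.linalg.norm(patch_feats, axis=1, keepdims=True)
    t = text_embeds / np.linalg.norm(text_embeds, axis=1, keepdims=True)
    sim = p @ t.T                # (num_patches, num_classes) cosine similarities
    return sim.argmax(axis=1)    # per-patch class index

seg = cosine_segment(patch_feats, text_embeds).reshape(14, 14)
print(seg.shape)  # (14, 14) patch-level segmentation map
```

The papers listed here differ mainly in how they improve the patch features or the similarity computation (proxy attention, correlation reconstruction, self-distillation, etc.) before this assignment step.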

Papers

Showing 1–10 of 14 papers

| Title | Status | Hype |
|---|---|---|
| TextRegion: Text-Aligned Region Tokens from Frozen Image-Text Models | Code | 2 |
| CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Code | 2 |
| Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation | Code | 2 |
| ProxyCLIP: Proxy Attention Improves CLIP for Open-Vocabulary Segmentation | Code | 2 |
| GroupViT: Semantic Segmentation Emerges from Text Supervision | Code | 2 |
| COSMOS: Cross-Modality Self-Distillation for Vision Language Pre-training | Code | 1 |
| ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements | Code | 1 |
| TTD: Text-Tag Self-Distillation Enhancing Image-Text Alignment in CLIP to Alleviate Single Tag Bias | Code | 1 |
| TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification | Code | 1 |
| TagCLIP: A Local-to-Global Framework to Enhance Open-Vocabulary Multi-Label Classification of CLIP Without Training | Code | 1 |

No leaderboard results yet.