zero-shot-classification

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 422 papers

Title	Date	Tasks	Status	Hype
FG-CLIP: Fine-Grained Visual and Textual Alignment	May 8, 2025	Image-text Retrievalobject-detection	CodeCode Available	4
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges	Jan 4, 2025	FairnessHallucination	CodeCode Available	4
Multimodal Whole Slide Foundation Model for Pathology	Nov 29, 2024	Cross-Modal Retrievalmodel	CodeCode Available	4
Multi-label Cluster Discrimination for Visual Representation Learning	Jul 24, 2024	Contrastive LearningImage-text Retrieval	CodeCode Available	4
Long-CLIP: Unlocking the Long-Text Capability of CLIP	Mar 22, 2024	Image GenerationImage Retrieval	CodeCode Available	4
LLM-Pruner: On the Structural Pruning of Large Language Models	May 19, 2023	Text Generationzero-shot-classification	CodeCode Available	3
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models	May 30, 2025	ClassificationDisaster Response	CodeCode Available	2
Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner	May 16, 2025	Cross-Modal RetrievalDiagnostic	CodeCode Available	2
DiffCLIP: Differential Attention Meets CLIP	Mar 9, 2025	Language ModelingLanguage Modelling	CodeCode Available	2
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding	Jan 24, 2025	AnatomyContrastive Learning	CodeCode Available	2
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature	Jan 13, 2025	ArticlesImage-text Retrieval	CodeCode Available	2
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation	Nov 15, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	2
Boosting Vision-Language Models for Histopathology Classification: Predict all at once	Sep 3, 2024	Allzero-shot-classification	CodeCode Available	2
Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification	Sep 1, 2024	Scene ClassificationTransductive Zero-Shot Classification	CodeCode Available	2
Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP	Jun 25, 2024	cross-modal alignmentImage Classification	CodeCode Available	2
RWKV-CLIP: A Robust Vision-Language Representation Learner	Jun 11, 2024	Image-text RetrievalRepresentation Learning	CodeCode Available	2
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation	Apr 30, 2024	MambaState Space Models	CodeCode Available	2
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement	Mar 11, 2024	Clinical KnowledgeDescriptive	CodeCode Available	2
CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification	Feb 27, 2024	ClassificationDiagnostic	CodeCode Available	2
Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models	Feb 19, 2024	Adversarial DefenseMultimodal Deep Learning	CodeCode Available	2
VeCLIP: Improving CLIP Training via Visual-enriched Captions	Oct 11, 2023	Image-text RetrievalRetrieval	CodeCode Available	2
Uni3D: Exploring Unified 3D Representation at Scale	Oct 10, 2023	3D Object ClassificationRetrieval	CodeCode Available	2
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing	Jun 20, 2023	Cross-Modal RetrievalImage Retrieval	CodeCode Available	2
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing	Jun 19, 2023	ClassificationCross-Modal Retrieval	CodeCode Available	2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning	May 31, 2023	Decision MakingGeneral Knowledge	CodeCode Available	2
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding	May 14, 2023	3D Classification3D Point Cloud Classification	CodeCode Available	2
Your Diffusion Model is Secretly a Zero-Shot Classifier	Mar 28, 2023	Domain GeneralizationFine-Grained Image Classification	CodeCode Available	2
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation	Dec 7, 2022	Semantic Segmentationzero-shot-classification	CodeCode Available	2
TabLLM: Few-shot Classification of Tabular Data with Large Language Models	Oct 19, 2022	ClassificationDeep Learning	CodeCode Available	2
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models	Jun 10, 2025	Contrastive LearningImage-text matching	CodeCode Available	1
From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection	May 19, 2025	feature selectionOut-of-Distribution Generalization	CodeCode Available	1
MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks	May 9, 2025	DiagnosticInstruction Following	CodeCode Available	1
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction	Apr 4, 2025	AttributeLanguage Modeling	CodeCode Available	1
Advancing Medical Representation Learning Through High-Quality Data	Mar 18, 2025	Representation Learningzero-shot-classification	CodeCode Available	1
Controlling Latent Diffusion Using Latent CLIP	Mar 11, 2025	DenoisingDescriptive	CodeCode Available	1
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation	Feb 27, 2025	Image-text matchingObject	CodeCode Available	1
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification	Feb 25, 2025	Denoisingzero-shot-classification	CodeCode Available	1
LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation Models	Feb 6, 2025	zero-shot-classificationZero-shot Generalization	CodeCode Available	1
SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level prompting	Dec 11, 2024	zero-shot-classificationZero-Shot Learning	CodeCode Available	1
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections	Nov 28, 2024	image-classificationImage Classification	CodeCode Available	1
TableTime: Reformulating Time Series Classification as Zero-Shot Table Understanding via Large Language Models	Nov 24, 2024	Problem DecompositionTime Series	CodeCode Available	1
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation	Nov 21, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	1
RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language Models	Nov 6, 2024	image-classificationImage Classification	CodeCode Available	1
Interpreting and Analysing CLIP's Zero-Shot Image Classification via Mutual Knowledge	Oct 16, 2024	Classificationimage-classification	CodeCode Available	1
AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment	Oct 2, 2024	Self-Supervised Learningzero-shot-classification	CodeCode Available	1
DC3DO: Diffusion Classifier for 3D Objects	Aug 13, 2024	3D Object ClassificationClassification	CodeCode Available	1
Adversarial Robustification via Text-to-Image Diffusion Models	Jul 26, 2024	Adversarial Robustnesszero-shot-classification	CodeCode Available	1
Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition	Jun 13, 2024	Retrievalzero-shot-classification	CodeCode Available	1
CountCLIP -- [Re] Teaching CLIP to Count to Ten	Jun 5, 2024	zero-shot-classificationZero-Shot Counting	CodeCode Available	1
Differentiable Model Scaling using Differentiable Topk	May 12, 2024	GPUimage-classification	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 9Next →

No leaderboard results yet.