SOTAVerified

zero-shot-classification

Papers

Showing 2650 of 422 papers

TitleStatusHype
ULIP-2: Towards Scalable Multimodal Pre-training for 3D UnderstandingCode2
Your Diffusion Model is Secretly a Zero-Shot ClassifierCode2
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic SegmentationCode2
TabLLM: Few-shot Classification of Tabular Data with Large Language ModelsCode2
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision ModelsCode1
From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based SelectionCode1
MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from TextbooksCode1
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token PredictionCode1
Advancing Medical Representation Learning Through High-Quality DataCode1
Controlling Latent Diffusion Using Latent CLIPCode1
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object RepresentationCode1
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot ClassificationCode1
LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation ModelsCode1
SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level promptingCode1
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image CollectionsCode1
TableTime: Reformulating Time Series Classification as Zero-Shot Table Understanding via Large Language ModelsCode1
CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic SegmentationCode1
RaVL: Discovering and Mitigating Spurious Correlations in Fine-Tuned Vision-Language ModelsCode1
Interpreting and Analysing CLIP's Zero-Shot Image Classification via Mutual KnowledgeCode1
AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model AlignmentCode1
DC3DO: Diffusion Classifier for 3D ObjectsCode1
Adversarial Robustification via Text-to-Image Diffusion ModelsCode1
Exploring the Spectrum of Visio-Linguistic Compositionality and RecognitionCode1
CountCLIP -- [Re] Teaching CLIP to Count to TenCode1
Differentiable Model Scaling using Differentiable TopkCode1
Show:102550
← PrevPage 2 of 17Next →

No leaderboard results yet.