SOTAVerified

zero-shot-classification

Papers

Showing 1-50 of 422 papers

Title | Status | Hype
Multimodal Whole Slide Foundation Model for Pathology | Code | 4
Multi-label Cluster Discrimination for Visual Representation Learning | Code | 4
Long-CLIP: Unlocking the Long-Text Capability of CLIP | Code | 4
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges | Code | 4
FG-CLIP: Fine-Grained Visual and Textual Alignment | Code | 4
LLM-Pruner: On the Structural Pruning of Large Language Models | Code | 3
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Code | 2
TabLLM: Few-shot Classification of Tabular Data with Large Language Models | Code | 2
VeCLIP: Improving CLIP Training via Visual-enriched Captions | Code | 2
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation | Code | 2
Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification | Code | 2
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing | Code | 2
DiffCLIP: Differential Attention Meets CLIP | Code | 2
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | Code | 2
RWKV-CLIP: A Robust Vision-Language Representation Learner | Code | 2
Uni3D: Exploring Unified 3D Representation at Scale | Code | 2
Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Code | 2
Boosting Vision-Language Models for Histopathology Classification: Predict all at once | Code | 2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning | Code | 2
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding | Code | 2
ZegCLIP: Towards Adapting CLIP for Zero-shot Semantic Segmentation | Code | 2
Zero-Shot ECG Classification with Multimodal Learning and Test-time Clinical Knowledge Enhancement | Code | 2
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing | Code | 2
CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification | Code | 2
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding | Code | 2
Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner | Code | 2
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature | Code | 2
Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models | Code | 2
Your Diffusion Model is Secretly a Zero-Shot Classifier | Code | 2
Advancing Medical Representation Learning Through High-Quality Data | Code | 1
ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models | Code | 1
CountCLIP -- [Re] Teaching CLIP to Count to Ten | Code | 1
Controlling Latent Diffusion Using Latent CLIP | Code | 1
Contrastive Language-Image Pre-training for the Italian Language | Code | 1
Florence: A New Foundation Model for Computer Vision | Code | 1
CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation | Code | 1
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification | Code | 1
Exploring Vision-Language Models for Imbalanced Learning | Code | 1
CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections | Code | 1
CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision | Code | 1
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation | Code | 1
CyCLIP: Cyclic Contrastive Language-Image Pretraining | Code | 1
From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection | Code | 1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models | Code | 1
DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection | Code | 1
EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition | Code | 1
Discovering Human Interactions With Novel Objects via Zero-Shot Learning | Code | 1
CLIP-Guided Source-Free Object Detection in Aerial Images | Code | 1
CLIPArTT: Adaptation of CLIP to New Domains at Test Time | Code | 1
Discriminative Region-based Multi-Label Zero-Shot Learning | Code | 1

No leaderboard results yet.