SOTAVerified

zero-shot-classification

Papers

Showing 1-25 of 422 papers

Title | Status | Hype
Multimodal Whole Slide Foundation Model for Pathology | Code | 4
Multi-label Cluster Discrimination for Visual Representation Learning | Code | 4
Long-CLIP: Unlocking the Long-Text Capability of CLIP | Code | 4
FG-CLIP: Fine-Grained Visual and Textual Alignment | Code | 4
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges | Code | 4
LLM-Pruner: On the Structural Pruning of Large Language Models | Code | 3
RWKV-CLIP: A Robust Vision-Language Representation Learner | Code | 2
Robust CLIP: Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models | Code | 2
TabLLM: Few-shot Classification of Tabular Data with Large Language Models | Code | 2
Large-scale and Fine-grained Vision-language Pre-training for Enhanced CT Image Understanding | Code | 2
Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Code | 2
RemoteCLIP: A Vision Language Foundation Model for Remote Sensing | Code | 2
CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation | Code | 2
RS5M and GeoRSCLIP: A Large Scale Vision-Language Dataset and A Large Vision-Language Model for Remote Sensing | Code | 2
Boosting Vision-Language Models for Histopathology Classification: Predict all at once | Code | 2
VeCLIP: Improving CLIP Training via Visual-enriched Captions | Code | 2
Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification | Code | 2
Harnessing Explanations: LLM-to-LM Interpreter for Enhanced Text-Attributed Graph Representation Learning | Code | 2
CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation | Code | 2
BIOMEDICA: An Open Biomedical Image-Caption Archive, Dataset, and Vision-Language Models Derived from Scientific Literature | Code | 2
CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification | Code | 2
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | Code | 2
Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner | Code | 2
DiffCLIP: Differential Attention Meets CLIP | Code | 2
ULIP-2: Towards Scalable Multimodal Pre-training for 3D Understanding | Code | 2

No leaderboard results yet.