SOTAVerified

zero-shot-classification

Papers

Showing 151–175 of 422 papers

| Title | Status | Hype |
| --- | --- | --- |
| Contrastive Adapters for Foundation Model Group Robustness | | 0 |
| BAMBINO-LM: (Bilingual-)Human-Inspired Continual Pretraining of BabyLM | | 0 |
| Adaptive Pruning for Large Language Models with Structural Importance Awareness | | 0 |
| Context-Aware Prompt Tuning for Vision-Language Model with Dual-Alignment | | 0 |
| A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models | | 0 |
| Analysis of the Fed's communication by using textual entailment model of Zero-Shot classification | | 0 |
| Compressing Unknown Images With Product Quantizer for Efficient Zero-Shot Classification | | 0 |
| Comparison of ConvNeXt and Vision-Language Models for Breast Density Assessment in Screening Mammography | | 0 |
| A Zero-Shot Classification Approach for a Word-Guessing Challenge | | 0 |
| Explain via Any Concept: Concept Bottleneck Model with Open Vocabulary Concepts | | 0 |
| Improving Semantic Embedding Consistency by Metric Learning for Zero-Shot Classification | | 0 |
| Intriguing Differences Between Zero-Shot and Systematic Evaluations of Vision-Language Transformer Models | | 0 |
| Label Set Optimization via Activation Distribution Kurtosis for Zero-shot Classification with Generative Models | | 0 |
| Coarse2Fine: Fine-grained Text Classification on Coarsely-grained Annotated Data | | 0 |
| CLIP with Quality Captions: A Strong Pretraining for Vision Tasks | | 0 |
| AVGZSLNet: Audio-Visual Generalized Zero-Shot Learning by Reconstructing Label Features from Multi-Modal Embeddings | | 0 |
| An Analysis of Semantically-Aligned Speech-Text Embeddings | | 0 |
| CLIP meets Model Zoo Experts: Pseudo-Supervision for Visual Enhancement | | 0 |
| 3D Compositional Zero-shot Learning with DeCompositional Consensus | | 0 |
| Adapting Language-Audio Models as Few-Shot Audio Learners | | 0 |
| Image Classification Using a Diffusion Model as a Pre-Training Model | | 0 |
| Deep Learning and NLP in Cryptocurrency Forecasting: Integrating Financial, Blockchain, and Social Media Data | | 0 |
| Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP | | 0 |
| FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing | | 0 |
| Adapting Dual-encoder Vision-language Models for Paraphrased Retrieval | | 0 |

No leaderboard results yet.