SOTAVerified

zero-shot-classification

Papers

Showing 151175 of 422 papers

TitleStatusHype
Progressive Local Alignment for Medical Multimodal Pre-training0
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features0
Using tournaments to calculate AUROC for zero-shot classification with LLMs0
Enhancing Chest X-ray Classification through Knowledge Injection in Cross-Modality Learning0
Text Classification in the LLM Era - Where do we stand?0
Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering0
From Haystack to Needle: Label Space Reduction for Zero-shot Classification0
Captured by Captions: On Memorization and its Mitigation in CLIP Models0
DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions0
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation ModelsCode0
KPL: Training-Free Medical Knowledge Mining of Vision-Language ModelsCode0
FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing0
A Statistical Theory of Contrastive Pre-training and Multimodal Generative AICode0
LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries0
Cross-Modal 3D Representation with Multi-View Images and Point Clouds0
Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation0
Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio0
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language AlignmentCode0
Adaptive Pruning for Large Language Models with Structural Importance Awareness0
Zero-Shot Image Moderation in Google Ads with LLM-Assisted Textual Descriptions and Cross-modal Co-embeddings0
CRoF: CLIP-based Robust Few-shot Learning on Noisy Labels0
A Simple and Efficient Baseline for Zero-Shot Generative Classification0
An Efficient Framework for Enhancing Discriminative Models via Diffusion TechniquesCode0
Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision?Code0
Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning0
Show:102550
← PrevPage 7 of 17Next →

No leaderboard results yet.