SOTAVerified

zero-shot-classification

Papers

Showing 151200 of 422 papers

TitleStatusHype
Progressive Local Alignment for Medical Multimodal Pre-training0
Using tournaments to calculate AUROC for zero-shot classification with LLMs0
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense FeaturesCode0
Enhancing Chest X-ray Classification through Knowledge Injection in Cross-Modality Learning0
Text Classification in the LLM Era - Where do we stand?0
Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering0
From Haystack to Needle: Label Space Reduction for Zero-shot Classification0
Captured by Captions: On Memorization and its Mitigation in CLIP Models0
DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions0
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation ModelsCode0
KPL: Training-Free Medical Knowledge Mining of Vision-Language ModelsCode0
FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing0
A Statistical Theory of Contrastive Pre-training and Multimodal Generative AICode0
LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries0
Cross-Modal 3D Representation with Multi-View Images and Point Clouds0
Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation0
Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio0
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language AlignmentCode0
Adaptive Pruning for Large Language Models with Structural Importance Awareness0
Zero-Shot Image Moderation in Google Ads with LLM-Assisted Textual Descriptions and Cross-modal Co-embeddings0
CRoF: CLIP-based Robust Few-shot Learning on Noisy Labels0
A Simple and Efficient Baseline for Zero-Shot Generative Classification0
An Efficient Framework for Enhancing Discriminative Models via Diffusion TechniquesCode0
Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision?Code0
Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning0
S^3: Synonymous Semantic Space for Improving Zero-Shot Generalization of Vision-Language Models0
Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep LearningCode0
Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention NetworksCode0
Perturb and Recover: Fine-tuning for Effective Backdoor Removal from CLIPCode0
Active Data Curation Effectively Distills Large-Scale Multimodal Models0
Measuring similarity between embedding spaces using induced neighborhood graphs0
NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics0
Asterisk*: Keep it Simple0
Enhancing Visual Classification using Comparative DescriptorsCode0
ResiDual Transformer Alignment with Spectral Decomposition0
Active Learning for Vision-Language Models0
Fine-tuned Large Language Models (LLMs): Improved Prompt Injection Attacks Detection0
Label Set Optimization via Activation Distribution Kurtosis for Zero-shot Classification with Generative Models0
MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic ReportCode0
Assessing Open-world Forgetting in Generative Image Model Customization0
Can Medical Vision-Language Pre-training Succeed with Purely Synthetic Data?0
LLM Chain Ensembles for Scalable and Accurate Data AnnotationCode0
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning0
A Unified Debiasing Approach for Vision-Language Models across Modalities and TasksCode0
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language ModelsCode0
Improving Predictor Reliability with Selective Recalibration0
An Evaluation of Large Pre-Trained Models for Gesture Recognition using Synthetic Videos0
Toward a Holistic Evaluation of Robustness in CLIP Models0
NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion ModelsCode0
Zero-Shot Classification of Crisis Tweets Using Instruction-Finetuned Large Language Models0
Show:102550
← PrevPage 4 of 9Next →

No leaderboard results yet.