zero-shot-classification

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 422 papers

Title	Date	Tasks	Status
Progressive Local Alignment for Medical Multimodal Pre-training	Feb 25, 2025	Contrastive LearningImage-text Retrieval	—Unverified
Using tournaments to calculate AUROC for zero-shot classification with LLMs	Feb 20, 2025	Binary ClassificationClassification	—Unverified
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features	Feb 20, 2025	FairnessImage-text Retrieval	—Unverified
Enhancing Chest X-ray Classification through Knowledge Injection in Cross-Modality Learning	Feb 19, 2025	Caption GenerationClassification	—Unverified
Text Classification in the LLM Era - Where do we stand?	Feb 17, 2025	ClassificationSentiment Analysis	—Unverified
Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering	Feb 13, 2025	ClassificationPrompt Engineering	—Unverified
From Haystack to Needle: Label Space Reduction for Zero-shot Classification	Feb 12, 2025	Classificationzero-shot-classification	—Unverified
Captured by Captions: On Memorization and its Mitigation in CLIP Models	Feb 11, 2025	Image RetrievalMemorization	—Unverified
DCFormer: Efficient 3D Vision-Language Modeling with Decomposed Convolutions	Feb 7, 2025	Anomaly DetectionImage-text Retrieval	—Unverified
Revisiting CLIP: Efficient Alignment of 3D MRI and Tabular Data using Domain-Specific Foundation Models	Jan 23, 2025	Image RetrievalRetrieval	CodeCode Available
KPL: Training-Free Medical Knowledge Mining of Vision-Language Models	Jan 20, 2025	Classificationimage-classification	CodeCode Available
FLAVARS: A Multimodal Foundational Language and Vision Alignment Model for Remote Sensing	Jan 14, 2025	ClassificationContrastive Learning	—Unverified
A Statistical Theory of Contrastive Pre-training and Multimodal Generative AI	Jan 8, 2025	zero-shot-classificationZero-Shot Learning	CodeCode Available
LLMs & Legal Aid: Understanding Legal Needs Exhibited Through User Queries	Jan 3, 2025	Hallucinationzero-shot-classification	—Unverified
Cross-Modal 3D Representation with Multi-View Images and Point Clouds	Jan 1, 2025	Autonomous DrivingCross-Modal Retrieval	—Unverified
Generalized Zero-Shot Classification via Semantics-Free Inter-Class Feature Generation	Jan 1, 2025	Classificationcross-modal alignment	—Unverified
Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio	Dec 23, 2024	Contrastive LearningPrompt Learning	—Unverified
DINOv2 Meets Text: A Unified Framework for Image- and Pixel-Level Vision-Language Alignment	Dec 20, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available
Adaptive Pruning for Large Language Models with Structural Importance Awareness	Dec 19, 2024	Text Generationzero-shot-classification	—Unverified
Zero-Shot Image Moderation in Google Ads with LLM-Assisted Textual Descriptions and Cross-modal Co-embeddings	Dec 18, 2024	zero-shot-classificationZero-Shot Learning	—Unverified
CRoF: CLIP-based Robust Few-shot Learning on Noisy Labels	Dec 17, 2024	Domain GeneralizationFew-Shot Learning	—Unverified
A Simple and Efficient Baseline for Zero-Shot Generative Classification	Dec 17, 2024	zero-shot-classificationZero-Shot Learning	—Unverified
An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques	Dec 12, 2024	Classificationimage-classification	CodeCode Available
Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision?	Dec 11, 2024	Prompt Learningzero-shot-classification	CodeCode Available
Explaining and Mitigating the Modality Gap in Contrastive Multimodal Learning	Dec 10, 2024	Contrastive LearningImage-text Retrieval	—Unverified
S^3: Synonymous Semantic Space for Improving Zero-Shot Generalization of Vision-Language Models	Dec 6, 2024	zero-shot-classificationZero-shot Generalization	—Unverified
Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning	Dec 5, 2024	Comment GenerationDecoder	CodeCode Available
Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks	Dec 3, 2024	ClassificationScene Classification	CodeCode Available
Perturb and Recover: Fine-tuning for Effective Backdoor Removal from CLIP	Dec 1, 2024	Natural Language Understandingzero-shot-classification	CodeCode Available
Active Data Curation Effectively Distills Large-Scale Multimodal Models	Nov 27, 2024	DecoderImage Captioning	—Unverified
Measuring similarity between embedding spaces using induced neighborhood graphs	Nov 13, 2024	zero-shot-classificationZero-Shot Learning	—Unverified
NatureLM-audio: an Audio-Language Foundation Model for Bioacoustics	Nov 11, 2024	zero-shot-classificationZero-Shot Learning	—Unverified
Asterisk*: Keep it Simple	Nov 8, 2024	ClassificationKnowledge Distillation	—Unverified
Enhancing Visual Classification using Comparative Descriptors	Nov 8, 2024	Classificationzero-shot-classification	CodeCode Available
ResiDual Transformer Alignment with Spectral Decomposition	Oct 31, 2024	zero-shot-classificationZero-Shot Learning	—Unverified
Active Learning for Vision-Language Models	Oct 29, 2024	Active Learningimage-classification	—Unverified
Fine-tuned Large Language Models (LLMs): Improved Prompt Injection Attacks Detection	Oct 28, 2024	zero-shot-classificationZero-Shot Learning	—Unverified
Label Set Optimization via Activation Distribution Kurtosis for Zero-shot Classification with Generative Models	Oct 24, 2024	ClassificationIn-Context Learning	—Unverified
MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report	Oct 21, 2024	DiagnosticMedical Diagnosis	CodeCode Available
Assessing Open-world Forgetting in Generative Image Model Customization	Oct 18, 2024	Image Generationzero-shot-classification	—Unverified
Can Medical Vision-Language Pre-training Succeed with Purely Synthetic Data?	Oct 17, 2024	zero-shot-classificationZero-Shot Learning	—Unverified
LLM Chain Ensembles for Scalable and Accurate Data Annotation	Oct 16, 2024	zero-shot-classificationZero-Shot Learning	CodeCode Available
CtrlSynth: Controllable Image Text Synthesis for Data-Efficient Multimodal Learning	Oct 15, 2024	Image-text RetrievalText Retrieval	—Unverified
A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks	Oct 10, 2024	FairnessImage Captioning	CodeCode Available
GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models	Oct 8, 2024	zero-shot-classificationZero-Shot Learning	CodeCode Available
Improving Predictor Reliability with Selective Recalibration	Oct 7, 2024	zero-shot-classificationZero-Shot Learning	—Unverified
An Evaluation of Large Pre-Trained Models for Gesture Recognition using Synthetic Videos	Oct 3, 2024	ClassificationGesture Recognition	—Unverified
Toward a Holistic Evaluation of Robustness in CLIP Models	Oct 2, 2024	ClassificationOut-of-Distribution Detection	—Unverified
NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models	Oct 1, 2024	Contrastive LearningEEG	CodeCode Available
Zero-Shot Classification of Crisis Tweets Using Instruction-Finetuned Large Language Models	Sep 30, 2024	ClassificationDisaster Response	—Unverified

Show:10 25 50

← PrevPage 4 of 9Next →

No leaderboard results yet.