zero-shot-classification

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–50 of 422 papers

Title	Date	Tasks	Status	Hype
DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation	Jul 14, 2025	DecoderGPU	CodeCode Available	0
Comparison of ConvNeXt and Vision-Language Models for Breast Density Assessment in Screening Mammography	Jun 16, 2025	breast density classificationClassification	—Unverified	0
Harmonizing and Merging Source Models for CLIP-based Domain Generalization	Jun 11, 2025	Domain Generalizationzero-shot-classification	—Unverified	0
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models	Jun 10, 2025	Contrastive LearningImage-text matching	CodeCode Available	1
GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models	May 30, 2025	ClassificationDisaster Response	CodeCode Available	2
Distill CLIP (DCLIP): Enhancing Image-Text Retrieval via Cross-Modal Transformer Distillation	May 25, 2025	Contrastive LearningImage-text Retrieval	—Unverified	0
AmorLIP: Efficient Language-Image Pretraining via Amortization	May 25, 2025	Contrastive LearningRepresentation Learning	CodeCode Available	0
Beginning with You: Perceptual-Initialization Improves Vision-Language Representation and Alignment	May 20, 2025	Representation LearningRetrieval	—Unverified	0
From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection	May 19, 2025	feature selectionOut-of-Distribution Generalization	CodeCode Available	1
StarFT: Robust Fine-tuning of Zero-shot Models via Spuriosity Alignment	May 19, 2025	zero-shot-classificationZero-Shot Learning	CodeCode Available	0
Uniformity First: Uniformity-aware Test-time Adaptation of Vision-language Models against Image Corruption	May 19, 2025	Knowledge DistillationTest-time Adaptation	CodeCode Available	0
Patho-R1: A Multimodal Reinforcement Learning-Based Pathology Expert Reasoner	May 16, 2025	Cross-Modal RetrievalDiagnostic	CodeCode Available	2
Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors	May 15, 2025	Language ModelingLanguage Modelling	—Unverified	0
TACOS: Temporally-aligned Audio CaptiOnS for Language-Audio Pretraining	May 12, 2025	Audio captioningAudio Generation	—Unverified	0
Image Classification Using a Diffusion Model as a Pre-Training Model	May 11, 2025	Contrastive Learningimage-classification	—Unverified	0
MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks	May 9, 2025	DiagnosticInstruction Following	CodeCode Available	1
FG-CLIP: Fine-Grained Visual and Textual Alignment	May 8, 2025	Image-text Retrievalobject-detection	CodeCode Available	4
Fill the Gap: Quantifying and Reducing the Modality Gap in Image-Text Representation Learning	May 6, 2025	Representation Learningzero-shot-classification	—Unverified	0
On the effectiveness of Large Language Models in the mechanical design domain	May 2, 2025	ClassificationSentence	CodeCode Available	0
Helping Large Language Models Protect Themselves: An Enhanced Filtering and Summarization System	May 2, 2025	zero-shot-classificationZero-Shot Learning	—Unverified	0
Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection	Apr 21, 2025	zero-shot-classificationZero-Shot Learning	CodeCode Available	0
RadZero: Similarity-Based Cross-Attention for Explainable Vision-Language Alignment in Radiology with Zero-Shot Multi-Task Capability	Apr 10, 2025	Contrastive LearningOpen Vocabulary Semantic Segmentation	—Unverified	0
Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction	Apr 4, 2025	AttributeLanguage Modeling	CodeCode Available	1
Refining CLIP's Spatial Awareness: A Visual-Centric Perspective	Apr 3, 2025	zero-shot-classificationZero-Shot Learning	—Unverified	0
CIBR: Cross-modal Information Bottleneck Regularization for Robust CLIP Generalization	Mar 31, 2025	Contrastive Learningimage-classification	—Unverified	0
ViLAaD: Enhancing "Attracting and Dispersing'' Source-Free Domain Adaptation with Vision-and-Language Model	Mar 30, 2025	Domain AdaptationLanguage Modeling	—Unverified	0
Enhancing Small Language Models for Cross-Lingual Generalized Zero-Shot Classification with Soft Prompt Tuning	Mar 25, 2025	Cross-Lingual Transferzero-shot-classification	—Unverified	0
Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection	Mar 21, 2025	Edge DetectionRetrieval	—Unverified	0
Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection	Mar 18, 2025	Uncertainty Quantificationzero-shot-classification	CodeCode Available	0
Advancing Medical Representation Learning Through High-Quality Data	Mar 18, 2025	Representation Learningzero-shot-classification	CodeCode Available	1
Real-Time Cell Sorting with Scalable In Situ FPGA-Accelerated Deep Learning	Mar 16, 2025	Cell DetectionClassification	CodeCode Available	0
TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification	Mar 15, 2025	Domain Generalizationimage-classification	CodeCode Available	0
Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images	Mar 13, 2025	Diagnosticimage-classification	—Unverified	0
Controlling Latent Diffusion Using Latent CLIP	Mar 11, 2025	DenoisingDescriptive	CodeCode Available	1
DiffCLIP: Differential Attention Meets CLIP	Mar 9, 2025	Language ModelingLanguage Modelling	CodeCode Available	2
OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment	Mar 3, 2025	Anomaly LocalizationClassification	CodeCode Available	0
A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models	Mar 3, 2025	Transfer Learningzero-shot-classification	—Unverified	0
Analyzing CLIP's Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study	Feb 27, 2025	Image GenerationObject	—Unverified	0
SuPreME: A Supervised Pre-training Framework for Multimodal ECG Representation Learning	Feb 27, 2025	DiagnosticRepresentation Learning	—Unverified	0
CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation	Feb 27, 2025	Image-text matchingObject	CodeCode Available	1
Knowledge-enhanced Multimodal ECG Representation Learning with Arbitrary-Lead Inputs	Feb 25, 2025	Representation Learningzero-shot-classification	—Unverified	0
Progressive Local Alignment for Medical Multimodal Pre-training	Feb 25, 2025	Contrastive LearningImage-text Retrieval	—Unverified	0
CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification	Feb 25, 2025	Denoisingzero-shot-classification	CodeCode Available	1
UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting	Feb 25, 2025	3DGScross-modal alignment	—Unverified	0
Using tournaments to calculate AUROC for zero-shot classification with LLMs	Feb 20, 2025	Binary ClassificationClassification	—Unverified	0
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features	Feb 20, 2025	FairnessImage-text Retrieval	—Unverified	0
Enhancing Chest X-ray Classification through Knowledge Injection in Cross-Modality Learning	Feb 19, 2025	Caption GenerationClassification	—Unverified	0
Text Classification in the LLM Era - Where do we stand?	Feb 17, 2025	ClassificationSentiment Analysis	—Unverified	0
Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering	Feb 13, 2025	ClassificationPrompt Engineering	—Unverified	0
From Haystack to Needle: Label Space Reduction for Zero-shot Classification	Feb 12, 2025	Classificationzero-shot-classification	—Unverified	0

Show:10 25 50

← PrevPage 1 of 9Next →

No leaderboard results yet.