| ViLAaD: Enhancing "Attracting and Dispersing'' Source-Free Domain Adaptation with Vision-and-Language Model | Mar 30, 2025 | Domain AdaptationLanguage Modeling | —Unverified | 0 |
| Enhancing Small Language Models for Cross-Lingual Generalized Zero-Shot Classification with Soft Prompt Tuning | Mar 25, 2025 | Cross-Lingual Transferzero-shot-classification | —Unverified | 0 |
| Seeing What Matters: Empowering CLIP with Patch Generation-to-Selection | Mar 21, 2025 | Edge DetectionRetrieval | —Unverified | 0 |
| Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection | Mar 18, 2025 | Uncertainty Quantificationzero-shot-classification | CodeCode Available | 0 |
| Advancing Medical Representation Learning Through High-Quality Data | Mar 18, 2025 | Representation Learningzero-shot-classification | CodeCode Available | 1 |
| Real-Time Cell Sorting with Scalable In Situ FPGA-Accelerated Deep Learning | Mar 16, 2025 | Cell DetectionClassification | CodeCode Available | 0 |
| TLAC: Two-stage LMM Augmented CLIP for Zero-Shot Classification | Mar 15, 2025 | Domain Generalizationimage-classification | CodeCode Available | 0 |
| Leveraging Vision-Language Embeddings for Zero-Shot Learning in Histopathology Images | Mar 13, 2025 | Diagnosticimage-classification | —Unverified | 0 |
| Controlling Latent Diffusion Using Latent CLIP | Mar 11, 2025 | DenoisingDescriptive | CodeCode Available | 1 |
| DiffCLIP: Differential Attention Meets CLIP | Mar 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment | Mar 3, 2025 | Anomaly LocalizationClassification | CodeCode Available | 0 |
| A Zero-Shot Learning Approach for Ephemeral Gully Detection from Remote Sensing using Vision Language Models | Mar 3, 2025 | Transfer Learningzero-shot-classification | —Unverified | 0 |
| Analyzing CLIP's Performance Limitations in Multi-Object Scenarios: A Controlled High-Resolution Study | Feb 27, 2025 | Image GenerationObject | —Unverified | 0 |
| SuPreME: A Supervised Pre-training Framework for Multimodal ECG Representation Learning | Feb 27, 2025 | DiagnosticRepresentation Learning | —Unverified | 0 |
| CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation | Feb 27, 2025 | Image-text matchingObject | CodeCode Available | 1 |
| Knowledge-enhanced Multimodal ECG Representation Learning with Arbitrary-Lead Inputs | Feb 25, 2025 | Representation Learningzero-shot-classification | —Unverified | 0 |
| Progressive Local Alignment for Medical Multimodal Pre-training | Feb 25, 2025 | Contrastive LearningImage-text Retrieval | —Unverified | 0 |
| CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification | Feb 25, 2025 | Denoisingzero-shot-classification | CodeCode Available | 1 |
| UniGS: Unified Language-Image-3D Pretraining with Gaussian Splatting | Feb 25, 2025 | 3DGScross-modal alignment | —Unverified | 0 |
| Using tournaments to calculate AUROC for zero-shot classification with LLMs | Feb 20, 2025 | Binary ClassificationClassification | —Unverified | 0 |
| SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features | Feb 20, 2025 | FairnessImage-text Retrieval | CodeCode Available | 0 |
| Enhancing Chest X-ray Classification through Knowledge Injection in Cross-Modality Learning | Feb 19, 2025 | Caption GenerationClassification | —Unverified | 0 |
| Text Classification in the LLM Era - Where do we stand? | Feb 17, 2025 | ClassificationSentiment Analysis | —Unverified | 0 |
| Optimizing GPT for Video Understanding: Zero-Shot Performance and Prompt Engineering | Feb 13, 2025 | ClassificationPrompt Engineering | —Unverified | 0 |
| From Haystack to Needle: Label Space Reduction for Zero-shot Classification | Feb 12, 2025 | Classificationzero-shot-classification | —Unverified | 0 |