| From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection | May 19, 2025 | feature selection, Out-of-Distribution Generalization | Code Available | 1 |
| MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks | May 9, 2025 | Diagnostic, Instruction Following | Code Available | 1 |
| Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Apr 4, 2025 | Attribute, Language Modeling | Code Available | 1 |
| Advancing Medical Representation Learning Through High-Quality Data | Mar 18, 2025 | Representation Learning, zero-shot-classification | Code Available | 1 |
| Controlling Latent Diffusion Using Latent CLIP | Mar 11, 2025 | Denoising, Descriptive | Code Available | 1 |
| CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation | Feb 27, 2025 | Image-text matching, Object | Code Available | 1 |
| CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification | Feb 25, 2025 | Denoising, zero-shot-classification | Code Available | 1 |
| LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation Models | Feb 6, 2025 | zero-shot-classification, Zero-shot Generalization | Code Available | 1 |
| SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level prompting | Dec 11, 2024 | zero-shot-classification, Zero-Shot Learning | Code Available | 1 |
| CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections | Nov 28, 2024 | image-classification, Image Classification | Code Available | 1 |