| MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and Texts | May 18, 2023 | Medical Visual Question AnsweringQuestion Answering | CodeCode Available | 1 | 5 |
| Controlling Latent Diffusion Using Latent CLIP | Mar 11, 2025 | DenoisingDescriptive | CodeCode Available | 1 | 5 |
| DC3DO: Diffusion Classifier for 3D Objects | Aug 13, 2024 | 3D Object ClassificationClassification | CodeCode Available | 1 | 5 |
| CyCLIP: Cyclic Contrastive Language-Image Pretraining | May 28, 2022 | Representation LearningVisual Reasoning | CodeCode Available | 1 | 5 |
| Interpreting and Analysing CLIP's Zero-Shot Image Classification via Mutual Knowledge | Oct 16, 2024 | Classificationimage-classification | CodeCode Available | 1 | 5 |
| The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks | Apr 26, 2023 | Data AugmentationLanguage Modelling | CodeCode Available | 1 | 5 |
| Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion | Feb 7, 2023 | ClassificationDiversity | CodeCode Available | 1 | 5 |
| CountCLIP -- [Re] Teaching CLIP to Count to Ten | Jun 5, 2024 | zero-shot-classificationZero-Shot Counting | CodeCode Available | 1 | 5 |
| SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level prompting | Dec 11, 2024 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 1 | 5 |
| Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models | Jun 10, 2025 | Contrastive LearningImage-text matching | CodeCode Available | 1 | 5 |