| Interpreting and Analysing CLIP's Zero-Shot Image Classification via Mutual Knowledge | Oct 16, 2024 | Classificationimage-classification | CodeCode Available | 1 | 5 |
| DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection | Oct 2, 2023 | Novel Object DetectionObject | CodeCode Available | 1 | 5 |
| Improved Zero-Shot Classification by Adapting VLMs with Text Descriptions | Jan 4, 2024 | Fine-Grained Image Classificationimage-classification | CodeCode Available | 1 | 5 |
| The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks | Apr 26, 2023 | Data AugmentationLanguage Modelling | CodeCode Available | 1 | 5 |
| EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition | Oct 25, 2023 | Facial Expression RecognitionFacial Expression Recognition (FER) | CodeCode Available | 1 | 5 |
| CountCLIP -- [Re] Teaching CLIP to Count to Ten | Jun 5, 2024 | zero-shot-classificationZero-Shot Counting | CodeCode Available | 1 | 5 |
| Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Apr 4, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 | 5 |
| CyCLIP: Cyclic Contrastive Language-Image Pretraining | May 28, 2022 | Representation LearningVisual Reasoning | CodeCode Available | 1 | 5 |
| ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models | Oct 27, 2023 | Column Type AnnotationTable annotation | CodeCode Available | 1 | 5 |
| Decoupling Zero-Shot Semantic Segmentation | Dec 15, 2021 | Open Vocabulary Semantic SegmentationSegmentation | CodeCode Available | 1 | 5 |
| Zero-Shot Semantic Segmentation | Jun 3, 2019 | General ClassificationSegmentation | CodeCode Available | 1 | 5 |
| Attentive Mask CLIP | Dec 16, 2022 | Contrastive LearningRetrieval | CodeCode Available | 1 | 5 |
| Improved Probabilistic Image-Text Representations | May 29, 2023 | Data AugmentationImage-text matching | CodeCode Available | 1 | 5 |
| Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification | Mar 17, 2020 | Action ClassificationClassification | CodeCode Available | 1 | 5 |
| CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation | Mar 19, 2024 | DecoderInstance Segmentation | CodeCode Available | 1 | 5 |
| AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment | Oct 2, 2024 | Self-Supervised Learningzero-shot-classification | CodeCode Available | 1 | 5 |
| CLIPure: Purification in Latent Space via CLIP for Adversarially Robust Zero-Shot Classification | Feb 25, 2025 | Denoisingzero-shot-classification | CodeCode Available | 1 | 5 |
| CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections | Nov 28, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Contrastive Language-Image Pre-training for the Italian Language | Aug 19, 2021 | Image RetrievalMulti-label zero-shot learning | CodeCode Available | 1 | 5 |
| CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation | Nov 21, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 | 5 |
| CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Sep 25, 2023 | Image SegmentationObject Localization | CodeCode Available | 1 | 5 |
| Adversarial Robustification via Text-to-Image Diffusion Models | Jul 26, 2024 | Adversarial Robustnesszero-shot-classification | CodeCode Available | 1 | 5 |
| CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation | Feb 27, 2025 | Image-text matchingObject | CodeCode Available | 1 | 5 |
| Adversarial Illusions in Multi-Modal Embeddings | Aug 22, 2023 | Image GenerationText Generation | CodeCode Available | 1 | 5 |
| CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention | Sep 28, 2022 | Training-free 3D Point Cloud ClassificationTransfer Learning | CodeCode Available | 1 | 5 |