| Contrastive Language-Image Pre-training for the Italian Language | Aug 19, 2021 | Image RetrievalMulti-label zero-shot learning | CodeCode Available | 1 |
| Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition | Jun 13, 2024 | Retrievalzero-shot-classification | CodeCode Available | 1 |
| Zero-Shot Semantic Segmentation | Jun 3, 2019 | General ClassificationSegmentation | CodeCode Available | 1 |
| Attentive Mask CLIP | Dec 16, 2022 | Contrastive LearningRetrieval | CodeCode Available | 1 |
| CLIP-VIS: Adapting CLIP for Open-Vocabulary Video Instance Segmentation | Mar 19, 2024 | DecoderInstance Segmentation | CodeCode Available | 1 |
| Image-free Classifier Injection for Zero-Shot Classification | Aug 21, 2023 | ClassificationDecoder | CodeCode Available | 1 |
| Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Apr 4, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 |
| CountCLIP -- [Re] Teaching CLIP to Count to Ten | Jun 5, 2024 | zero-shot-classificationZero-Shot Counting | CodeCode Available | 1 |
| ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models | Oct 27, 2023 | Column Type AnnotationTable annotation | CodeCode Available | 1 |
| Differentiable Model Scaling using Differentiable Topk | May 12, 2024 | GPUimage-classification | CodeCode Available | 1 |