| Image-free Classifier Injection for Zero-Shot Classification | Aug 21, 2023 | ClassificationDecoder | CodeCode Available | 1 | 5 |
| MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks | May 9, 2025 | DiagnosticInstruction Following | CodeCode Available | 1 | 5 |
| Exploring the Spectrum of Visio-Linguistic Compositionality and Recognition | Jun 13, 2024 | Retrievalzero-shot-classification | CodeCode Available | 1 | 5 |
| Modeling Caption Diversity in Contrastive Vision-Language Pretraining | Apr 30, 2024 | Diversityzero-shot-classification | CodeCode Available | 1 | 5 |
| No Token Left Behind: Explainability-Aided Image Classification and Generation | Apr 11, 2022 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Label Propagation for Zero-shot Classification with Vision-Language Models | Apr 5, 2024 | ClassificationImage Classification | CodeCode Available | 1 | 5 |
| Beyond the Next Token: Towards Prompt-Robust Zero-Shot Classification via Efficient Multi-Token Prediction | Apr 4, 2025 | AttributeLanguage Modeling | CodeCode Available | 1 | 5 |
| Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models | Jun 10, 2025 | Contrastive LearningImage-text matching | CodeCode Available | 1 | 5 |
| ArcheType: A Novel Framework for Open-Source Column Type Annotation using Large Language Models | Oct 27, 2023 | Column Type AnnotationTable annotation | CodeCode Available | 1 | 5 |
| The Parrot Dilemma: Human-Labeled vs. LLM-augmented Data in Classification Tasks | Apr 26, 2023 | Data AugmentationLanguage Modelling | CodeCode Available | 1 | 5 |
| Advancing Medical Representation Learning Through High-Quality Data | Mar 18, 2025 | Representation Learningzero-shot-classification | CodeCode Available | 1 | 5 |
| EmoCLIP: A Vision-Language Method for Zero-Shot Video Facial Expression Recognition | Oct 25, 2023 | Facial Expression RecognitionFacial Expression Recognition (FER) | CodeCode Available | 1 | 5 |
| Exploring Vision-Language Models for Imbalanced Learning | Apr 4, 2023 | Decoderzero-shot-classification | CodeCode Available | 1 | 5 |
| Discovering Human Interactions With Novel Objects via Zero-Shot Learning | Jun 1, 2020 | Human-Object Interaction DetectionObject | CodeCode Available | 1 | 5 |
| Attentive Mask CLIP | Dec 16, 2022 | Contrastive LearningRetrieval | CodeCode Available | 1 | 5 |
| Discriminative Region-based Multi-Label Zero-Shot Learning | Aug 20, 2021 | Image RetrievalMulti-label zero-shot learning | CodeCode Available | 1 | 5 |
| MineralImage5k: A benchmark for zero-shot raw mineral visual recognition and description | Jul 20, 2023 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 1 | 5 |
| OpenDlign: Open-World Point Cloud Understanding with Depth-Aligned Images | Apr 25, 2024 | Representation LearningTransfer Learning | CodeCode Available | 1 | 5 |
| LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation Models | Feb 6, 2025 | zero-shot-classificationZero-shot Generalization | CodeCode Available | 1 | 5 |
| Differentiable Model Scaling using Differentiable Topk | May 12, 2024 | GPUimage-classification | CodeCode Available | 1 | 5 |
| MedBLIP: Bootstrapping Language-Image Pre-training from 3D Medical Images and Texts | May 18, 2023 | Medical Visual Question AnsweringQuestion Answering | CodeCode Available | 1 | 5 |
| Adversarial Illusions in Multi-Modal Embeddings | Aug 22, 2023 | Image GenerationText Generation | CodeCode Available | 1 | 5 |
| Deep Learning Models for Multilingual Hate Speech Detection | Apr 14, 2020 | Deep LearningHate Speech Detection | CodeCode Available | 1 | 5 |
| Lite-Mind: Towards Efficient and Robust Brain Representation Network | Dec 6, 2023 | Brain DecodingImage Retrieval | CodeCode Available | 1 | 5 |
| CHiLS: Zero-Shot Image Classification with Hierarchical Label Sets | Feb 6, 2023 | Classificationimage-classification | CodeCode Available | 1 | 5 |
| AgriCLIP: Adapting CLIP for Agriculture and Livestock via Domain-Specialized Cross-Model Alignment | Oct 2, 2024 | Self-Supervised Learningzero-shot-classification | CodeCode Available | 1 | 5 |
| Decoupling Zero-Shot Semantic Segmentation | Dec 15, 2021 | Open Vocabulary Semantic SegmentationSegmentation | CodeCode Available | 1 | 5 |
| Open-Pose 3D Zero-Shot Learning: Benchmark and Challenges | Dec 12, 2023 | 3D Object ClassificationClassification | CodeCode Available | 1 | 5 |
| Episode-based Prototype Generating Network for Zero-Shot Learning | Sep 8, 2019 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 1 | 5 |
| CLIPArTT: Adaptation of CLIP to New Domains at Test Time | May 1, 2024 | Pseudo LabelTest-time Adaptation | CodeCode Available | 1 | 5 |
| Zero-Shot Semantic Segmentation | Jun 3, 2019 | General ClassificationSegmentation | CodeCode Available | 1 | 5 |
| CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free | Sep 25, 2023 | Image SegmentationObject Localization | CodeCode Available | 1 | 5 |
| DST-Det: Simple Dynamic Self-Training for Open-Vocabulary Object Detection | Oct 2, 2023 | Novel Object DetectionObject | CodeCode Available | 1 | 5 |
| CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation | Nov 21, 2024 | Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation | CodeCode Available | 1 | 5 |
| CLIP-Guided Source-Free Object Detection in Aerial Images | Jan 10, 2024 | Domain AdaptationObject | CodeCode Available | 1 | 5 |
| CLIP-Lite: Information Efficient Visual Representation Learning with Language Supervision | Dec 14, 2021 | Contrastive LearningRepresentation Learning | CodeCode Available | 1 | 5 |
| CyCLIP: Cyclic Contrastive Language-Image Pretraining | May 28, 2022 | Representation LearningVisual Reasoning | CodeCode Available | 1 | 5 |
| CLIP meets DINO for Tuning Zero-Shot Classifier using Unlabeled Image Collections | Nov 28, 2024 | image-classificationImage Classification | CodeCode Available | 1 | 5 |
| Latent Embedding Feedback and Discriminative Features for Zero-Shot Classification | Mar 17, 2020 | Action ClassificationClassification | CodeCode Available | 1 | 5 |
| CLIP Under the Microscope: A Fine-Grained Analysis of Multi-Object Representation | Feb 27, 2025 | Image-text matchingObject | CodeCode Available | 1 | 5 |
| Adversarial Robustification via Text-to-Image Diffusion Models | Jul 26, 2024 | Adversarial Robustnesszero-shot-classification | CodeCode Available | 1 | 5 |
| CALIP: Zero-Shot Enhancement of CLIP with Parameter-free Attention | Sep 28, 2022 | Training-free 3D Point Cloud ClassificationTransfer Learning | CodeCode Available | 1 | 5 |
| Learning Customized Visual Models with Retrieval-Augmented Knowledge | Jan 17, 2023 | Contrastive LearningRetrieval | CodeCode Available | 1 | 5 |
| PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts | Aug 2, 2023 | Classificationimage-classification | CodeCode Available | 1 | 5 |
| Controlling Latent Diffusion Using Latent CLIP | Mar 11, 2025 | DenoisingDescriptive | CodeCode Available | 1 | 5 |
| Contrastive Language-Image Pre-training for the Italian Language | Aug 19, 2021 | Image RetrievalMulti-label zero-shot learning | CodeCode Available | 1 | 5 |
| From Local Details to Global Context: Advancing Vision-Language Models with Attention-Based Selection | May 19, 2025 | feature selectionOut-of-Distribution Generalization | CodeCode Available | 1 | 5 |
| Improved Probabilistic Image-Text Representations | May 29, 2023 | Data AugmentationImage-text matching | CodeCode Available | 1 | 5 |
| PRIOR: Prototype Representation Joint Learning from Medical Images and Reports | Jul 24, 2023 | Contrastive LearningImage to text | CodeCode Available | 1 | 5 |
| Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion | Feb 7, 2023 | ClassificationDiversity | CodeCode Available | 1 | 5 |