| MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks | May 9, 2025 | DiagnosticInstruction Following | CodeCode Available | 1 | 5 |
| Controlling Latent Diffusion Using Latent CLIP | Mar 11, 2025 | DenoisingDescriptive | CodeCode Available | 1 | 5 |
| Florence: A New Foundation Model for Computer Vision | Nov 22, 2021 | Action ClassificationAction Recognition | CodeCode Available | 1 | 5 |
| Robust Contrastive Language-Image Pre-training against Data Poisoning and Backdoor Attacks | Mar 13, 2023 | Backdoor AttackData Poisoning | CodeCode Available | 1 | 5 |
| MoralBERT: A Fine-Tuned Language Model for Capturing Moral Values in Social Discussions | Mar 12, 2024 | Domain AdaptationLanguage Modeling | CodeCode Available | 1 | 5 |
| PerceptionCLIP: Visual Classification by Inferring and Conditioning on Contexts | Aug 2, 2023 | Classificationimage-classification | CodeCode Available | 1 | 5 |
| SemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval) | Apr 13, 2023 | ClassificationSentiment Analysis | CodeCode Available | 1 | 5 |
| Discriminative Region-based Multi-Label Zero-Shot Learning | Aug 20, 2021 | Image RetrievalMulti-label zero-shot learning | CodeCode Available | 1 | 5 |
| ReCLIP: Refine Contrastive Language Image Pre-Training with Source Free Domain Adaptation | Aug 4, 2023 | Domain Adaptationimage-classification | CodeCode Available | 1 | 5 |
| Reproducible scaling laws for contrastive language-image learning | Dec 14, 2022 | Image ClassificationOpen Vocabulary Attribute Detection | CodeCode Available | 1 | 5 |
| Open-Pose 3D Zero-Shot Learning: Benchmark and Challenges | Dec 12, 2023 | 3D Object ClassificationClassification | CodeCode Available | 1 | 5 |
| Differentiable Model Scaling using Differentiable Topk | May 12, 2024 | GPUimage-classification | CodeCode Available | 1 | 5 |
| CyCLIP: Cyclic Contrastive Language-Image Pretraining | May 28, 2022 | Representation LearningVisual Reasoning | CodeCode Available | 1 | 5 |
| Discovering Human Interactions With Novel Objects via Zero-Shot Learning | Jun 1, 2020 | Human-Object Interaction DetectionObject | CodeCode Available | 1 | 5 |
| Exploring Vision-Language Models for Imbalanced Learning | Apr 4, 2023 | Decoderzero-shot-classification | CodeCode Available | 1 | 5 |
| DC3DO: Diffusion Classifier for 3D Objects | Aug 13, 2024 | 3D Object ClassificationClassification | CodeCode Available | 1 | 5 |
| Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection | Nov 1, 2023 | ClassificationFew-Shot Object Detection | CodeCode Available | 1 | 5 |
| SenCLIP: Enhancing zero-shot land-use mapping for Sentinel-2 with ground-level prompting | Dec 11, 2024 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 1 | 5 |
| SuS-X: Training-Free Name-Only Transfer of Vision-Language Models | Nov 28, 2022 | Retrievalzero-shot-classification | CodeCode Available | 1 | 5 |
| Perturb and Recover: Fine-tuning for Effective Backdoor Removal from CLIP | Dec 1, 2024 | Natural Language Understandingzero-shot-classification | CodeCode Available | 0 | 5 |
| Design of the topology for contrastive visual-textual alignment | Sep 5, 2022 | Contrastive LearningImage-to-Text Retrieval | CodeCode Available | 0 | 5 |
| Describe me an Aucklet: Generating Grounded Perceptual Category Descriptions | Mar 7, 2023 | nlg evaluationRepresentation Learning | CodeCode Available | 0 | 5 |
| M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining | Jan 29, 2024 | GPUzero-shot-classification | CodeCode Available | 0 | 5 |
| OverPrompt: Enhancing ChatGPT through Efficient In-Context Learning | May 24, 2023 | Data AugmentationFact Checking | CodeCode Available | 0 | 5 |
| DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation | Jul 14, 2025 | DecoderGPU | CodeCode Available | 0 | 5 |
| On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction | Feb 28, 2024 | ClassificationNatural Language Inference | CodeCode Available | 0 | 5 |
| Data-Free Generalized Zero-Shot Learning | Jan 28, 2024 | Generalized Zero-Shot Learningzero-shot-classification | CodeCode Available | 0 | 5 |
| Better Safe than Sorry: Pre-training CLIP against Targeted Data Poisoning and Backdoor Attacks | Oct 5, 2023 | Contrastive LearningData Poisoning | CodeCode Available | 0 | 5 |
| OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment | Mar 3, 2025 | Anomaly LocalizationClassification | CodeCode Available | 0 | 5 |
| Online Zero-Shot Classification with CLIP | Aug 23, 2024 | Classificationzero-shot-classification | CodeCode Available | 0 | 5 |
| On the effectiveness of Large Language Models in the mechanical design domain | May 2, 2025 | ClassificationSentence | CodeCode Available | 0 | 5 |
| Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment | Sep 3, 2024 | Image RetrievalRetrieval | CodeCode Available | 0 | 5 |
| Multi-level Cross-modal Feature Alignment via Contrastive Learning towards Zero-shot Classification of Remote Sensing Image Scenes | May 31, 2023 | ClassificationContrastive Learning | CodeCode Available | 0 | 5 |
| An Efficient Framework for Enhancing Discriminative Models via Diffusion Techniques | Dec 12, 2024 | Classificationimage-classification | CodeCode Available | 0 | 5 |
| Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks | Dec 3, 2024 | ClassificationScene Classification | CodeCode Available | 0 | 5 |
| Robustifying Point Cloud Networks by Refocusing | Aug 10, 2023 | 3D ClassificationAdversarial Defense | CodeCode Available | 0 | 5 |
| MoRE: Multi-Modal Contrastive Pre-training with Transformers on X-Rays, ECGs, and Diagnostic Report | Oct 21, 2024 | DiagnosticMedical Diagnosis | CodeCode Available | 0 | 5 |
| ModalChorus: Visual Probing and Alignment of Multi-modal Embeddings via Modal Fusion Map | Jul 17, 2024 | Cross-Modal RetrievalDimensionality Reduction | CodeCode Available | 0 | 5 |
| NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models | Oct 1, 2024 | Contrastive LearningEEG | CodeCode Available | 0 | 5 |
| Bayesian Modeling of Zero-Shot Classifications for Urban Flood Detection | Mar 18, 2025 | Uncertainty Quantificationzero-shot-classification | CodeCode Available | 0 | 5 |
| Mitigating Word Bias in Zero-shot Prompt-based Classifiers | Sep 10, 2023 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 0 | 5 |
| Connecting NeRFs, Images, and Text | Apr 11, 2024 | NeRFRepresentation Learning | CodeCode Available | 0 | 5 |
| AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis | Aug 4, 2024 | ClassificationDiagnostic | CodeCode Available | 0 | 5 |
| MobileCLIP: Fast Image-Text Models through Multi-Modal Reinforced Training | Nov 28, 2023 | Image CaptioningTransfer Learning | CodeCode Available | 0 | 5 |
| Non-Contrastive Learning Meets Language-Image Pre-Training | Oct 17, 2022 | Contrastive Learningdomain classification | CodeCode Available | 0 | 5 |
| Lex2Sent: A bagging approach to unsupervised sentiment analysis | Sep 26, 2022 | ClassificationDecoder | CodeCode Available | 0 | 5 |
| Linear Representations of Sentiment in Large Language Models | Oct 23, 2023 | zero-shot-classificationZero-Shot Learning | CodeCode Available | 0 | 5 |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers | Nov 10, 2023 | DecoderImage Segmentation | CodeCode Available | 0 | 5 |
| Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning | Dec 5, 2024 | Comment GenerationDecoder | CodeCode Available | 0 | 5 |
| AmorLIP: Efficient Language-Image Pretraining via Amortization | May 25, 2025 | Contrastive LearningRepresentation Learning | CodeCode Available | 0 | 5 |