| Learning Portrait Style Representations | Dec 8, 2020 | Image Generation, zero-shot-classification | Code Available | 0 |
| Zero-Shot Classification by Logical Reasoning on Natural Language Explanations | Nov 7, 2022 | Classification, Logical Reasoning | Code Available | 0 |
| Design of the topology for contrastive visual-textual alignment | Sep 5, 2022 | Contrastive Learning, Image-to-Text Retrieval | Code Available | 0 |
| Multi-level Cross-modal Feature Alignment via Contrastive Learning towards Zero-shot Classification of Remote Sensing Image Scenes | May 31, 2023 | Classification, Contrastive Learning | Code Available | 0 |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers | Nov 10, 2023 | Decoder, Image Segmentation | Code Available | 0 |
| Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning | Dec 5, 2024 | Comment Generation, Decoder | Code Available | 0 |
| Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks | Dec 3, 2024 | Classification, Scene Classification | Code Available | 0 |
| Learning Deep Representations of Fine-grained Visual Descriptions | May 17, 2016 | Attribute, Image Retrieval | Code Available | 0 |
| A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks | Oct 10, 2024 | Fairness, Image Captioning | Code Available | 0 |
| ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual Descriptions | Jul 23, 2020 | Cross-Modal Information Retrieval, Image Retrieval | Code Available | 0 |
| What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models | May 24, 2024 | Classification, image-classification | Code Available | 0 |
| NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models | Oct 1, 2024 | Contrastive Learning, EEG | Code Available | 0 |
| Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision? | Dec 11, 2024 | Prompt Learning, zero-shot-classification | Code Available | 0 |
| Boosting Visual-Language Models by Exploiting Hard Samples | May 9, 2023 | Retrieval, zero-shot-classification | Code Available | 0 |
| Non-Contrastive Learning Meets Language-Image Pre-Training | Oct 17, 2022 | Contrastive Learning, domain classification | Code Available | 0 |
| Large Language Models versus Classical Machine Learning: Performance in COVID-19 Mortality Prediction Using High-Dimensional Tabular Data | Sep 2, 2024 | Mortality Prediction, zero-shot-classification | Code Available | 0 |
| OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment | Mar 3, 2025 | Anomaly Localization, Classification | Code Available | 0 |
| Online Zero-Shot Classification with CLIP | Aug 23, 2024 | Classification, zero-shot-classification | Code Available | 0 |
| On the effectiveness of Large Language Models in the mechanical design domain | May 2, 2025 | Classification, Sentence | Code Available | 0 |
| On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction | Feb 28, 2024 | Classification, Natural Language Inference | Code Available | 0 |
| LAION-5B: An open large-scale dataset for training next generation image-text models | Oct 16, 2022 | Image Generation, Preference Mapping | Code Available | 0 |
| Source-Free Domain Adaptation Guided by Vision and Vision-Language Pre-Training | May 5, 2024 | Domain Adaptation, Language Modelling | Code Available | 0 |
| KPL: Training-Free Medical Knowledge Mining of Vision-Language Models | Jan 20, 2025 | Classification, image-classification | Code Available | 0 |
| Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment | Sep 3, 2024 | Image Retrieval, Retrieval | Code Available | 0 |
| M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining | Jan 29, 2024 | GPU, zero-shot-classification | Code Available | 0 |
| OverPrompt: Enhancing ChatGPT through Efficient In-Context Learning | May 24, 2023 | Data Augmentation, Fact Checking | Code Available | 0 |
| Stacked Semantics-Guided Attention Model for Fine-Grained Zero-Shot Learning | Dec 1, 2018 | General Classification, Multi-class Classification | Code Available | 0 |
| Investigating the Emergent Audio Classification Ability of ASR Foundation Models | Nov 15, 2023 | Audio Classification, Decoder | Code Available | 0 |
| Improving Zero-Shot Detection of Low Prevalence Chest Pathologies using Domain Pre-trained Language Models | Jun 13, 2023 | zero-shot-classification, Zero-Shot Learning | Code Available | 0 |
| StarFT: Robust Fine-tuning of Zero-shot Models via Spuriosity Alignment | May 19, 2025 | zero-shot-classification, Zero-Shot Learning | Code Available | 0 |
| Perturb and Recover: Fine-tuning for Effective Backdoor Removal from CLIP | Dec 1, 2024 | Natural Language Understanding, zero-shot-classification | Code Available | 0 |
| AdaCBM: An Adaptive Concept Bottleneck Model for Explainable and Accurate Diagnosis | Aug 4, 2024 | Classification, Diagnostic | Code Available | 0 |
| Understanding Visual Concepts Across Models | Jun 11, 2024 | Image Generation, object-detection | Code Available | 0 |
| Describe me an Aucklet: Generating Grounded Perceptual Category Descriptions | Mar 7, 2023 | nlg evaluation, Representation Learning | Code Available | 0 |
| DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation | Jul 14, 2025 | Decoder, GPU | Code Available | 0 |
| I can listen but cannot read: An evaluation of two-tower multimodal systems for instrument recognition | Jul 25, 2024 | Instrument Recognition, Retrieval | Code Available | 0 |
| Data-Free Generalized Zero-Shot Learning | Jan 28, 2024 | Generalized Zero-Shot Learning, zero-shot-classification | Code Available | 0 |
| Better Safe than Sorry: Pre-training CLIP against Targeted Data Poisoning and Backdoor Attacks | Oct 5, 2023 | Contrastive Learning, Data Poisoning | Code Available | 0 |
| Uniformity First: Uniformity-aware Test-time Adaptation of Vision-language Models against Image Corruption | May 19, 2025 | Knowledge Distillation, Test-time Adaptation | Code Available | 0 |
| WYTIWYR: A User Intent-Aware Framework with Multi-modal Inputs for Visualization Retrieval | Apr 14, 2023 | Retrieval, zero-shot-classification | Code Available | 0 |
| Harmony: A Joint Self-Supervised and Weakly-Supervised Framework for Learning General Purpose Visual Representations | May 23, 2024 | Contrastive Learning, Instance Segmentation | Code Available | 0 |
| Gradient Matching Generative Networks for Zero-Shot Learning | Jun 1, 2019 | Domain Adaptation, General Classification | Code Available | 0 |
| Robustifying Point Cloud Networks by Refocusing | Aug 10, 2023 | 3D Classification, Adversarial Defense | Code Available | 0 |
| Task-Driven Modular Networks for Zero-Shot Compositional Learning | May 15, 2019 | Attribute, Novel Concepts | Code Available | 0 |
| Connecting NeRFs, Images, and Text | Apr 11, 2024 | NeRF, Representation Learning | Code Available | 0 |
| GLOV: Guided Large Language Models as Implicit Optimizers for Vision Language Models | Oct 8, 2024 | zero-shot-classification, Zero-Shot Learning | Code Available | 0 |
| Real-Time Cell Sorting with Scalable In Situ FPGA-Accelerated Deep Learning | Mar 16, 2025 | Cell Detection, Classification | Code Available | 0 |
| Geodesic Multi-Modal Mixup for Robust Fine-Tuning | Mar 8, 2022 | Image Captioning, zero-shot-classification | Code Available | 0 |
| Telling Stories for Common Sense Zero-Shot Action Recognition | Sep 29, 2023 | Action Recognition, Articles | Code Available | 0 |
| Tell Me What You Know About Sexism: Expert-LLM Interaction Strategies and Co-Created Definitions for Zero-Shot Sexism Detection | Apr 21, 2025 | zero-shot-classification, Zero-Shot Learning | Code Available | 0 |