| Learning Portrait Style Representations | Dec 8, 2020 | Image Generation, zero-shot-classification | Code Available | 0 |
| Zero-Shot Classification by Logical Reasoning on Natural Language Explanations | Nov 7, 2022 | Classification, Logical Reasoning | Code Available | 0 |
| Design of the topology for contrastive visual-textual alignment | Sep 5, 2022 | Contrastive Learning, Image-to-Text Retrieval | Code Available | 0 |
| Multi-level Cross-modal Feature Alignment via Contrastive Learning towards Zero-shot Classification of Remote Sensing Image Scenes | May 31, 2023 | Classification, Contrastive Learning | Code Available | 0 |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers | Nov 10, 2023 | Decoder, Image Segmentation | Code Available | 0 |
| Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning | Dec 5, 2024 | Comment Generation, Decoder | Code Available | 0 |
| Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks | Dec 3, 2024 | Classification, Scene Classification | Code Available | 0 |
| Learning Deep Representations of Fine-grained Visual Descriptions | May 17, 2016 | Attribute, Image Retrieval | Code Available | 0 |
| A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks | Oct 10, 2024 | Fairness, Image Captioning | Code Available | 0 |
| ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual Descriptions | Jul 23, 2020 | Cross-Modal Information Retrieval, Image Retrieval | Code Available | 0 |
| What Do You See? Enhancing Zero-Shot Image Classification with Multimodal Large Language Models | May 24, 2024 | Classification, image-classification | Code Available | 0 |
| NECOMIMI: Neural-Cognitive Multimodal EEG-informed Image Generation with Diffusion Models | Oct 1, 2024 | Contrastive Learning, EEG | Code Available | 0 |
| Can Graph Neural Networks Learn Language with Extremely Weak Text Supervision? | Dec 11, 2024 | Prompt Learning, zero-shot-classification | Code Available | 0 |
| Boosting Visual-Language Models by Exploiting Hard Samples | May 9, 2023 | Retrieval, zero-shot-classification | Code Available | 0 |
| Non-Contrastive Learning Meets Language-Image Pre-Training | Oct 17, 2022 | Contrastive Learning, domain classification | Code Available | 0 |
| Large Language Models versus Classical Machine Learning: Performance in COVID-19 Mortality Prediction Using High-Dimensional Tabular Data | Sep 2, 2024 | Mortality Prediction, zero-shot-classification | Code Available | 0 |
| OFF-CLIP: Improving Normal Detection Confidence in Radiology CLIP with Simple Off-Diagonal Term Auto-Adjustment | Mar 3, 2025 | Anomaly Localization, Classification | Code Available | 0 |
| Online Zero-Shot Classification with CLIP | Aug 23, 2024 | Classification, zero-shot-classification | Code Available | 0 |
| On the effectiveness of Large Language Models in the mechanical design domain | May 2, 2025 | Classification, Sentence | Code Available | 0 |
| On the use of Silver Standard Data for Zero-shot Classification Tasks in Information Extraction | Feb 28, 2024 | Classification, Natural Language Inference | Code Available | 0 |
| LAION-5B: An open large-scale dataset for training next generation image-text models | Oct 16, 2022 | Image Generation, Preference Mapping | Code Available | 0 |
| Source-Free Domain Adaptation Guided by Vision and Vision-Language Pre-Training | May 5, 2024 | Domain Adaptation, Language Modelling | Code Available | 0 |
| KPL: Training-Free Medical Knowledge Mining of Vision-Language Models | Jan 20, 2025 | Classification, image-classification | Code Available | 0 |
| Optimizing CLIP Models for Image Retrieval with Maintained Joint-Embedding Alignment | Sep 3, 2024 | Image Retrieval, Retrieval | Code Available | 0 |
| M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining | Jan 29, 2024 | GPU, zero-shot-classification | Code Available | 0 |