| Learning Portrait Style Representations | Dec 8, 2020 | Image Generationzero-shot-classification | CodeCode Available | 0 |
| Zero-Shot Classification by Logical Reasoning on Natural Language Explanations | Nov 7, 2022 | ClassificationLogical Reasoning | CodeCode Available | 0 |
| Design of the topology for contrastive visual-textual alignment | Sep 5, 2022 | Contrastive LearningImage-to-Text Retrieval | CodeCode Available | 0 |
| Multi-level Cross-modal Feature Alignment via Contrastive Learning towards Zero-shot Classification of Remote Sensing Image Scenes | May 31, 2023 | ClassificationContrastive Learning | CodeCode Available | 0 |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers | Nov 10, 2023 | DecoderImage Segmentation | CodeCode Available | 0 |
| Automated Medical Report Generation for ECG Data: Bridging Medical Text and Signal Processing with Deep Learning | Dec 5, 2024 | Comment GenerationDecoder | CodeCode Available | 0 |
| Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention Networks | Dec 3, 2024 | ClassificationScene Classification | CodeCode Available | 0 |
| Learning Deep Representations of Fine-grained Visual Descriptions | May 17, 2016 | AttributeImage Retrieval | CodeCode Available | 0 |
| A Unified Debiasing Approach for Vision-Language Models across Modalities and Tasks | Oct 10, 2024 | FairnessImage Captioning | CodeCode Available | 0 |
| ZSCRGAN: A GAN-based Expectation Maximization Model for Zero-Shot Retrieval of Images from Textual Descriptions | Jul 23, 2020 | Cross-Modal Information RetrievalImage Retrieval | CodeCode Available | 0 |