| DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained Diffusion | Jan 23, 2023 | Image-text Classification, Node Classification | Code Available | 2 |
| GIST: Generating Image-Specific Text for Fine-grained Object Classification | Jul 21, 2023 | Classification, Fine-Grained Image Classification | Code Available | 1 |
| UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning | May 16, 2023 | Contrastive Learning, Image-text Classification | Code Available | 1 |
| Towards Unifying Medical Vision-and-Language Pre-training via Soft Prompts | Feb 17, 2023 | Image Retrieval, Image-text Classification | Code Available | 1 |
| GLAMI-1M: A Multilingual Image-Text Fashion Dataset | Nov 17, 2022 | Classification, Image Generation | Code Available | 1 |
| Unified Generative and Discriminative Training for Multi-modal Large Language Models | Nov 1, 2024 | Dynamic Time Warping, Image-text Classification | Unverified | 0 |
| Multimodal Quantum Natural Language Processing: A Novel Framework for using Quantum Methods to Analyse Real Data | Oct 29, 2024 | Data Integration, Image-text Classification | Code Available | 0 |
| Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality | Jun 16, 2024 | Federated Learning, Image-text Classification | Unverified | 0 |
| Robust Latent Representation Tuning for Image-text Classification | Jun 10, 2024 | Classification, Image-text Classification | Unverified | 0 |
| Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE | Jun 3, 2024 | Graph Neural Network, Image-text Classification | Unverified | 0 |