| Learning to embed semantic similarity for joint image-text retrieval | Oct 7, 2022 | Image-text Retrieval, Metric Learning | Unverified | 0 |
| Efficient Multilingual Multi-modal Pre-training through Triple Contrastive Loss | Oct 1, 2022 | Image Classification | Unverified | 0 |
| Re-Imagen: Retrieval-Augmented Text-to-Image Generator | Sep 29, 2022 | Image Generation, Image-text Retrieval | Unverified | 0 |
| VL-Taboo: An Analysis of Attribute-based Zero-shot Capabilities of Vision-Language Models | Sep 12, 2022 | Attribute, Image-text Retrieval | Code Available | 0 |
| Revising Image-Text Retrieval via Multi-Modal Entailment | Aug 22, 2022 | Image-text Retrieval, Natural Language Inference | Unverified | 0 |
| CODER: Coupled Diversity-Sensitive Momentum Contrastive Learning for Image-Text Retrieval | Aug 21, 2022 | Clustering, Contrastive Learning | Unverified | 0 |
| VLMAE: Vision-Language Masked Autoencoder | Aug 19, 2022 | Image-text Retrieval, Language Modeling | Unverified | 0 |
| Intra-Modal Constraint Loss For Image-Text Retrieval | Jul 11, 2022 | Cross-Modal Retrieval, Image-text Retrieval | Code Available | 0 |
| Dynamic Contrastive Distillation for Image-Text Retrieval | Jul 4, 2022 | Contrastive Learning, GPU | Unverified | 0 |
| VL-BEiT: Generative Vision-Language Pretraining | Jun 2, 2022 | Image Classification | Unverified | 0 |