| VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level Matching | Jan 1, 2023 | Image-text matchingImage-text Retrieval | —Unverified | 0 |
| Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval | Aug 23, 2018 | Cross-Modal RetrievalImage-text Retrieval | —Unverified | 0 |
| Webly Supervised Joint Embedding for Cross-Modal lmage-Text Retrieval | Oct 1, 2018 | Cross-Modal RetrievalImage-text Retrieval | —Unverified | 0 |
| XGPT: Cross-modal Generative Pre-Training for Image Captioning | Mar 3, 2020 | Data AugmentationDenoising | —Unverified | 0 |
| Toward Automatic Relevance Judgment using Vision--Language Models for Image--Text Retrieval Evaluation | Aug 2, 2024 | Image-text RetrievalRetrieval | —Unverified | 0 |
| Re-Imagen: Retrieval-Augmented Text-to-Image Generator | Sep 29, 2022 | Image GenerationImage-text Retrieval | —Unverified | 0 |
| Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval | Jul 17, 2024 | Image-text RetrievalObject | CodeCode Available | 0 |
| Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval | Nov 5, 2021 | Image-text RetrievalRetrieval | CodeCode Available | 0 |
| GSSF: Generalized Structural Sparse Function for Deep Cross-modal Metric Learning | Oct 20, 2024 | Image RetrievalImage-text Retrieval | CodeCode Available | 0 |
| NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings | Jan 7, 2023 | Cross-Modal RetrievalImage-text Retrieval | CodeCode Available | 0 |