| Towards Fast and Accurate Image-Text Retrieval with Self-Supervised Fine-Grained Alignment | Aug 27, 2023 | Contrastive LearningImage-text Retrieval | CodeCode Available | 1 |
| Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval | Aug 24, 2023 | Cross-Modal RetrievalImage-text matching | CodeCode Available | 1 |
| ALIP: Adaptive Language-Image Pre-training with Synthetic Caption | Aug 16, 2023 | Action ClassificationImage-text Retrieval | CodeCode Available | 1 |
| AdvCLIP: Downstream-agnostic Adversarial Examples in Multimodal Contrastive Learning | Aug 14, 2023 | Contrastive LearningGenerative Adversarial Network | CodeCode Available | 1 |
| Set-level Guidance Attack: Boosting Adversarial Transferability of Vision-Language Pre-training Models | Jul 26, 2023 | Image-text RetrievalRetrieval | CodeCode Available | 1 |
| mCLIP: Multilingual CLIP via Cross-lingual Transfer | Jul 10, 2023 | Contrastive LearningCross-Lingual Transfer | CodeCode Available | 1 |
| Contrasting Intra-Modal and Ranking Cross-Modal Hard Negatives to Enhance Visio-Linguistic Compositional Understanding | Jun 15, 2023 | Contrastive Learningimage-classification | CodeCode Available | 1 |
| Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training | Jun 15, 2023 | Image-text RetrievalRepresentation Learning | CodeCode Available | 1 |
| Babel-ImageNet: Massively Multilingual Evaluation of Vision-and-Language Representations | Jun 14, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Global and Local Semantic Completion Learning for Vision-Language Pre-training | Jun 12, 2023 | cross-modal alignmentImage-text Retrieval | CodeCode Available | 1 |