| The style transformer with common knowledge optimization for image-text retrieval | Mar 1, 2023 | Image-text RetrievalRetrieval | —Unverified | 0 |
| Differentiable Outlier Detection Enable Robust Deep Multimodal Analysis | Feb 11, 2023 | Image-text RetrievalKnowledge Graphs | CodeCode Available | 0 |
| USER: Unified Semantic Enhancement with Momentum Contrast for Image-Text Retrieval | Jan 17, 2023 | Contrastive LearningImage-text Retrieval | CodeCode Available | 0 |
| HADA: A Graph-based Amalgamation Framework in Image-text Retrieval | Jan 11, 2023 | Graph Neural NetworkImage Retrieval | CodeCode Available | 0 |
| NAPReg: Nouns As Proxies Regularization for Semantically Aware Cross-Modal Embeddings | Jan 7, 2023 | Cross-Modal RetrievalImage-text Retrieval | CodeCode Available | 0 |
| VL-Match: Enhancing Vision-Language Pretraining with Token-Level and Instance-Level Matching | Jan 1, 2023 | Image-text matchingImage-text Retrieval | —Unverified | 0 |
| Multilateral Semantic Relations Modeling for Image Text Retrieval | Jan 1, 2023 | Image-text RetrievalRetrieval | —Unverified | 0 |
| GAFNet: A Global Fourier Self Attention Based Novel Network for multi-modal downstream tasks | Jan 1, 2023 | Image GenerationImage-text Retrieval | —Unverified | 0 |
| ViLEM: Visual-Language Error Modeling for Image-Text Retrieval | Jan 1, 2023 | Contrastive LearningImage-text Retrieval | —Unverified | 0 |
| Efficient Image Captioning for Edge Devices | Dec 18, 2022 | CPUImage Captioning | —Unverified | 0 |