| Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training | Aug 16, 2019 | Image-text matching, Image-text Retrieval | — Unverified | 0 |
| The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision | Apr 26, 2019 | Image-text Retrieval, Object | Code Available | 0 |
| Deep Semantic Multimodal Hashing Network for Scalable Image-Text and Video-Text Retrievals | Jan 9, 2019 | Cross-Modal Retrieval, Deep Hashing | — Unverified | 0 |
| Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval | Oct 1, 2018 | Cross-Modal Retrieval, Image-text Retrieval | — Unverified | 0 |
| Webly Supervised Joint Embedding for Cross-Modal Image-Text Retrieval | Aug 23, 2018 | Cross-Modal Retrieval, Image-text Retrieval | — Unverified | 0 |
| Learning Joint Embedding with Multimodal Cues for Cross-Modal Video-Text Retrieval | Jun 11, 2018 | Image-text Retrieval, Retrieval | Code Available | 0 |
| Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models | Nov 17, 2017 | Cross-Modal Retrieval, Image-text Retrieval | — Unverified | 0 |
| Asymmetrically Weighted CCA And Hierarchical Kernel Sentence Embedding For Image & Text Retrieval | Nov 19, 2015 | Image-text Retrieval, Model Selection | — Unverified | 0 |