UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training Apr 1, 2021 Image-text matching Image-text Retrieval
— Unverified 0Macroscopic Control of Text Generation for Image Captioning Jan 20, 2021 Diversity Image Captioning
— Unverified 0Similarity Reasoning and Filtration for Image-Text Matching Jan 5, 2021 Cross-Modal Retrieval Image Retrieval
Code Code Available 1VinVL: Revisiting Visual Representations in Vision-Language Models Jan 2, 2021 Image Captioning Image-text matching
Code Code Available 2Learning Dual Semantic Relations with Graph Attention for Image-Text Matching Oct 22, 2020 Cross-Modal Retrieval Graph Attention
Code Code Available 1MedICaT: A Dataset of Medical Images, Captions, and Textual References Oct 12, 2020 document understanding Image-text matching
Code Code Available 1Universal Weighting Metric Learning for Cross-Modal Matching Oct 7, 2020 Image-text matching Metric Learning
Code Code Available 1Contrastive Cross-Modal Pre-Training: A General Strategy for Small Sample Medical Imaging Oct 6, 2020 Image Classification Image-text matching
— Unverified 0Consensus-Aware Visual-Semantic Embedding for Image-Text Matching Jul 17, 2020 Image Captioning Image-text matching
Code Code Available 1A Novel Attention-based Aggregation Function to Combine Vision and Language Apr 27, 2020 General Classification Image Captioning
— Unverified 0Deep Multimodal Neural Architecture Search Apr 25, 2020 Decoder Image-text matching
Code Code Available 1Transformer Reasoning Network for Image-Text Matching and Retrieval Apr 20, 2020 Image Retrieval Image-text matching
Code Code Available 1Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks Apr 13, 2020 Cross-Modal Retrieval Image Captioning
Code Code Available 2Text-Guided Neural Image Inpainting Apr 7, 2020 Descriptive Image Generation
Code Code Available 1Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers Apr 2, 2020 Image-text matching Image-text Retrieval
Code Code Available 1More Grounded Image Captioning by Distilling Image-Text Matching Model Apr 1, 2020 Image Captioning Image-text matching
Code Code Available 1Graph Structured Network for Image-Text Matching Apr 1, 2020 Attribute Cross-Modal Retrieval
Code Code Available 1InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining Mar 30, 2020 Image Retrieval Image-text matching
— Unverified 0Adaptive Offline Quintuplet Loss for Image-Text Matching Mar 7, 2020 Image-text matching Text Matching
Code Code Available 1Expressing Objects just like Words: Recurrent Visual Embedding for Image-Text Matching Feb 20, 2020 Image-text matching Object
— Unverified 0ImageBERT: Cross-modal Pre-training with Large-scale Weak-supervised Image-Text Data Jan 22, 2020 Image Retrieval Image-text matching
— Unverified 0Learning fragment self-attention embeddings for image-text matching Oct 1, 2019 Image-text matching Sentence
Code Code Available 0UNITER: Learning UNiversal Image-TExt Representations Sep 25, 2019 Image-text matching Image-text Retrieval
— Unverified 0UNITER: UNiversal Image-TExt Representation Learning Sep 25, 2019 Image-text matching Image-text Retrieval
Code Code Available 1Learning Visual Relation Priors for Image-Text Matching and Image Captioning with Neural Scene Graph Generators Sep 22, 2019 Image Captioning Image-text matching
— Unverified 0Visual Semantic Reasoning for Image-Text Matching Sep 6, 2019 Cross-Modal Retrieval Image Retrieval
Code Code Available 1VL-BERT: Pre-training of Generic Visual-Linguistic Representations Aug 22, 2019 Image-text matching Language Modelling
Code Code Available 1Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training Aug 16, 2019 Image-text matching Image-text Retrieval
— Unverified 0Matching Images and Text with Multi-modal Tensor Fusion and Re-ranking Aug 12, 2019 Binary Classification General Classification
Code Code Available 0Knowledge Aware Semantic Concept Expansion for Image-Text Matching Aug 10, 2019 Common Sense Reasoning Content-Based Image Retrieval
— Unverified 0Position Focused Attention Network for Image-Text Matching Jul 23, 2019 Image-text matching Position
Code Code Available 0ParNet: Position-aware Aggregated Relation Network for Image-Text matching Jun 17, 2019 Image-text matching Position
— Unverified 0Deep Cross-Modal Projection Learning for Image-Text Matching Sep 1, 2018 Cross-Modal Retrieval Image-text matching
Code Code Available 0Stacked Cross Attention for Image-Text Matching Mar 21, 2018 Cross-Modal Retrieval Image Retrieval
Code Code Available 1AttnGAN: Fine-Grained Text to Image Generation with Attentional Generative Adversarial Networks Nov 28, 2017 Generative Adversarial Network Image Generation
Code Code Available 1Cross-modal Subspace Learning for Fine-grained Sketch-based Image Retrieval May 28, 2017 Cross-Modal Retrieval Image Retrieval
— Unverified 0Learning Two-Branch Neural Networks for Image-Text Matching Tasks Apr 11, 2017 Image-text matching Retrieval
Code Code Available 0Dual Attention Networks for Multimodal Reasoning and Matching Nov 2, 2016 Collaborative Inference Image-text matching
Code Code Available 0