| VisualCOMET: Reasoning about the Dynamic Context of a Still Image | Apr 22, 2020 | Visual Commonsense Reasoning | —Unverified | 0 |
| InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining | Mar 30, 2020 | Image Retrieval, Image-text matching | —Unverified | 0 |
| Connective Cognition Network for Directional Visual Commonsense Reasoning | Dec 1, 2019 | Sentence, Visual Commonsense Reasoning | Code Available | 0 |
| TAB-VCR: Tags and Attributes based VCR Baselines | Dec 1, 2019 | Attribute, Question Answering | Code Available | 0 |
| TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning Baselines | Oct 31, 2019 | Attribute, Question Answering | Code Available | 0 |
| Heterogeneous Graph Learning for Visual Commonsense Reasoning | Oct 25, 2019 | Graph Learning, Visual Commonsense Reasoning | Code Available | 0 |
| Enforcing Reasoning in Visual Commonsense Reasoning | Oct 21, 2019 | Question Answering, Reinforcement Learning | —Unverified | 0 |
| UNITER: Learning UNiversal Image-TExt Representations | Sep 25, 2019 | Image-text matching, Image-text Retrieval | —Unverified | 0 |
| UNITER: UNiversal Image-TExt Representation Learning | Sep 25, 2019 | Image-text matching, Image-text Retrieval | Code Available | 1 |
| VL-BERT: Pre-training of Generic Visual-Linguistic Representations | Aug 22, 2019 | Image-text matching, Language Modelling | Code Available | 1 |
| Unicoder-VL: A Universal Encoder for Vision and Language by Cross-modal Pre-training | Aug 16, 2019 | Image-text matching, Image-text Retrieval | —Unverified | 0 |
| Fusion of Detected Objects in Text for Visual Question Answering | Aug 14, 2019 | Question Answering, Visual Commonsense Reasoning | Code Available | 0 |
| ViLBERT: Pretraining Task-Agnostic Visiolinguistic Representations for Vision-and-Language Tasks | Aug 6, 2019 | Image Retrieval, Question Answering | Code Available | 1 |
| From Recognition to Cognition: Visual Commonsense Reasoning | Nov 27, 2018 | Multiple-choice, Multiple Choice Question Answering (MCQA) | Code Available | 0 |
| Think Visually: Question Answering through Virtual Imagery | May 25, 2018 | Question Answering, Visual Commonsense Reasoning | Code Available | 0 |