| VisualCOMET: Reasoning about the Dynamic Context of a Still Image | Apr 22, 2020 | Visual Commonsense Reasoning | Unverified | 0 |
| InterBERT: Vision-and-Language Interaction for Multi-modal Pretraining | Mar 30, 2020 | Image Retrieval, Image-text matching | Unverified | 0 |
| Connective Cognition Network for Directional Visual Commonsense Reasoning | Dec 1, 2019 | Sentence, Visual Commonsense Reasoning | Code Available | 0 |
| TAB-VCR: Tags and Attributes based VCR Baselines | Dec 1, 2019 | Attribute, Question Answering | Code Available | 0 |
| TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning Baselines | Oct 31, 2019 | Attribute, Question Answering | Code Available | 0 |
| Heterogeneous Graph Learning for Visual Commonsense Reasoning | Oct 25, 2019 | Graph Learning, Visual Commonsense Reasoning | Code Available | 0 |
| Enforcing Reasoning in Visual Commonsense Reasoning | Oct 21, 2019 | Question Answering, Reinforcement Learning | Unverified | 0 |
| UNITER: Learning UNiversal Image-TExt Representations | Sep 25, 2019 | Image-text matching, Image-text Retrieval | Unverified | 0 |
| UNITER: UNiversal Image-TExt Representation Learning | Sep 25, 2019 | Image-text matching, Image-text Retrieval | Code Available | 1 |
| VL-BERT: Pre-training of Generic Visual-Linguistic Representations | Aug 22, 2019 | Image-text matching, Language Modelling | Code Available | 1 |