| MUREL: Multimodal Relational Reasoning for Visual Question Answering | Feb 25, 2019 | Relational Reasoning, Visual Question Answering | Code Available | 0 |
| Dual Attention Networks for Visual Reference Resolution in Visual Dialog | Feb 25, 2019 | AI Agent, Question Answering | Code Available | 0 |
| Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering | Feb 21, 2019 | counterfactual, Question Answering | Unverified | 0 |
| Generating Natural Language Explanations for Visual Question Answering using Scene Graphs and Visual Attention | Feb 15, 2019 | Explanation Generation, Language Modeling | Unverified | 0 |
| Cycle-Consistency for Robust Visual Question Answering | Feb 15, 2019 | Question Answering, Question Generation | Unverified | 0 |
| Taking a HINT: Leveraging Explanations to Make Vision and Language Models More Grounded | Feb 11, 2019 | Image Captioning, Question Answering | Unverified | 0 |
| VrR-VG: Refocusing Visually-Relevant Relationships | Feb 1, 2019 | Image Captioning, Question Answering | Unverified | 0 |
| BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection | Jan 31, 2019 | Question Answering, Relationship Detection | Code Available | 0 |
| Visual Entailment: A Novel Task for Fine-Grained Image Understanding | Jan 20, 2019 | Natural Language Inference, Question Answering | Unverified | 0 |
| CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions | Jan 3, 2019 | Diagnostic, Image Segmentation | Code Available | 0 |
| The meaning of "most" for visual question answering models | Dec 31, 2018 | Question Answering, Visual Question Answering | Unverified | 0 |
| Scene Graph Reasoning with Prior Visual Relationship for Visual Question Answering | Dec 23, 2018 | Cross-Modal Information Retrieval, Information Retrieval | Unverified | 0 |
| Focal Visual-Text Attention for Memex Question Answering | Dec 14, 2018 | Memex Question Answering, Question Answering | Code Available | 0 |
| Dynamic Fusion with Intra- and Inter- Modality Attention Flow for Visual Question Answering | Dec 13, 2018 | Question Answering, Visual Question Answering | Unverified | 0 |
| Spatial Knowledge Distillation to aid Visual Reasoning | Dec 10, 2018 | Diagnostic, Knowledge Distillation | Unverified | 0 |
| Learning Representations of Sets through Optimized Permutations | Dec 10, 2018 | General Classification, Question Answering | Code Available | 0 |
| Recursive Visual Attention in Visual Dialog | Dec 6, 2018 | Question Answering, Visual Dialog | Code Available | 0 |
| Multi-task Learning of Hierarchical Vision-Language Representation | Dec 3, 2018 | Multi-Task Learning, Question Answering | Unverified | 0 |
| Learning to Specialize with Knowledge Distillation for Visual Question Answering | Dec 1, 2018 | General Classification, General Knowledge | Unverified | 0 |
| Chain of Reasoning for Visual Question Answering | Dec 1, 2018 | Object, Question Answering | Unverified | 0 |
| From Known to the Unknown: Transferring Knowledge to Answer Questions about Novel Visual and Semantic Concepts | Nov 30, 2018 | Novel Concepts, Question Answering | Unverified | 0 |
| Visual Question Answering as Reading Comprehension | Nov 29, 2018 | Common Sense Reasoning, General Knowledge | Unverified | 0 |
| CLEAR: A Dataset for Compositional Language and Elementary Acoustic Reasoning | Nov 26, 2018 | Acoustic Question Answering, Question Answering | Code Available | 0 |
| Visual Entailment Task for Visually-Grounded Language Learning | Nov 26, 2018 | Grounded language learning, Natural Language Inference | Unverified | 0 |
| A dataset of clinically generated visual questions and answers about radiology images | Nov 20, 2018 | Decision Making, Medical Visual Question Answering | Unverified | 0 |