| Assisting Scene Graph Generation with Self-Supervision | Aug 8, 2020 | Graph GenerationImage Captioning | —Unverified | 0 |
| Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision | Aug 1, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| TRRNet: Tiered Relation Reasoning for Compositional Visual Question Answering | Aug 1, 2020 | ObjectQuestion Answering | —Unverified | 0 |
| REXUP: I REason, I EXtract, I UPdate with Structured Compositional Reasoning for Visual Question Answering | Jul 27, 2020 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder | Jul 13, 2020 | Question AnsweringVisual Grounding | —Unverified | 0 |
| Applying recent advances in Visual Question Answering to Record Linkage | Jul 12, 2020 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Image Captioning with Compositional Neural Module Networks | Jul 10, 2020 | Image CaptioningQuestion Answering | —Unverified | 0 |
| IQ-VQA: Intelligent Visual Question Answering | Jul 8, 2020 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Visual Question Answering as a Multi-Task Problem | Jul 3, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Eliminating Catastrophic Interference with Biased Competition | Jul 3, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| The Impact of Explanations on AI Competency Prediction in VQA | Jul 2, 2020 | AI AgentLanguage Modeling | —Unverified | 0 |
| Scene Graph Reasoning for Visual Question Answering | Jul 2, 2020 | NavigateQuestion Answering | —Unverified | 0 |
| Aligned Dual Channel Graph Convolutional Network for Visual Question Answering | Jul 1, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Towards Visual Dialog for Radiology | Jul 1, 2020 | Question AnsweringVisual Dialog | —Unverified | 0 |
| Multimodal Neural Graph Memory Networks for Visual Question Answering | Jul 1, 2020 | Graph Neural NetworkQuestion Answering | —Unverified | 0 |
| Improving VQA and its Explanations \\ by Comparing Competing Explanations | Jun 28, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Self-Segregating and Coordinated-Segregating Transformer for Focused Deep Multi-Modular Network for Visual Question Answering | Jun 25, 2020 | DiversityQuestion Answering | —Unverified | 0 |
| Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning" | Jun 20, 2020 | Graph GenerationQuestion Answering | —Unverified | 0 |
| Mucko: Multi-Layer Cross-Modal Knowledge Reasoning for Fact-based Visual Question Answering | Jun 16, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| ORD: Object Relationship Discovery for Visual Dialogue Generation | Jun 15, 2020 | Dialogue GenerationGraph Attention | —Unverified | 0 |
| Exploring Weaknesses of VQA Models through Attribution Driven Insights | Jun 11, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Estimating semantic structure for the VQA answer space | Jun 10, 2020 | General ClassificationQuestion Answering | —Unverified | 0 |
| Counterfactual Vision and Language Learning | Jun 1, 2020 | counterfactualQuestion Answering | —Unverified | 0 |
| Multimodal grid features and cell pointers for Scene Text Visual Question Answering | Jun 1, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| TA-Student VQA: Multi-Agents Training by Self-Questioning | Jun 1, 2020 | DiversityQuestion Answering | —Unverified | 0 |
| On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law | May 19, 2020 | Model SelectionQuestion Answering | —Unverified | 0 |
| Visual Relationship Detection using Scene Graphs: A Survey | May 16, 2020 | Graph GenerationImage Generation | —Unverified | 0 |
| Visual Question Answering with Prior Class Semantics | May 4, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| A Corpus for Visual Question Answering Annotated with Frame Semantic Information | May 1, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Image Position Prediction in Multimodal Documents | May 1, 2020 | ArticlesCaption Generation | —Unverified | 0 |
| Pragmatic Issue-Sensitive Image Captioning | Apr 29, 2020 | DescriptiveImage Captioning | CodeCode Available | 0 |
| A Novel Attention-based Aggregation Function to Combine Vision and Language | Apr 27, 2020 | General ClassificationImage Captioning | —Unverified | 0 |
| Visual Question Answering Using Semantic Information from Image Descriptions | Apr 23, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision | Apr 20, 2020 | counterfactualimage-classification | —Unverified | 0 |
| Knowledge-Based Visual Question Answering in Videos | Apr 17, 2020 | Question AnsweringVideo Question Answering | —Unverified | 0 |
| An Entropy Clustering Approach for Assessing Visual Question Difficulty | Apr 12, 2020 | ClusteringQuestion Answering | CodeCode Available | 0 |
| Rephrasing visual questions by specifying the entropy of the answer distribution | Apr 10, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Understanding Knowledge Gaps in Visual Question Answering: Implications for Gap Identification and Testing | Apr 8, 2020 | DiversityQuestion Answering | —Unverified | 0 |
| Generating Rationales in Visual Question Answering | Apr 4, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Assessing Image Quality Issues for Real-World Problems | Mar 27, 2020 | Image CaptioningQuestion Answering | —Unverified | 0 |
| P NP, at least in Visual Question Answering | Mar 26, 2020 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Linguistically Driven Graph Capsule Network for Visual Question Reasoning | Mar 23, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Question Answering for Cultural Heritage | Mar 22, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Normalized and Geometry-Aware Self-Attention Network for Image Captioning | Mar 19, 2020 | Image CaptioningMachine Translation | —Unverified | 0 |
| RSVQA: Visual Question Answering for Remote Sensing Data | Mar 16, 2020 | Land Cover ClassificationObject Counting | —Unverified | 0 |
| MQA: Answering the Question via Robotic Manipulation | Mar 10, 2020 | Imitation LearningQuestion Answering | CodeCode Available | 0 |
| Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning | Mar 6, 2020 | Density EstimationNoise Estimation | CodeCode Available | 0 |
| A Question-Centric Model for Visual Question Answering in Medical Imaging | Mar 2, 2020 | Medical Image AnalysisQuestion Answering | CodeCode Available | 0 |
| A Study on Multimodal and Interactive Explanations for Visual Question Answering | Mar 1, 2020 | Explainable Artificial Intelligence (XAI)Prediction | —Unverified | 0 |
| Unshuffling Data for Improved Generalization | Feb 27, 2020 | ClusteringData Augmentation | —Unverified | 0 |