| EaSe: A Diagnostic Tool for VQA based on Answer Diversity | Jun 1, 2021 | DiagnosticDiversity | CodeCode Available | 0 |
| Learning to Select Question-Relevant Relations for Visual Question Answering | Jun 1, 2021 | Graph AttentionQuestion Answering | —Unverified | 0 |
| LPF: A Language-Prior Feedback Objective Function for De-biased Visual Question Answering | May 29, 2021 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| StructuralLM: Structural Pre-training for Form Understanding | May 24, 2021 | document-image-classificationDocument Image Classification | —Unverified | 0 |
| Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training | May 21, 2021 | Question AnsweringRelation | —Unverified | 0 |
| Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval | May 16, 2021 | Graph GenerationImage Captioning | —Unverified | 0 |
| Show Why the Answer is Correct! Towards Explainable AI using Compositional Temporal Attention | May 15, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Cross-Modal Generative Augmentation for Visual Question Answering | May 11, 2021 | Data AugmentationQuestion Answering | —Unverified | 0 |
| Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention | May 5, 2021 | Question AnsweringReferring Expression | —Unverified | 0 |
| AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss | May 5, 2021 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Iterated learning for emergent systematicity in VQA | May 3, 2021 | Question AnsweringSystematic Generalization | —Unverified | 0 |
| A survey on VQA_Datasets and Approaches | May 2, 2021 | Question AnsweringSurvey | —Unverified | 0 |
| Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads | Apr 30, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Document Collection Visual Question Answering | Apr 27, 2021 | document understandingQuestion Answering | —Unverified | 0 |
| InfographicVQA | Apr 26, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Playing Lottery Tickets with Vision and Language | Apr 23, 2021 | Image-text RetrievalQuestion Answering | —Unverified | 0 |
| VGNMN: Video-grounded Neural Module Network to Video-Grounded Language Tasks | Apr 16, 2021 | Information RetrievalQuestion Answering | —Unverified | 0 |
| Cross-Modal Retrieval Augmentation for Multi-Modal Classification | Apr 16, 2021 | ClassificationCross-Modal Retrieval | —Unverified | 0 |
| Jointly Learning Truth-Conditional Denotations and Groundings using Parallel Attention | Apr 14, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Neuro-Symbolic VQA: A review from the perspective of AGI desiderata | Apr 13, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images | Apr 13, 2021 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| How Transferable are Reasoning Patterns in VQA? | Apr 8, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Multimodal Continuous Visual Attention Mechanisms | Apr 7, 2021 | ClusteringQuestion Answering | —Unverified | 0 |
| Compressing Visual-linguistic Model via Knowledge Distillation | Apr 5, 2021 | Image CaptioningKnowledge Distillation | —Unverified | 0 |
| `Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in development and evaluation of Open-Ended VQA tasks | Apr 1, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training | Apr 1, 2021 | Image-text matchingImage-text Retrieval | —Unverified | 0 |
| Analysis on Image Set Visual Question Answering | Mar 31, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Domain-robust VQA with diverse datasets and methods but no target labels | Mar 29, 2021 | Domain AdaptationObject Recognition | —Unverified | 0 |
| 'Just because you are right, doesn't mean I am wrong': Overcoming a Bottleneck in the Development and Evaluation of Open-Ended Visual Question Answering (VQA) Tasks | Mar 28, 2021 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Generating and Evaluating Explanations of Attended and Error-Inducing Input Regions for VQA Models | Mar 26, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Grounding Strategies for Text-Only Natural Language Processing | Mar 25, 2021 | Image RetrievalLanguage Modeling | —Unverified | 0 |
| How to Design Sample and Computationally Efficient VQA Models | Mar 22, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Automatic Generation of Contrast Sets from Scene Graphs: Probing the Compositional Consistency of GQA | Mar 17, 2021 | Question AnsweringRelational Reasoning | CodeCode Available | 0 |
| A Comprehensive Survey of Scene Graphs: Generation and Application | Mar 17, 2021 | Image CaptioningQuestion Answering | —Unverified | 0 |
| Characterizing Misclassifications of Deep NLP Models | Mar 12, 2021 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 |
| RL-CSDia: Representation Learning of Computer Science Diagrams | Mar 10, 2021 | Question AnsweringRepresentation Learning | —Unverified | 0 |
| Select, Substitute, Search: A New Benchmark for Knowledge-Augmented Visual Question Answering | Mar 9, 2021 | Optical Character Recognition (OCR)Question Answering | CodeCode Available | 0 |
| Contextual Dropout: An Efficient Sample-Dependent Dropout Module | Mar 6, 2021 | image-classificationImage Classification | CodeCode Available | 0 |
| Visual Question Answering: which investigated applications? | Mar 4, 2021 | Image CaptioningQuestion Answering | CodeCode Available | 0 |
| Learning Reasoning Paths over Semantic Graphs for Video-grounded Dialogues | Mar 1, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Learning Compositional Representation for Few-shot Visual Question Answering | Feb 21, 2021 | AttributeQuestion Answering | —Unverified | 0 |
| Answer Questions with Right Image Regions: A Visual Attention Regularization Approach | Feb 3, 2021 | Question AnsweringVisual Grounding | CodeCode Available | 0 |
| An Empirical Study on the Generalization Power of Neural Representations Learned via Visual Guessing Games | Jan 31, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Unanswerable Questions about Images and Texts | Jan 25, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Question Answering based on Local-Scene-Aware Referring Expression Generation | Jan 22, 2021 | Question AnsweringReferring Expression | —Unverified | 0 |
| Understanding in Artificial Intelligence | Jan 17, 2021 | Natural Language UnderstandingQuestion Answering | —Unverified | 0 |
| Latent Variable Models for Visual Question Answering | Jan 16, 2021 | BenchmarkingQuestion Answering | —Unverified | 0 |
| Understanding the Role of Scene Graphs in Visual Question Answering | Jan 14, 2021 | Graph GenerationQuestion Answering | —Unverified | 0 |
| Predicting Relative Depth between Objects from Semantic Features | Jan 12, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Self Supervision for Attention Networks | Jan 6, 2021 | image-classificationImage Classification | CodeCode Available | 0 |