NAAQA: A Neural Architecture for Acoustic Question Answering Jun 11, 2021 Acoustic Question Answering Question Answering
Code Code Available 0Supervising the Transfer of Reasoning Patterns in VQA Jun 10, 2021 PAC learning Transfer Learning
— Unverified 0Bayesian Attention Belief Networks Jun 9, 2021 Decoder Machine Translation
— Unverified 0PAM: Understanding Product Images in Cross Product Category Attribute Extraction Jun 8, 2021 Attribute Attribute Extraction
— Unverified 0Check It Again: Progressive Visual Question Answering via Visual Entailment Jun 8, 2021 Question Answering Visual Entailment
Code Code Available 1Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions Jun 8, 2021 Question Answering Visual Question Answering
— Unverified 0Human-Adversarial Visual Question Answering Jun 4, 2021 Question Answering Visual Question Answering
— Unverified 0Grounding Complex Navigational Instructions Using Scene Graphs Jun 3, 2021 Question Answering reinforcement-learning
— Unverified 0Semantic Aligned Multi-modal Transformer for Vision-LanguageUnderstanding: A Preliminary Study on Visual QA Jun 1, 2021 Question Answering Visual Question Answering
— Unverified 0Learning to Select Question-Relevant Relations for Visual Question Answering Jun 1, 2021 Graph Attention Question Answering
— Unverified 0MiniVQA - A resource to build your tailored VQA competition Jun 1, 2021 BIG-bench Machine Learning Visual Question Answering (VQA)
— Unverified 0CLEVR\_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images Jun 1, 2021 Question Answering Visual Question Answering
Code Code Available 0MIMOQA: Multimodal Input Multimodal Output Question Answering Jun 1, 2021 Question Answering Visual Question Answering
— Unverified 0EaSe: A Diagnostic Tool for VQA based on Answer Diversity Jun 1, 2021 Diagnostic Diversity
Code Code Available 0Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models Jun 1, 2021 Data Augmentation Question Answering
— Unverified 0LPF: A Language-Prior Feedback Objective Function for De-biased Visual Question Answering May 29, 2021 Question Answering Visual Question Answering
Code Code Available 0StructuralLM: Structural Pre-training for Form Understanding May 24, 2021 document-image-classification Document Image Classification
— Unverified 0Multi-modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training May 24, 2021 Image Captioning Medical Visual Question Answering
Code Code Available 1Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training May 21, 2021 Question Answering Relation
— Unverified 0Multiple Meta-model Quantifying for Medical Visual Question Answering May 19, 2021 Medical Visual Question Answering Meta-Learning
Code Code Available 1NExT-QA:Next Phase of Question-Answering to Explaining Temporal Actions May 18, 2021 Question Answering Video Question Answering
Code Code Available 1Survey of Visual-Semantic Embedding Methods for Zero-Shot Image Retrieval May 16, 2021 Graph Generation Image Captioning
— Unverified 0Show Why the Answer is Correct! Towards Explainable AI using Compositional Temporal Attention May 15, 2021 Question Answering Visual Question Answering
— Unverified 0Cross-Modal Generative Augmentation for Visual Question Answering May 11, 2021 Data Augmentation Question Answering
— Unverified 0Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules May 11, 2021 Question Answering Visual Question Answering
Code Code Available 1Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning May 10, 2021 Arithmetic Reasoning Geometry Problem Solving
Code Code Available 1Passage Retrieval for Outside-Knowledge Visual Question Answering May 9, 2021 Image Captioning Object
Code Code Available 1AdaVQA: Overcoming Language Priors with Adapted Margin Cosine Loss May 5, 2021 Question Answering Visual Question Answering
Code Code Available 0Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention May 5, 2021 Question Answering Referring Expression
— Unverified 0Iterated learning for emergent systematicity in VQA May 3, 2021 Question Answering Systematic Generalization
— Unverified 0A survey on VQA_Datasets and Approaches May 2, 2021 Question Answering Survey
— Unverified 0Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads Apr 30, 2021 Question Answering Visual Question Answering
— Unverified 0Optimal training of variational quantum algorithms without barren plateaus Apr 29, 2021 Quantum Machine Learning Visual Question Answering (VQA)
Code Code Available 0Document Collection Visual Question Answering Apr 27, 2021 document understanding Question Answering
— Unverified 0InfographicVQA Apr 26, 2021 Question Answering Visual Question Answering
— Unverified 0MDETR -- Modulated Detection for End-to-End Multi-Modal Understanding Apr 26, 2021 Generalized Referring Expression Comprehension Phrase Grounding
Code Code Available 1RelTransformer: A Transformer-Based Long-Tail Visual Relationship Recognition Apr 24, 2021 Image Captioning Object Recognition
Code Code Available 1Playing Lottery Tickets with Vision and Language Apr 23, 2021 Image-text Retrieval Question Answering
— Unverified 0GraghVQA: Language-Guided Graph Neural Networks for Graph-based Visual Question Answering Apr 20, 2021 Graph Neural Network Graph Question Answering
Code Code Available 1Cross-Modal Retrieval Augmentation for Multi-Modal Classification Apr 16, 2021 Classification Cross-Modal Retrieval
— Unverified 0VGNMN: Video-grounded Neural Module Network to Video-Grounded Language Tasks Apr 16, 2021 Information Retrieval Question Answering
— Unverified 0Jointly Learning Truth-Conditional Denotations and Groundings using Parallel Attention Apr 14, 2021 Question Answering Visual Question Answering
— Unverified 0Dealing with Missing Modalities in the Visual Question Answer-Difference Prediction Task through Knowledge Distillation Apr 13, 2021 Knowledge Distillation Triplet
— Unverified 0CLEVR_HYP: A Challenge Dataset and Baselines for Visual Question Answering with Hypothetical Actions over Images Apr 13, 2021 Question Answering Visual Question Answering
Code Code Available 0Neuro-Symbolic VQA: A review from the perspective of AGI desiderata Apr 13, 2021 Question Answering Visual Question Answering
— Unverified 0How Transferable are Reasoning Patterns in VQA? Apr 8, 2021 Question Answering Visual Question Answering
— Unverified 0Multimodal Continuous Visual Attention Mechanisms Apr 7, 2021 Clustering Question Answering
— Unverified 0Beyond Question-Based Biases: Assessing Multimodal Shortcut Learning in Visual Question Answering Apr 7, 2021 Question Answering Visual Question Answering
Code Code Available 1Compressing Visual-linguistic Model via Knowledge Distillation Apr 5, 2021 Image Captioning Knowledge Distillation
— Unverified 0MMBERT: Multimodal BERT Pretraining for Improved Medical VQA Apr 3, 2021 Language Modeling Language Modelling
Code Code Available 1