OmniNet: A unified architecture for multi-modal multi-task learning Jul 17, 2019 Image Captioning Multi-Task Learning
Code Code Available 0KVQA: Knowledge-Aware Visual Question Answering Jul 17, 2019 Knowledge Graphs Question Answering
— Unverified 02nd Place Solution to the GQA Challenge 2019 Jul 16, 2019 Question Answering Visual Question Answering
— Unverified 0Assessing Visual Quality of Omnidirectional Videos Jul 14, 2019 Visual Question Answering (VQA)
— Unverified 0Neural Reasoning, Fast and Slow, for Video Question Answering Jul 10, 2019 Natural Questions Question Answering
— Unverified 0Learning by Abstraction: The Neural State Machine Jul 9, 2019 Visual Question Answering (VQA) Visual Reasoning
Code Code Available 0Multi-grained Attention with Object-level Grounding for Visual Question Answering Jul 1, 2019 Object Question Answering
— Unverified 0Are Red Roses Red? Evaluating Consistency of Question-Answering Models Jul 1, 2019 Question Answering valid
Code Code Available 0ICDAR 2019 Competition on Scene Text Visual Question Answering Jun 30, 2019 Question Answering Visual Question Answering
— Unverified 0Deep Modular Co-Attention Networks for Visual Question Answering Jun 25, 2019 Question Answering Visual Question Answering
Code Code Available 0Adversarial Multimodal Network for Movie Question Answering Jun 24, 2019 Question Answering Video Question Answering
— Unverified 0Integrating Knowledge and Reasoning in Image Understanding Jun 24, 2019 Object Recognition Question Answering
— Unverified 0RUBi: Reducing Unimodal Biases in Visual Question Answering Jun 24, 2019 Question Answering Visual Question Answering
Code Code Available 0Investigating Biases in Textual Entailment Datasets Jun 23, 2019 BIG-bench Machine Learning Natural Language Inference
— Unverified 0Two-Level Approach for No-Reference Consumer Video Quality Assessment Jun 20, 2019 Video Quality Assessment Visual Question Answering (VQA)
Code Code Available 0Adversarial Regularization for Visual Question Answering: Strengths, Shortcomings, and Side Effects Jun 20, 2019 Question Answering Visual Question Answering
— Unverified 0Improving Visual Question Answering by Referring to Generated Paragraph Captions Jun 14, 2019 Decoder Image Captioning
— Unverified 0Mimic and Fool: A Task Agnostic Adversarial Attack Jun 11, 2019 Adversarial Attack Image Captioning
Code Code Available 0Psycholinguistics meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering Jun 10, 2019 Continual Learning Question Answering
— Unverified 0ActivityNet-QA: A Dataset for Understanding Complex Web Videos via Question Answering Jun 6, 2019 Question Answering Video Question Answering
Code Code Available 0Generating Question Relevant Captions to Aid Visual Question Answering Jun 3, 2019 General Knowledge Image Captioning
— Unverified 0Viewport Proposal CNN for 360deg Video Quality Assessment Jun 1, 2019 Saliency Prediction Video Quality Assessment
Code Code Available 0Dynamic Fusion With Intra- and Inter-Modality Attention Flow for Visual Question Answering Jun 1, 2019 Question Answering Visual Question Answering
— Unverified 0Grounded Word Sense Translation Jun 1, 2019 Grounded language learning Machine Translation
— Unverified 0ImageTTR: Grounding Type Theory with Records in Image Classification for Visual Question Answering Jun 1, 2019 General Classification image-classification
— Unverified 0What Can Neural Networks Reason About? May 30, 2019 Question Answering Visual Question Answering
Code Code Available 0Vision-to-Language Tasks Based on Attributes and Attention Mechanism May 29, 2019 Image Captioning Question Answering
— Unverified 0Leveraging Medical Visual Question Answering with Supporting Facts May 28, 2019 Diversity Medical Visual Question Answering
— Unverified 0Structure Learning for Neural Module Networks May 27, 2019 Question Answering Visual Question Answering
— Unverified 0Why do These Match? Explaining the Behavior of Image Similarity Models May 26, 2019 Attribute General Classification
Code Code Available 0Self-Critical Reasoning for Robust Visual Question Answering May 24, 2019 Question Answering Visual Question Answering
Code Code Available 0Aligning Visual Regions and Textual Concepts for Semantic-Grounded Image Representations May 15, 2019 Image Captioning Question Answering
Code Code Available 0Misleading Failures of Partial-input Baselines May 14, 2019 Natural Language Inference Visual Question Answering (VQA)
— Unverified 0Quantifying and Alleviating the Language Prior Problem in Visual Question Answering May 13, 2019 Information Retrieval Question Answering
Code Code Available 0Language-Conditioned Graph Networks for Relational Reasoning May 10, 2019 Object Referring Expression Comprehension
Code Code Available 0Visual TTR - Modelling Visual Question Answering in Type Theory with Records May 1, 2019 Question Answering Visual Question Answering
— Unverified 0Routing Networks and the Challenges of Modular and Compositional Computation Apr 29, 2019 Language Modeling Language Modelling
Code Code Available 0The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision Apr 26, 2019 Image-text Retrieval Object
Code Code Available 0Scene Graph Prediction with Limited Labels Apr 25, 2019 Knowledge Base Completion Prediction
Code Code Available 0Progressive Attention Memory Network for Movie Story Question Answering Apr 18, 2019 Question Answering Video Story QA
— Unverified 0Learning to Collocate Neural Modules for Image Captioning Apr 18, 2019 Decoder Image Captioning
— Unverified 0Question Guided Modular Routing Networks for Visual Question Answering Apr 17, 2019 Question Answering Visual Question Answering
— Unverified 0Evaluating the Representational Hub of Language and Vision Models Apr 12, 2019 Diagnostic Question Answering
— Unverified 0Factor Graph Attention Apr 11, 2019 Graph Attention Question Answering
Code Code Available 0Text Guided Person Image Synthesis Apr 10, 2019 Attribute Image Generation
— Unverified 0Multi-Target Embodied Question Answering Apr 9, 2019 Embodied Question Answering Navigate
Code Code Available 0Heterogeneous Memory Enhanced Multimodal Attention Model for Video Question Answering Apr 8, 2019 Question Answering Video Question Answering
Code Code Available 0Can You Explain That? Lucid Explanations Help Human-AI Collaborative Image Retrieval Apr 5, 2019 Image Retrieval Question Answering
— Unverified 0Actively Seeking and Learning from Live Data Apr 5, 2019 Domain Adaptation Meta-Learning
— Unverified 0MMED: A Multi-domain and Multi-modality Event Dataset Apr 4, 2019 Articles Question Answering
— Unverified 0