Unshuffling Data for Improved Generalization in Visual Question Answering Jan 1, 2021 Out-of-Distribution Generalization Question Answering
— Unverified 0Linguistically Routing Capsule Network for Out-of-Distribution Visual Question Answering Jan 1, 2021 Novel Concepts Question Answering
— Unverified 0Erasure for Advancing: Dynamic Self-Supervised Learning for Commonsense Reasoning Jan 1, 2021 Question Answering Self-Supervised Learning
— Unverified 0Differentiable End-to-End Program Executor for Sample and Computationally Efficient VQA Jan 1, 2021 Question Answering Visual Question Answering
— Unverified 0Seeing is Knowing! Fact-based Visual Question Answering using Knowledge Graph Embeddings Dec 31, 2020 Common Sense Reasoning Knowledge Graph Embeddings
— Unverified 0Detecting Hate Speech in Multi-modal Memes Dec 29, 2020 Binary Classification Hate Speech Detection
Code Code Available 1LayoutLMv2: Multi-modal Pre-training for Visually-Rich Document Understanding Dec 29, 2020 Document Image Classification Document Layout Analysis
Code Code Available 0Learning content and context with language bias for Visual Question Answering Dec 21, 2020 Question Answering Visual Question Answering
Code Code Available 0Object-Centric Diagnosis of Visual Reasoning Dec 21, 2020 Diagnostic Object
— Unverified 0KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA Dec 20, 2020 Visual Question Answering (VQA)
— Unverified 0Trying Bilinear Pooling in Video-QA Dec 18, 2020 Question Answering Video Question Answering
— Unverified 0On Modality Bias in the TVQA Dataset Dec 18, 2020 Question Answering Video Question Answering
Code Code Available 0Overcoming Language Priors with Self-supervised Learning for Visual Question Answering Dec 17, 2020 Question Answering Self-Supervised Learning
Code Code Available 1Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation Embedding Dec 14, 2020 Question Answering Visual Question Answering
Code Code Available 1KVL-BERT: Knowledge Enhanced Visual-and-Linguistic BERT for Visual Commonsense Reasoning Dec 13, 2020 Sentence Visual Commonsense Reasoning
— Unverified 0Simple is not Easy: A Simple Strong Baseline for TextVQA and TextCaps Dec 9, 2020 Decoder Image Captioning
— Unverified 0Study on the Assessment of the Quality of Experience of Streaming Video Dec 8, 2020 regression Video Quality Assessment
Code Code Available 0TAP: Text-Aware Pre-training for Text-VQA and Text-Caption Dec 8, 2020 Caption Generation Language Modeling
Code Code Available 1CRAFT: A Benchmark for Causal Reasoning About Forces and inTeractions Dec 8, 2020 counterfactual Descriptive
Code Code Available 1FloodNet: A High Resolution Aerial Imagery Dataset for Post Flood Scene Understanding Dec 5, 2020 image-classification Image Classification
Code Code Available 1WeaQA: Weak Supervision via Captions for Visual Question Answering Dec 4, 2020 Question Answering Visual Question Answering
— Unverified 0Understanding Guided Image Captioning Performance across Domains Dec 4, 2020 Descriptive Image Captioning
Code Code Available 0Towards Knowledge-Augmented Visual Question Answering Dec 1, 2020 General Knowledge Graph Attention
Code Code Available 0A Unified Framework for Multilingual and Code-Mixed Visual Question Answering Dec 1, 2020 Question Answering Visual Question Answering
— Unverified 0Open-Ended Multi-Modal Relational Reasoning for Video Question Answering Dec 1, 2020 Question Answering Relational Reasoning
Code Code Available 0Just Ask: Learning to Answer Questions from Millions of Narrated Videos Dec 1, 2020 Question Answering Question Generation
Code Code Available 1Multimodal Graph Networks for Compositional Generalization in Visual Question Answering Dec 1, 2020 Graph Neural Network Question Answering
— Unverified 0Patch-VQ: 'Patching Up' the Video Quality Problem Nov 27, 2020 Video Quality Assessment Visual Question Answering (VQA)
Code Code Available 1Point and Ask: Incorporating Pointing into Visual Question Answering Nov 27, 2020 Question Answering Visual Question Answering
Code Code Available 1Learning from Lexical Perturbations for Consistent Visual Question Answering Nov 26, 2020 Question Answering Visual Question Answering
Code Code Available 0Transformation Driven Visual Reasoning Nov 26, 2020 Attribute Triplet
Code Code Available 1Siamese Tracking with Lingual Object Constraints Nov 23, 2020 Object Object Tracking
Code Code Available 0Large Scale Multimodal Classification Using an Ensemble of Transformer Models and Co-Attention Nov 23, 2020 Classification General Classification
Code Code Available 1Interpretable Visual Reasoning via Induced Symbolic Space Nov 23, 2020 Visual Question Answering (VQA) Visual Reasoning
Code Code Available 0Modular Graph Attention Network for Complex Visual Relational Reasoning Nov 22, 2020 Graph Attention Question Answering
— Unverified 0LRTA: A Transparent Neural-Symbolic Reasoning Framework with Modular Supervision for Visual Question Answering Nov 21, 2020 Answer Generation Question Answering
Code Code Available 1Logically Consistent Loss for Visual Question Answering Nov 19, 2020 Multi-Task Learning Question Answering
— Unverified 0Generating Natural Questions from Images for Multimodal Assistants Nov 17, 2020 Common Sense Reasoning Natural Questions
— Unverified 0CapWAP: Captioning with a Purpose Nov 9, 2020 Image Captioning Question Answering
— Unverified 0Learning to Model and Ignore Dataset Bias with Mixed Capacity Ensembles Nov 7, 2020 Natural Language Inference Question Answering
Code Code Available 0Disentangling 3D Prototypical Networks For Few-Shot Concept Learning Nov 6, 2020 3D geometry 3D Object Detection
Code Code Available 1An Improved Attention for Visual Question Answering Nov 4, 2020 Decoder Question Answering
Code Code Available 0Reasoning Over History: Context Aware Visual Dialog Nov 2, 2020 coreference-resolution Coreference Resolution
— Unverified 0Can Pre-training help VQA with Lexical Variations? Nov 1, 2020 Question Answering Visual Question Answering
— Unverified 0Representation, Learning and Reasoning on Spatial Language for Downstream NLP Tasks Nov 1, 2020 Common Sense Reasoning Question Answering
— Unverified 0ConceptBert: Concept-Aware Representation for Visual Question Answering Nov 1, 2020 Common Sense Reasoning Question Answering
Code Code Available 1ISAAQ - Mastering Textbook Questions with Pre-trained Transformers and Bottom-Up and Top-Down Attention Nov 1, 2020 Multiple-choice Question Answering
— Unverified 0CapWAP: Image Captioning with a Purpose Nov 1, 2020 Image Captioning Question Answering
— Unverified 0Learning to Contrast the Counterfactual Samples for Robust Visual Question Answering Nov 1, 2020 Contrastive Learning counterfactual
Code Code Available 1STL-CQA: Structure-based Transformers with Localization and Encoding for Chart Question Answering Nov 1, 2020 Chart Question Answering Question Answering
— Unverified 0