Loss re-scaling VQA: Revisiting the Language Prior Problem from a Class-imbalance View | Oct 30, 2020 | Face Recognition, image-classification | Code Available | 0
Leveraging Visual Question Answering to Improve Text-to-Image Synthesis | Oct 28, 2020 | Auxiliary Learning, Image Generation | Unverified | 0
MMFT-BERT: Multimodal Fusion Transformer with BERT Encodings for Visual Question Answering | Oct 27, 2020 | Diagnostic, Question Answering | Code Available | 1
ST-GREED: Space-Time Generalized Entropic Differences for Frame Rate Dependent Video Quality Prediction | Oct 26, 2020 | Video Quality Assessment, Visual Question Answering (VQA) | Code Available | 1
Beyond VQA: Generating Multi-word Answer and Rationale to Visual Questions | Oct 24, 2020 | General Classification, Multiple-choice | Unverified | 0
RUArt: A Novel Text-Centered Solution for Text-Based Visual Question Answering | Oct 24, 2020 | Optical Character Recognition, Optical Character Recognition (OCR) | Code Available | 1
Removing Bias in Multi-modal Classifiers: Regularization by Maximizing Functional Entropies | Oct 21, 2020 | Question Answering, Visual Question Answering | Code Available | 1
Bayesian Attention Modules | Oct 20, 2020 | Image Captioning, Machine Translation | Code Available | 1
SOrT-ing VQA Models: Contrastive Gradient Learning for Improved Consistency | Oct 20, 2020 | Question Answering, Visual Grounding | Code Available | 0
Answer-checking in Context: A Multi-modal Fully Attention Network for Visual Question Answering | Oct 17, 2020 | Question Answering, Visual Question Answering | Unverified | 0
New Ideas and Trends in Deep Multimodal Content Understanding: A Review | Oct 16, 2020 | Cross-Modal Retrieval, Deep Learning | Unverified | 0
Natural Language Rationales with Full-Stack Visual Reasoning: From Pixels to Semantic Frames to Commonsense Graphs | Oct 15, 2020 | Language Modeling, Language Modelling | Code Available | 1
Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think! | Oct 13, 2020 | Diagnostic, Image-text Classification | Unverified | 0
Contrast and Classify: Training Robust VQA Models | Oct 13, 2020 | Contrastive Learning, Data Augmentation | Code Available | 1
Interpretable Neural Computation for Real-World Compositional Visual Question Answering | Oct 10, 2020 | Question Answering, Visual Question Answering | Unverified | 0
Characterizing Datasets for Social Visual Question Answering, and the New TinySocial Dataset | Oct 8, 2020 | Question Answering, Visual Question Answering | Unverified | 0
Pathological Visual Question Answering | Oct 6, 2020 | AI Agent, Question Answering | Unverified | 0
Finding the Evidence: Localization-aware Answer Prediction for Text Visual Question Answering | Oct 6, 2020 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified | 0
Attention Guided Semantic Relationship Parsing for Visual Question Answering | Oct 5, 2020 | Object, Question Answering | Unverified | 0
CAPTION: Correction by Analyses, POS-Tagging and Interpretation of Objects using only Nouns | Oct 2, 2020 | Image Captioning, object-detection | Unverified | 0
ISAAQ -- Mastering Textbook Questions with Pre-trained Transformers and Bottom-Up and Top-Down Attention | Oct 1, 2020 | Multiple-choice, Question Answering | Unverified | 0
Graph-based Heuristic Search for Module Selection Procedure in Neural Module Network | Sep 30, 2020 | Heuristic Search, Question Answering | Unverified | 0
Spatial Attention as an Interface for Image Captioning Models | Sep 29, 2020 | Image Captioning, Question Answering | Unverified | 0
Hierarchical Deep Multi-modal Network for Medical Visual Question Answering | Sep 27, 2020 | Descriptive, Medical Visual Question Answering | Code Available | 0
Multiple interaction learning with question-type prior knowledge for constraining answer search space in visual question answering | Sep 23, 2020 | Question Answering, Visual Question Answering | Code Available | 0
X-LXMERT: Paint, Caption and Answer Questions with Multi-Modal Transformers | Sep 23, 2020 | Image Captioning, Image Generation | Code Available | 1
Regularizing Attention Networks for Anomaly Detection in Visual Question Answering | Sep 21, 2020 | Anomaly Detection, Question Answering | Unverified | 0
MUTANT: A Training Paradigm for Out-of-Distribution Generalization in Visual Question Answering | Sep 18, 2020 | Out-of-Distribution Generalization, Question Answering | Code Available | 1
A Multimodal Memes Classification: A Survey and Open Research Issues | Sep 17, 2020 | Classification, General Classification | Unverified | 0
A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports | Sep 3, 2020 | Image-text Retrieval, Medical Visual Question Answering | Code Available | 1
Cross-modal Knowledge Reasoning for Knowledge-based Visual Question Answering | Aug 31, 2020 | Knowledge Graphs, Question Answering | Unverified | 0
A Dataset and Baselines for Visual Question Answering on Art | Aug 28, 2020 | Question Answering, Question Generation | Code Available | 1
Visual Question Answering on Image Sets | Aug 27, 2020 | Question Answering, Visual Question Answering | Unverified | 0
No-Reference Video Quality Assessment Using Space-Time Chips | Aug 23, 2020 | Video Quality Assessment, Visual Question Answering (VQA) | Code Available | 0
Document Visual Question Answering Challenge 2020 | Aug 20, 2020 | Question Answering, Retrieval | Unverified | 0
Linguistically-aware Attention for Reducing the Semantic-Gap in Vision-Language Tasks | Aug 18, 2020 | Image Captioning, Visual Question Answering (VQA) | Unverified | 0
DeVLBert: Learning Deconfounded Visio-Linguistic Representations | Aug 16, 2020 | Image Retrieval, Question Answering | Code Available | 1
Graph Edit Distance Reward: Learning to Edit Scene Graph | Aug 15, 2020 | Graph Matching, Image Retrieval | Unverified | 0
Assisting Scene Graph Generation with Self-Supervision | Aug 8, 2020 | Graph Generation, Image Captioning | Unverified | 0
TRRNet: Tiered Relation Reasoning for Compositional Visual Question Answering | Aug 1, 2020 | Object, Question Answering | Unverified | 0
Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision | Aug 1, 2020 | Question Answering, Visual Question Answering | Unverified | 0
Noise-Induced Barren Plateaus in Variational Quantum Algorithms | Jul 28, 2020 | Visual Question Answering (VQA) | Code Available | 0
REXUP: I REason, I EXtract, I UPdate with Structured Compositional Reasoning for Visual Question Answering | Jul 27, 2020 | Question Answering, Visual Question Answering | Code Available | 0
Contrastive Visual-Linguistic Pretraining | Jul 26, 2020 | Contrastive Learning, regression | Code Available | 0
Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data | Jul 24, 2020 | Visual Dialog, Visual Question Answering (VQA) | Code Available | 0
Spatially Aware Multimodal Transformers for TextVQA | Jul 23, 2020 | Optical Character Recognition (OCR), Spatial Reasoning | Code Available | 1
Semantic Equivalent Adversarial Data Augmentation for Visual Question Answering | Jul 19, 2020 | Adversarial Attack, Data Augmentation | Code Available | 1
Knowledge-Based Video Question Answering with Unsupervised Scene Descriptions | Jul 17, 2020 | Question Answering, Video Question Answering | Code Available | 1
Learning to Discretely Compose Reasoning Module Networks for Video Captioning | Jul 17, 2020 | Decoder, Question Answering | Code Available | 1
Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder | Jul 13, 2020 | Question Answering, Visual Grounding | Unverified | 0