Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks Apr 13, 2020 Cross-Modal Retrieval Image Captioning
Code Code Available 2An Entropy Clustering Approach for Assessing Visual Question Difficulty Apr 12, 2020 Clustering Question Answering
Code Code Available 0YouMakeup VQA Challenge: Towards Fine-grained Action Understanding in Domain-Specific Videos Apr 12, 2020 Action Understanding Question Answering
Code Code Available 1Visual Grounding Methods for VQA are Working for the Wrong Reasons! Apr 12, 2020 Question Answering Visual Grounding
Code Code Available 1Rephrasing visual questions by specifying the entropy of the answer distribution Apr 10, 2020 Question Answering Visual Question Answering
— Unverified 0Understanding Knowledge Gaps in Visual Question Answering: Implications for Gap Identification and Testing Apr 8, 2020 Diversity Question Answering
— Unverified 0Evaluating Multimodal Representations on Visual Semantic Textual Similarity Apr 4, 2020 Benchmarking Image Captioning
Code Code Available 1Generating Rationales in Visual Question Answering Apr 4, 2020 Question Answering Visual Question Answering
— Unverified 0Pixel-BERT: Aligning Image Pixels with Text by Deep Multi-Modal Transformers Apr 2, 2020 Image-text matching Image-text Retrieval
Code Code Available 1X-Linear Attention Networks for Image Captioning Mar 31, 2020 Decoder Fine-Grained Visual Recognition
Code Code Available 1Multi-Modal Graph Neural Network for Joint Reasoning on Vision and Scene Text Mar 31, 2020 Graph Neural Network Question Answering
Code Code Available 1Assessing Image Quality Issues for Real-World Problems Mar 27, 2020 Image Captioning Question Answering
— Unverified 0P NP, at least in Visual Question Answering Mar 26, 2020 Question Answering Visual Question Answering
Code Code Available 0Linguistically Driven Graph Capsule Network for Visual Question Reasoning Mar 23, 2020 Question Answering Visual Question Answering
— Unverified 0Visual Question Answering for Cultural Heritage Mar 22, 2020 Question Answering Visual Question Answering
— Unverified 0Normalized and Geometry-Aware Self-Attention Network for Image Captioning Mar 19, 2020 Image Captioning Machine Translation
— Unverified 0RSVQA: Visual Question Answering for Remote Sensing Data Mar 16, 2020 Land Cover Classification Object Counting
— Unverified 0Ground Truth Evaluation of Neural Network Explanations with CLEVR-XAI Mar 16, 2020 Benchmarking Explainable Artificial Intelligence (XAI)
Code Code Available 1Counterfactual Samples Synthesizing for Robust Visual Question Answering Mar 14, 2020 counterfactual Question Answering
Code Code Available 1MQA: Answering the Question via Robotic Manipulation Mar 10, 2020 Imitation Learning Question Answering
Code Code Available 0PathVQA: 30000+ Questions for Medical Visual Question Answering Mar 7, 2020 AI Agent Medical Visual Question Answering
Code Code Available 1Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning Mar 6, 2020 Density Estimation Noise Estimation
Code Code Available 0XGPT: Cross-modal Generative Pre-Training for Image Captioning Mar 3, 2020 Data Augmentation Denoising
— Unverified 0A Question-Centric Model for Visual Question Answering in Medical Imaging Mar 2, 2020 Medical Image Analysis Question Answering
Code Code Available 0A Study on Multimodal and Interactive Explanations for Visual Question Answering Mar 1, 2020 Explainable Artificial Intelligence (XAI) Prediction
— Unverified 0Visual Commonsense R-CNN Feb 27, 2020 Image Captioning Representation Learning
Code Code Available 1Unshuffling Data for Improved Generalization Feb 27, 2020 Clustering Data Augmentation
— Unverified 0Hierarchical Conditional Relation Networks for Video Question Answering Feb 25, 2020 Audio-Visual Question Answering (AVQA) Question Answering
Code Code Available 1A Comparative Evaluation of Temporal Pooling Methods for Blind Video Quality Assessment Feb 25, 2020 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 0What BERT Sees: Cross-Modal Transfer for Visual Question Generation Feb 25, 2020 Question Generation Question-Generation
— Unverified 0On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering Feb 24, 2020 Question Answering Referring Expression
— Unverified 0VQA-LOL: Visual Question Answering under the Lens of Logic Feb 19, 2020 Negation Question Answering
— Unverified 0CQ-VQA: Visual Question Answering on Categorized Questions Feb 17, 2020 Question Answering Visual Question Answering
— Unverified 0Sparse and Structured Visual Attention Feb 13, 2020 Image Captioning Question Answering
Code Code Available 0Component Analysis for Visual Question Answering Architectures Feb 12, 2020 Question Answering Representation Learning
— Unverified 0Multimodal fusion of imaging and genomics for lung cancer recurrence prediction Feb 5, 2020 Computed Tomography (CT) Question Answering
Code Code Available 1Break It Down: A Question Understanding Benchmark Jan 31, 2020 Open-Domain Question Answering Question Answering
Code Code Available 1Augmenting Visual Question Answering with Semantic Frame Information in a Multitask Learning Approach Jan 31, 2020 Question Answering Visual Question Answering
Code Code Available 0Uncertainty based Class Activation Maps for Visual Question Answering Jan 23, 2020 Deep Learning Probabilistic Deep Learning
— Unverified 0Robust Explanations for Visual Question Answering Jan 23, 2020 Question Answering Visual Question Answering
Code Code Available 0SQuINTing at VQA Models: Introspecting VQA Models with Sub-Questions Jan 20, 2020 Visual Question Answering (VQA)
— Unverified 0Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models Jan 20, 2020 Question Answering Visual Question Answering
— Unverified 0Recommending Themes for Ad Creative Design via Visual-Linguistic Representations Jan 20, 2020 Question Answering Recommendation Systems
Code Code Available 0Extending Class Activation Mapping Using Gaussian Receptive Field Jan 15, 2020 Deep Learning Image Classification
— Unverified 0Fine-grained Image Classification and Retrieval by Combining Visual and Locally Pooled Textual Features Jan 14, 2020 Classification Diversity
Code Code Available 1MHSAN: Multi-Head Self-Attention Network for Visual Semantic Embedding Jan 11, 2020 Image Captioning Image-text Retrieval
Code Code Available 0In Defense of Grid Features for Visual Question Answering Jan 10, 2020 Image Captioning Question Answering
Code Code Available 1Visual Question Answering on 360° Images Jan 10, 2020 Question Answering Visual Question Answering
— Unverified 0Think Locally, Act Globally: Federated Learning with Local and Global Representations Jan 6, 2020 Federated Learning Representation Learning
Code Code Available 1Multi-Layer Content Interaction Through Quaternion Product For Visual Question Answering Jan 3, 2020 Question Answering Video Description
— Unverified 0