Visual Superordinate Abstraction for Robust Concept Learning | May 28, 2022 | Attribute, Question Answering | Unverified
V-Doc : Visual questions answers with Documents | May 27, 2022 | Question Answering, Question Generation | Unverified
Avoiding Barren Plateaus with Classical Deep Neural Networks | May 26, 2022 | Visual Question Answering (VQA) | Unverified
Guiding Visual Question Answering with Attention Priors | May 25, 2022 | Question Answering, Visual Grounding | Unverified
Reassessing Evaluation Practices in Visual Question Answering: A Case Study on Out-of-Distribution Generalization | May 24, 2022 | Image Captioning, Out-of-Distribution Generalization | Unverified
On Advances in Text Generation from Images Beyond Captioning: A Case Study in Self-Rationalization | May 24, 2022 | Descriptive, Image Captioning | Unverified
VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering | May 23, 2022 | Knowledge Graphs, Question Answering | Unverified
Making Video Quality Assessment Models Sensitive to Frame Rate Distortions | May 21, 2022 | Video Quality Assessment, Visual Question Answering (VQA) | Unverified
Gender and Racial Bias in Visual Question Answering Datasets | May 17, 2022 | Question Answering, Visual Question Answering | Unverified
A Neuro-Symbolic ASP Pipeline for Visual Question Answering | May 16, 2022 | Question Answering, Visual Question Answering | Code Available
A Framework to Map VMAF with the Probability of Just Noticeable Difference between Video Encoding Recipes | May 16, 2022 | Video Quality Assessment, Visual Question Answering (VQA) | Unverified
Serving and Optimizing Machine Learning Workflows on Heterogeneous Infrastructures | May 10, 2022 | AutoML, BIG-bench Machine Learning | Unverified
Joint learning of object graph and relation graph for visual question answering | May 9, 2022 | Attribute, Graph Neural Network | Unverified
Deep Quality Assessment of Compressed Videos: A Subjective and Objective Study | May 7, 2022 | Video Quality Assessment, Visual Question Answering (VQA) | Unverified
From Easy to Hard: Learning Language-guided Curriculum for Visual Question Answering on Remote Sensing Data | May 6, 2022 | Question Answering, Visual Question Answering | Unverified
QLEVR: A Diagnostic Dataset for Quantificational Language and Elementary Visual Reasoning | May 6, 2022 | Diagnostic, Question Answering | Code Available
LAWS: Look Around and Warm-Start Natural Gradient Descent for Quantum Neural Networks | May 5, 2022 | Combinatorial Optimization, Visual Question Answering (VQA) | Code Available
What is Right for Me is Not Yet Right for You: A Dataset for Grounding Relative Directions via Multi-Task Learning | May 5, 2022 | Multi-Task Learning, Question Answering | Code Available
Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering | May 2, 2022 | Decoder, Image Captioning | Unverified
ViLMedic: a framework for research at the intersection of vision and language in medical AI | May 1, 2022 | Medical Visual Question Answering, Question Answering | Unverified
Bridging the Gap between Recognition-level Pre-training and Commonsensical Vision-language Tasks | May 1, 2022 | Diversity, Informativeness | Unverified
DuReader_vis: A Chinese Dataset for Open-domain Document Visual Question Answering | May 1, 2022 | document understanding, Open-Domain Question Answering | Unverified
Vision-Language Pretraining: Current Trends and the Future | May 1, 2022 | Question Answering, Representation Learning | Unverified
Multimodal Adaptive Distillation for Leveraging Unimodal Encoders for Vision-Language Tasks | Apr 22, 2022 | Question Answering, Visual Commonsense Reasoning | Unverified
LayoutLMv3: Pre-training for Document AI with Unified Text and Image Masking | Apr 18, 2022 | cross-modal alignment, Document AI | Code Available
Attention Mechanism based Cognition-level Scene Understanding | Apr 17, 2022 | Question Answering, Scene Understanding | Unverified
Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning | Apr 15, 2022 | Contrastive Learning, Question Answering | Unverified
Question-Driven Graph Fusion Network For Visual Question Answering | Apr 3, 2022 | Graph Attention, Object | Unverified
Co-VQA : Answering by Interactive Sub Question Sequence | Apr 2, 2022 | Question Answering, Visual Question Answering | Unverified
Perceptual Quality Assessment of UGC Gaming Videos | Mar 31, 2022 | Video Quality Assessment, Visual Question Answering (VQA) | Unverified
SimVQA: Exploring Simulated Environments for Visual Question Answering | Mar 31, 2022 | Data Augmentation, Diversity | Unverified
VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers | Mar 30, 2022 | Question Answering, Visual Commonsense Reasoning | Code Available
Visual Mechanisms Inspired Efficient Transformers for Image and Video Quality Assessment | Mar 28, 2022 | Image Quality Assessment, Video Quality Assessment | Unverified
Single-Stream Multi-Level Alignment for Vision-Language Pretraining | Mar 27, 2022 | Image-text Retrieval, Question Answering | Code Available
Subjective and Objective Analysis of Streamed Gaming Videos | Mar 24, 2022 | Video Quality Assessment, Visual Question Answering (VQA) | Unverified
Bilaterally Slimmable Transformer for Elastic and Efficient Visual Question Answering | Mar 24, 2022 | GPU, Question Answering | Code Available
Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering | Mar 24, 2022 | Optical Character Recognition, Optical Character Recognition (OCR) | Unverified
WuDaoMM: A large-scale Multi-Modal Dataset for Pre-training models | Mar 22, 2022 | Image Captioning, Image Generation | Unverified
Can you even tell left from right? Presenting a new challenge for VQA | Mar 15, 2022 | Question Answering, Visual Question Answering | Unverified
CARETS: A Consistency And Robustness Evaluative Test Suite for VQA | Mar 15, 2022 | Negation, Question Generation | Code Available
CLIP Models are Few-shot Learners: Empirical Studies on VQA and Visual Entailment | Mar 14, 2022 | parameter-efficient fine-tuning, Question Answering | Unverified
Enabling Multimodal Generation on CLIP via Vision-Language Knowledge Distillation | Mar 12, 2022 | Image Captioning, Knowledge Distillation | Unverified
Barlow constrained optimization for Visual Question Answering | Mar 7, 2022 | Question Answering, Visual Question Answering | Code Available
Dynamic Key-value Memory Enhanced Multi-step Graph Reasoning for Knowledge-based Visual Question Answering | Mar 6, 2022 | Graph Attention, Question Answering | Code Available
Modeling Coreference Relations in Visual Dialog | Mar 6, 2022 | Question Answering, Visual Dialog | Unverified
Recent, rapid advancement in visual question answering architecture: a review | Mar 2, 2022 | Question Answering, Visual Question Answering | Unverified
Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment | Mar 1, 2022 | Retrieval, Sentence | Unverified
Joint Answering and Explanation for Visual Commonsense Reasoning | Feb 25, 2022 | Knowledge Distillation, Question Answering | Code Available
On Modality Bias Recognition and Reduction | Feb 25, 2022 | Action Recognition, Multi-modal Classification | Code Available
Measuring CLEVRness: Blackbox testing of Visual Reasoning Models | Feb 24, 2022 | Benchmarking, Diagnostic | Unverified