Avoiding Barren Plateaus with Classical Deep Neural Networks May 26, 2022 Visual Question Answering (VQA)
— Unverified 0Analysis of Visual Question Answering Algorithms with attention model May 4, 2023 Question Answering Visual Question Answering
— Unverified 0Curriculum Script Distillation for Multilingual Visual Question Answering Jan 17, 2023 Question Answering Visual Question Answering
— Unverified 0Curriculum reinforcement learning for quantum architecture search under hardware errors Feb 5, 2024 3D Architecture Computational Efficiency
— Unverified 0A Visual Question Answering Method for SAR Ship: Breaking the Requirement for Multimodal Dataset Construction and Model Fine-Tuning Nov 3, 2024 object-detection Object Detection
— Unverified 0Inverse Visual Question Answering with Multi-Level Attentions Sep 17, 2019 Question Answering Visual Question Answering
— Unverified 0Curriculum Learning for Compositional Visual Reasoning Mar 27, 2023 Question Answering Visual Question Answering
— Unverified 0Curriculum Learning Effectively Improves Low Data VQA Dec 1, 2021 Question Answering Visual Question Answering
— Unverified 0A Vision Centric Remote Sensing Benchmark Mar 20, 2025 Question Answering Representation Learning
— Unverified 0CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering May 22, 2025 Computed Tomography (CT) Question Answering
— Unverified 0AVIS: Autonomous Visual Information Seeking with Large Language Model Agent Jun 13, 2023 Decision Making Language Modeling
— Unverified 0CS-VQA: Visual Question Answering with Compressively Sensed Images Jun 8, 2018 Question Answering Visual Question Answering
— Unverified 0CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization Nov 1, 2021 Answer Generation Question-Answer-Generation
— Unverified 0Auto-Parsing Network for Image Captioning and Visual Question Answering Aug 24, 2021 Image Captioning Question Answering
— Unverified 0A Multimodal Memes Classification: A Survey and Open Research Issues Sep 17, 2020 Classification General Classification
— Unverified 0A dataset of clinically generated visual questions and answers about radiology images Nov 20, 2018 Decision Making Medical Visual Question Answering
— Unverified 02nd Place Solution to the GQA Challenge 2019 Jul 16, 2019 Question Answering Visual Question Answering
— Unverified 0Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool Mar 16, 2018 Question Answering Reinforcement Learning
— Unverified 0ISAAQ -- Mastering Textbook Questions with Pre-trained Transformers and Bottom-Up and Top-Down Attention Oct 1, 2020 Multiple-choice Question Answering
— Unverified 0Joint learning of object graph and relation graph for visual question answering May 9, 2022 Attribute Graph Neural Network
— Unverified 0Cross-Modal Retrieval Augmentation for Multi-Modal Classification Apr 16, 2021 Classification Cross-Modal Retrieval
— Unverified 0Cross-modal Knowledge Reasoning for Knowledge-based Visual Question Answering Aug 31, 2020 Knowledge Graphs Question Answering
— Unverified 0Cross-Modal Generative Augmentation for Visual Question Answering May 11, 2021 Data Augmentation Question Answering
— Unverified 0American == White in Multimodal Language-and-Image AI Jul 1, 2022 Image Captioning Question Answering
— Unverified 0A Dataset for Multimodal Question Answering in the Cultural Heritage Domain Dec 1, 2016 Question Answering Speech Recognition
— Unverified 0Interpretable Visual Question Answering by Visual Grounding from Attention Supervision Mining Aug 1, 2018 Question Answering Visual Grounding
— Unverified 0Interpretable Visual Question Answering by Reasoning on Dependency Trees Sep 6, 2018 Question Answering valid
— Unverified 0Interpretable Visual Question Answering via Reasoning Supervision Sep 7, 2023 Common Sense Reasoning Question Answering
— Unverified 0Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision Aug 1, 2020 Question Answering Visual Question Answering
— Unverified 0Crossformer: Transformer with Alternated Cross-Layer Guidance Sep 29, 2021 Inductive Bias Machine Translation
— Unverified 0Cross-Dataset Adaptation for Visual Question Answering Jun 10, 2018 Domain Adaptation Question Answering
— Unverified 0A Unified Framework for Multilingual and Code-Mixed Visual Question Answering Dec 1, 2020 Question Answering Visual Question Answering
— Unverified 0Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models Jan 20, 2020 Question Answering Visual Question Answering
— Unverified 0CQ-VQA: Visual Question Answering on Categorized Questions Feb 17, 2020 Question Answering Visual Question Answering
— Unverified 0Augmenting Image Question Answering Dataset by Exploiting Image Captions May 1, 2018 Data Augmentation Image Captioning
— Unverified 0CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment May 21, 2025 Language Modeling Language Modelling
— Unverified 0Co-VQA : Answering by Interactive Sub Question Sequence Apr 2, 2022 Question Answering Visual Question Answering
— Unverified 0``Look, some Green Circles!'': Learning to Quantify from Images Aug 1, 2016 Question Answering Visual Question Answering (VQA)
— Unverified 0Interpretable Visual Question Answering Referring to Outside Knowledge Mar 8, 2023 Diversity Image Captioning
— Unverified 0Co-VQA : Answering by Interactive Sub Question Sequence Nov 16, 2021 Question Answering Visual Question Answering
— Unverified 0Audio-Visual Quality Assessment for User Generated Content: Database and Method Mar 4, 2023 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 0Accounting for Focus Ambiguity in Visual Questions Jan 4, 2025 Question Answering Visual Question Answering
— Unverified 0Counterfactual Vision and Language Learning Jun 1, 2020 counterfactual Question Answering
— Unverified 0All You May Need for VQA are Image Captions Jan 16, 2022 All Image Captioning
— Unverified 0Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models Jan 3, 2025 Binary Classification Face Anti-Spoofing
— Unverified 0Attentive Explanations: Justifying Decisions and Pointing to the Evidence (Extended Abstract) Nov 17, 2017 Question Answering Visual Question Answering (VQA)
— Unverified 0Cost Function Dependent Barren Plateaus in Shallow Parametrized Quantum Circuits Jan 2, 2020 Visual Question Answering (VQA)
— Unverified 0Attentive Explanations: Justifying Decisions and Pointing to the Evidence Dec 14, 2016 Decision Making Question Answering
— Unverified 0Interpretable Medical Image Visual Question Answering via Multi-Modal Relationship Graph Learning Feb 19, 2023 Graph Learning Medical Visual Question Answering
— Unverified 0CoRe-MMRAG: Cross-Source Knowledge Reconciliation for Multimodal RAG Jun 3, 2025 Answer Generation RAG
— Unverified 0