An Empirical Comparison of Optimizers for Quantum Machine Learning with SPSA-based Gradients Apr 27, 2023 Quantum Machine Learning Visual Question Answering (VQA)
— Unverified 00 An Empirical Evaluation of Visual Question Answering for Novel Objects Apr 8, 2017 Question Answering Visual Question Answering
— Unverified 00 An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation Jul 31, 2019 Conditional Image Generation Few-Shot Learning
— Unverified 00 An Empirical Study on Leveraging Scene Graphs for Visual Question Answering Jul 28, 2019 Knowledge Graphs Question Answering
— Unverified 00 An Empirical Study on the Generalization Power of Neural Representations Learned via Visual Guessing Games Jan 31, 2021 Question Answering Visual Question Answering
— Unverified 00 An Empirical Study on the Language Modal in Visual Question Answering May 17, 2023 Question Answering Visual Question Answering
— Unverified 00 An Evaluation of GPT-4V and Gemini in Online VQA Dec 17, 2023 Question Answering Visual Question Answering
— Unverified 00 An Evaluation of Image-Based Verb Prediction Models against Human Eye-Tracking Data Jun 1, 2018 General Classification Question Answering
— Unverified 00 An experimental study of the vision-bottleneck in VQA Feb 14, 2022 Object Question Answering
— Unverified 00 Annotation Methodologies for Vision and Language Dataset Creation Jul 10, 2016 Action Recognition Image Description
— Unverified 00 A Novel Attention-based Aggregation Function to Combine Vision and Language Apr 27, 2020 General Classification Image Captioning
— Unverified 00 A Novel Framework for Robustness Analysis of Visual QA Models Nov 16, 2017 Question Answering Visual Question Answering
— Unverified 00 A Novel Stochastic LSTM Model Inspired by Quantum Machine Learning May 17, 2023 Quantum Machine Learning Visual Question Answering (VQA)
— Unverified 00 Answer-checking in Context: A Multi-modal Fully Attention Network for Visual Question Answering Oct 17, 2020 Question Answering Visual Question Answering
— Unverified 00 Answer-Me: Multi-Task Open-Vocabulary Visual Question Answering May 2, 2022 Decoder Image Captioning
— Unverified 00 Answer-Type Prediction for Visual Question Answering Jun 1, 2016 Object Recognition Prediction
— Unverified 00 AOR: Anatomical Ontology-Guided Reasoning for Medical Large Multimodal Model in Chest X-Ray Interpretation May 5, 2025 Anatomy Diagnostic
— Unverified 00 A Picture May Be Worth a Hundred Words for Visual Question Answering Jun 25, 2021 Data Augmentation Descriptive
— Unverified 00 Application of Multimodal Large Language Models in Autonomous Driving Dec 21, 2024 Autonomous Driving Decision Making
— Unverified 00 ArcSin: Adaptive ranged cosine Similarity injected noise for Language-Driven Visual Tasks Feb 27, 2024 Domain Generalization Image Captioning
— Unverified 00 A reinforcement learning approach for VQA validation: an application to diabetic macular edema grading Jul 19, 2023 Medical Image Analysis Question Answering
— Unverified 00 A Reinforcement Learning Framework for Natural Question Generation using Bi-discriminators Aug 1, 2018 Attribute Natural Questions
— Unverified 00 A Restricted Visual Turing Test for Deep Scene and Event Understanding Dec 6, 2015 Question Answering Video Captioning
— Unverified 00 A review of Quantum Neural Networks: Methods, Models, Dilemma Sep 4, 2021 Computational Efficiency Visual Question Answering (VQA)
— Unverified 00 Are VQA Systems RAD? Measuring Robustness to Augmented Data with Focused Interventions Jun 8, 2021 Question Answering Visual Question Answering
— Unverified 00 Are we asking the right questions in MovieQA? Nov 8, 2019 Question Answering Visual Question Answering
— Unverified 00 Are we pretraining it right? Digging deeper into visio-linguistic pretraining Apr 19, 2020 Visual Question Answering (VQA)
— Unverified 00 Are You Smarter Than a Sixth Grader? Textbook Question Answering for Multimodal Machine Comprehension Jul 1, 2017 Question Answering Reading Comprehension
— Unverified 00 Are You Talking to a Machine? Dataset and Methods for Multilingual Image Question Dec 1, 2015 Question Answering Sentence
— Unverified 00 Are You Talking to Me? Reasoned Visual Dialog Generation through Adversarial Learning Nov 21, 2017 Question Answering Reinforcement Learning
— Unverified 00 ASCD: Attention-Steerable Contrastive Decoding for Reducing Hallucination in MLLM Jun 17, 2025 Hallucination Language Modeling
— Unverified 00 A Spectrum Evaluation Benchmark for Medical Multi-Modal Large Language Models Feb 17, 2024 Diagnostic Visual Question Answering (VQA)
— Unverified 00 A Shared Task on Multimodal Machine Translation and Crosslingual Image Description Aug 1, 2016 Image Description Image Retrieval
— Unverified 00 A Short Survey of Systematic Generalization Nov 22, 2022 Survey Systematic Generalization
— Unverified 00 Asking More Informative Questions for Grounded Retrieval Nov 14, 2023 Question Answering Question Selection
— Unverified 00 Asking questions on handwritten document collections Oct 2, 2021 Optical Character Recognition (OCR) Question Answering
— Unverified 00 Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources Nov 22, 2015 Form General Knowledge
— Unverified 00 Assessing Image Quality Issues for Real-World Problems Mar 27, 2020 Image Captioning Question Answering
— Unverified 00 Assessing the Robustness of Visual Question Answering Models Nov 30, 2019 Question Answering Visual Question Answering
— Unverified 00 Assessing Visual Quality of Omnidirectional Videos Jul 14, 2019 Visual Question Answering (VQA)
— Unverified 00 Assessment of Subjective and Objective Quality of Live Streaming Sports Videos Jun 15, 2021 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 00 Assisting Scene Graph Generation with Self-Supervision Aug 8, 2020 Graph Generation Image Captioning
— Unverified 00 Astrea: A MOE-based Visual Understanding Model with Progressive Alignment Mar 12, 2025 Contrastive Learning Cross-Modal Retrieval
— Unverified 00 A Study on Multimodal and Interactive Explanations for Visual Question Answering Mar 1, 2020 Explainable Artificial Intelligence (XAI) Prediction
— Unverified 00 A survey on knowledge-enhanced multimodal learning Nov 19, 2022 Conditional Image Generation Factual Visual Question Answering
— Unverified 00 A survey on VQA: Datasets and Approaches May 2, 2021 Question Answering Survey
— Unverified 00 A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question Answering Jan 14, 2022 Generative Question Answering Image to text
— Unverified 00 A Token-level Text Image Foundation Model for Document Understanding Mar 4, 2025 document understanding Visual Question Answering (VQA)
— Unverified 00 A Transformer-based Cross-modal Fusion Model with Adversarial Training for VQA Challenge 2021 Jun 24, 2021 Visual Question Answering (VQA)
— Unverified 00 Attention Guided Semantic Relationship Parsing for Visual Question Answering Oct 5, 2020 Object Question Answering
— Unverified 00