| The curse of language biases in remote sensing VQA: the role of spatial attributes, language diversity, and the need for clear evaluation | Nov 28, 2023 | Diversity, Question Answering | —Unverified | 0 | 0 |
| The Forgettable-Watcher Model for Video Question Answering | May 3, 2017 | model, Question Answering | —Unverified | 0 | 0 |
| AdaCoder: Adaptive Prompt Compression for Programmatic Visual Question Answering | Jul 28, 2024 | Question Answering, Visual Question Answering | —Unverified | 0 | 0 |
| The Impact of Explanations on AI Competency Prediction in VQA | Jul 2, 2020 | AI Agent, Language Modeling | —Unverified | 0 | 0 |
| The meaning of "most" for visual question answering models | Dec 31, 2018 | Question Answering, Visual Question Answering | —Unverified | 0 | 0 |
| The Meaning of "Most" for Visual Question Answering Models | Aug 1, 2019 | Question Answering, Visual Question Answering | —Unverified | 0 | 0 |
| VQA-Diff: Exploiting VQA and Diffusion for Zero-Shot Image-to-3D Vehicle Asset Generation in Autonomous Driving | Jul 9, 2024 | Autonomous Driving, Image to 3D | —Unverified | 0 | 0 |
| VQA-E: Explaining, Elaborating, and Enhancing Your Answers for Visual Questions | Mar 20, 2018 | Explanatory Visual Question Answering, Multi-Task Learning | —Unverified | 0 | 0 |
| The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering | Jan 13, 2025 | Common Sense Reasoning, Question Answering | —Unverified | 0 | 0 |
| A Vision Centric Remote Sensing Benchmark | Mar 20, 2025 | Question Answering, Representation Learning | —Unverified | 0 | 0 |
| The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions | Dec 16, 2016 | BIG-bench Machine Learning, Question Answering | —Unverified | 0 | 0 |
| The Wisdom of MaSSeS: Majority, Subjectivity, and Semantic Similarity in the Evaluation of VQA | Sep 12, 2018 | Question Answering, Semantic Similarity | —Unverified | 0 | 0 |
| AVIS: Autonomous Visual Information Seeking with Large Language Model Agent | Jun 13, 2023 | Decision Making, Language Modeling | —Unverified | 0 | 0 |
| TI-JEPA: An Innovative Energy-based Joint Embedding Strategy for Text-Image Multimodal Systems | Mar 9, 2025 | Multimodal Sentiment Analysis, Question Answering | —Unverified | 0 | 0 |
| VQA-GEN: A Visual Question Answering Benchmark for Domain Generalization | Nov 1, 2023 | Domain Generalization, Question Answering | —Unverified | 0 | 0 |
| VQA-GNN: Reasoning with Multimodal Knowledge via Graph Neural Networks for Visual Question Answering | May 23, 2022 | Knowledge Graphs, Question Answering | —Unverified | 0 | 0 |
| TinyDrive: Multiscale Visual Question Answering with Selective Token Routing for Autonomous Driving | May 21, 2025 | Autonomous Driving, Question Answering | —Unverified | 0 | 0 |
| Auto-Parsing Network for Image Captioning and Visual Question Answering | Aug 24, 2021 | Image Captioning, Question Answering | —Unverified | 0 | 0 |
| A Unified Framework for Multilingual and Code-Mixed Visual Question Answering | Dec 1, 2020 | Question Answering, Visual Question Answering | —Unverified | 0 | 0 |
| TinyVQA: Compact Multimodal Deep Neural Network for Visual Question Answering on Resource-Constrained Devices | Apr 4, 2024 | Quantization, Question Answering | —Unverified | 0 | 0 |
| VQA-LOL: Visual Question Answering under the Lens of Logic | Feb 19, 2020 | Negation, Question Answering | —Unverified | 0 | 0 |
| TM-PATHVQA: 90000+ Textless Multilingual Questions for Medical Visual Question Answering | Jul 16, 2024 | Medical Visual Question Answering, Question Answering | —Unverified | 0 | 0 |
| TokenFocus-VQA: Enhancing Text-to-Image Alignment with Position-Aware Focus and Multi-Perspective Aggregations on LVLMs | Apr 10, 2025 | Ensemble Learning, Position | —Unverified | 0 | 0 |
| Attentive Explanations: Justifying Decisions and Pointing to the Evidence | Dec 14, 2016 | Decision Making, Question Answering | —Unverified | 0 | 0 |
| Attention Overlap Is Responsible for The Entity Missing Problem in Text-to-image Diffusion Models! | Oct 28, 2024 | Denoising, Question Answering | —Unverified | 0 | 0 |