| PAR: Prompt-Aware Token Reduction Method for Efficient Large Multimodal Models | Oct 9, 2024 | Question AnsweringRetrieval | —Unverified | 0 | 0 |
| Retrieving Visual Facts For Few-Shot Visual Question Answering | Jan 16, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Reusable Slotwise Mechanisms | Feb 21, 2023 | Future predictionObject | —Unverified | 0 | 0 |
| Visual Question Answering in the Medical Domain | Sep 20, 2023 | Contrastive LearningMedical Visual Question Answering | —Unverified | 0 | 0 |
| Chop Chop BERT: Visual Question Answering by Chopping VisualBERT's Heads | Apr 30, 2021 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| Visual Question Answering on 360° Images | Jan 10, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| Revisiting Multi-Modal LLM Evaluation | Aug 9, 2024 | Chart UnderstandingOptical Character Recognition | —Unverified | 0 | 0 |
| Visual Question Answering on Image Sets | Aug 27, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| ChitroJera: A Regionally Relevant Visual Question Answering Dataset for Bangla | Oct 19, 2024 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| ReWind: Understanding Long Videos with Instructed Learnable Memory | Nov 23, 2024 | Large Language ModelQuestion Answering | —Unverified | 0 | 0 |
| Visual Question Answering on Multiple Remote Sensing Image Modalities | May 21, 2025 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| ReXVQA: A Large-scale Visual Question Answering Benchmark for Generalist Chest X-ray Understanding | Jun 4, 2025 | NegationNegation Detection | —Unverified | 0 | 0 |
| A Causal Approach to Mitigate Modality Preference Bias in Medical Visual Question Answering | May 22, 2025 | counterfactualMedical Visual Question Answering | —Unverified | 0 | 0 |
| CHIC: Corporate Document for Visual question Answering | May 1, 2023 | Information RetrievalQuestion Answering | —Unverified | 0 | 0 |
| RL-CSDia: Representation Learning of Computer Science Diagrams | Mar 10, 2021 | Question AnsweringRepresentation Learning | —Unverified | 0 | 0 |
| Charting the Future: Using Chart Question-Answering for Scalable Evaluation of LLM-Driven Data Visualizations | Sep 27, 2024 | Chart Question AnsweringQuestion Answering | —Unverified | 0 | 0 |
| R-LLaVA: Improving Med-VQA Understanding through Visual Region of Interest | Oct 27, 2024 | Medical Visual Question AnsweringMultiple-choice | —Unverified | 0 | 0 |
| RMLVQA: A Margin Loss Approach for Visual Question Answering With Language Biases | Jan 1, 2023 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| Robo2VLM: Visual Question Answering from Large-Scale In-the-Wild Robot Manipulation Datasets | May 21, 2025 | Dataset GenerationDescriptive | —Unverified | 0 | 0 |
| RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis | Feb 25, 2024 | Code GenerationMultimodal Reasoning | —Unverified | 0 | 0 |
| RoboMamba: Efficient Vision-Language-Action Model for Robotic Reasoning and Manipulation | Jun 6, 2024 | Common Sense ReasoningMamba | —Unverified | 0 | 0 |
| Robotic Environmental State Recognition with Pre-Trained Vision-Language Models and Black-Box Optimization | Sep 26, 2024 | Image to textImage-to-Text Retrieval | —Unverified | 0 | 0 |
| Visual Question Answering Using Semantic Information from Image Descriptions | Apr 23, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |
| Characterizing Misclassifications of Deep NLP Models | Mar 12, 2021 | named-entity-recognitionNamed Entity Recognition | —Unverified | 0 | 0 |
| Robustness Analysis of Visual QA Models by Basic Questions | Sep 14, 2017 | Question AnsweringVisual Question Answering | —Unverified | 0 | 0 |