| Adapting Lightweight Vision Language Models for Radiological Visual Question Answering | Jun 17, 2025 | DiagnosticQuestion Answering | CodeCode Available | 0 |
| Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering | Apr 22, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Bilaterally Slimmable Transformer for Elastic and Efficient Visual Question Answering | Mar 24, 2022 | GPUQuestion Answering | CodeCode Available | 0 |
| BLOCK: Bilinear Superdiagonal Fusion for Visual Question Answering and Visual Relationship Detection | Jan 31, 2019 | Question AnsweringRelationship Detection | CodeCode Available | 0 |
| Don't Just Assume; Look and Answer: Overcoming Priors for Visual Question Answering | Dec 1, 2017 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Does Chain-of-Thought Reasoning Help Mobile GUI Agent? An Empirical Study | Mar 21, 2025 | AttributeMathematical Problem-Solving | CodeCode Available | 0 |
| Resource-efficient Inference with Foundation Model Programs | Apr 9, 2025 | modelQuestion Answering | CodeCode Available | 0 |
| Is Multimodal Vision Supervision Beneficial to Language? | Feb 10, 2023 | Image RetrievalNatural Language Understanding | CodeCode Available | 0 |
| DocMIA: Document-Level Membership Inference Attacks against DocVQA Models | Feb 6, 2025 | document understandingInference Attack | CodeCode Available | 0 |
| DLaVA: Document Language and Vision Assistant for Answer Localization with Enhanced Interpretability and Trustworthiness | Nov 29, 2024 | Optical Character Recognition (OCR)Question Answering | CodeCode Available | 0 |
| Zero-shot Visual Question Answering with Language Model Feedback | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering | Nov 17, 2015 | Image CaptioningQuestion Answering | CodeCode Available | 0 |
| IQ-VQA: Intelligent Visual Question Answering | Jul 8, 2020 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |
| Discrete Subgraph Sampling for Interpretable Graph based Visual Question Answering | Dec 11, 2024 | Explainable artificial intelligenceExplainable Artificial Intelligence (XAI) | CodeCode Available | 0 |
| A simple neural network module for relational reasoning | Jun 5, 2017 | Image Retrieval with Multi-Modal QueryQuestion Answering | CodeCode Available | 0 |
| Towards Knowledge-Augmented Visual Question Answering | Dec 1, 2020 | General KnowledgeGraph Attention | CodeCode Available | 0 |
| Towards Language-guided Visual Recognition via Dynamic Convolutions | Oct 17, 2021 | Question AnsweringReferring Expression | CodeCode Available | 0 |
| REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory | Dec 10, 2022 | Image CaptioningLanguage Modeling | CodeCode Available | 0 |
| IQA: Visual Question Answering in Interactive Environments | Dec 9, 2017 | NavigateReinforcement Learning | CodeCode Available | 0 |
| Revisiting CroPA: A Reproducibility Study and Enhancements for Cross-Prompt Adversarial Transferability in Vision-Language Models | Jun 28, 2025 | image-classificationImage Classification | CodeCode Available | 0 |
| Discovering the Unknown Knowns: Turning Implicit Knowledge in the Dataset into Explicit Training Examples for Visual Question Answering | Sep 13, 2021 | Data AugmentationQuestion Answering | CodeCode Available | 0 |
| Revisiting Visual Question Answering Baselines | Jun 27, 2016 | Binary ClassificationMultiple-choice | CodeCode Available | 0 |
| iParaphrasing: Extracting Visually Grounded Paraphrases via an Image | Jun 12, 2018 | Image CaptioningQuestion Answering | CodeCode Available | 0 |
| BioD2C: A Dual-level Semantic Consistency Constraint Framework for Biomedical VQA | Mar 4, 2025 | Medical DiagnosisQuestion Answering | CodeCode Available | 0 |
| REXUP: I REason, I EXtract, I UPdate with Structured Compositional Reasoning for Visual Question Answering | Jul 27, 2020 | Question AnsweringVisual Question Answering | CodeCode Available | 0 |