| Bayesian Attention Belief Networks | Jun 9, 2021 | Decoder, Machine Translation | —Unverified | 0 |
| CVQA: Culturally-diverse Multilingual Visual Question Answering Benchmark | Jun 10, 2024 | Diversity, Question Answering | —Unverified | 0 |
| BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering | Jul 28, 2023 | Question Answering, Vietnamese Visual Question Answering | —Unverified | 0 |
| C-VQA: A Compositional Split of the Visual Question Answering (VQA) v1.0 Dataset | Apr 26, 2017 | Question Answering, Visual Question Answering | —Unverified | 0 |
| Interpretable Bilingual Multimodal Large Language Model for Diverse Biomedical Tasks | Oct 24, 2024 | Image Classification | —Unverified | 0 |
| Barriers in Integrating Medical Visual Question Answering into Radiology Workflows: A Scoping Review and Clinicians' Insights | Jul 9, 2025 | Diagnostic, Medical Visual Question Answering | —Unverified | 0 |
| Curriculum Script Distillation for Multilingual Visual Question Answering | Jan 17, 2023 | Question Answering, Visual Question Answering | —Unverified | 0 |
| A Causal Approach to Mitigate Modality Preference Bias in Medical Visual Question Answering | May 22, 2025 | Counterfactual, Medical Visual Question Answering | —Unverified | 0 |
| Curriculum Learning for Compositional Visual Reasoning | Mar 27, 2023 | Question Answering, Visual Question Answering | —Unverified | 0 |
| Curriculum Learning Effectively Improves Low Data VQA | Dec 1, 2021 | Question Answering, Visual Question Answering | —Unverified | 0 |
| An Empirical Study of Batch Normalization and Group Normalization in Conditional Computation | Jul 31, 2019 | Conditional Image Generation, Few-Shot Learning | —Unverified | 0 |
| Prompting Medical Large Vision-Language Models to Diagnose Pathologies by Visual Question Answering | Jul 31, 2024 | Diagnostic, Hallucination | —Unverified | 0 |
| Dynamic Clue Bottlenecks: Towards Interpretable-by-Design Visual Question Answering | May 24, 2023 | Question Answering, Visual Question Answering | —Unverified | 0 |
| CTRL-O: Language-Controllable Object-Centric Visual Representation Learning | Mar 27, 2025 | Image Generation, Object | —Unverified | 0 |
| Barking Up The Syntactic Tree: Enhancing VLM Training with Syntactic Losses | Dec 11, 2024 | Image-text Retrieval, Question Answering | —Unverified | 0 |
| CT-Agent: A Multimodal-LLM Agent for 3D CT Radiology Question Answering | May 22, 2025 | Computed Tomography (CT), Question Answering | —Unverified | 0 |
| CS-VQA: Visual Question Answering with Compressively Sensed Images | Jun 8, 2018 | Question Answering, Visual Question Answering | —Unverified | 0 |
| Balancing Performance and Efficiency in Zero-shot Robotic Navigation | Jun 5, 2024 | Computational Efficiency, Question Answering | —Unverified | 0 |
| CrossVQA: Scalably Generating Benchmarks for Systematically Testing VQA Generalization | Nov 1, 2021 | Answer Generation, Question-Answer-Generation | —Unverified | 0 |
| Cross-Modal Safety Mechanism Transfer in Large Vision-Language Models | Oct 16, 2024 | Visual Question Answering | —Unverified | 0 |
| Cross-Modal Retrieval Augmentation for Multi-Modal Classification | Apr 16, 2021 | Classification, Cross-Modal Retrieval | —Unverified | 0 |
| BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs | Jul 3, 2024 | Image Captioning, Image Generation | —Unverified | 0 |
| An Empirical Evaluation of Visual Question Answering for Novel Objects | Apr 8, 2017 | Question Answering, Visual Question Answering | —Unverified | 0 |
| Interpretable Counting for Visual Question Answering | Dec 23, 2017 | Question Answering, Visual Question Answering | —Unverified | 0 |
| Cross-modal Knowledge Reasoning for Knowledge-based Visual Question Answering | Aug 31, 2020 | Knowledge Graphs, Question Answering | —Unverified | 0 |