| BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | Jan 30, 2023 | Generative Visual Question AnsweringImage Captioning | CodeCode Available | 4 |
| UnICLAM:Contrastive Representation Learning with Adversarial Masking for Unified and Interpretable Medical Vision Question Answering | Dec 21, 2022 | Data AugmentationDecision Making | —Unverified | 0 |
| Self-supervised vision-language pretraining for Medical visual question answering | Nov 24, 2022 | Contrastive LearningImage-text matching | CodeCode Available | 1 |
| MF2-MVQA: A Multi-stage Feature Fusion method for Medical Visual Question Answering | Nov 11, 2022 | Medical Visual Question AnsweringQuestion Answering | —Unverified | 0 |
| A Dual-Attention Learning Network with Word and Sentence Embedding for Medical Visual Question Answering | Oct 1, 2022 | Medical Visual Question AnsweringQuestion Answering | CodeCode Available | 0 |
| RepsNet: Combining Vision with Language for Automated Medical Reports | Sep 27, 2022 | Contrastive LearningDecoder | —Unverified | 0 |
| OVQA: A Clinically Generated Visual Question Answering Dataset | Jul 7, 2022 | BenchmarkingMedical Visual Question Answering | —Unverified | 0 |
| ViLMedic: a framework for research at the intersection of vision and language in medical AI | May 1, 2022 | Medical Visual Question AnsweringQuestion Answering | CodeCode Available | 0 |
| Flamingo: a Visual Language Model for Few-Shot Learning | Apr 29, 2022 | Few-Shot LearningGenerative Visual Question Answering | CodeCode Available | 4 |
| Does CLIP Benefit Visual Question Answering in the Medical Domain as Much as it Does in the General Domain? | Dec 27, 2021 | ArticlesMedical Visual Question Answering | —Unverified | 0 |
| Medical Visual Question Answering: A Survey | Nov 19, 2021 | Medical Visual Question AnsweringQuestion Answering | —Unverified | 0 |
| V-Learning -- A Simple, Efficient, Decentralized Algorithm for Multiagent RL | Oct 27, 2021 | Medical Visual Question AnsweringQ-Learning | —Unverified | 0 |
| MuVAM: A Multi-View Attention-based Model for Medical Visual Question Answering | Jul 7, 2021 | Medical Visual Question AnsweringMissing Labels | —Unverified | 0 |
| Multi-modal Understanding and Generation for Medical Images and Text via Vision-Language Pre-Training | May 24, 2021 | Image CaptioningMedical Visual Question Answering | CodeCode Available | 1 |
| Multiple Meta-model Quantifying for Medical Visual Question Answering | May 19, 2021 | Medical Visual Question AnsweringMeta-Learning | CodeCode Available | 1 |
| SLAKE: A Semantically-Labeled Knowledge-Enhanced Dataset for Medical Visual Question Answering | Feb 18, 2021 | Medical Visual Question AnsweringQuestion Answering | CodeCode Available | 1 |
| Hierarchical Deep Multi-modal Network for Medical Visual Question Answering | Sep 27, 2020 | DescriptiveMedical Visual Question Answering | CodeCode Available | 0 |
| A Comparison of Pre-trained Vision-and-Language Models for Multimodal Representation Learning across Medical Images and Reports | Sep 3, 2020 | Image-text RetrievalMedical Visual Question Answering | CodeCode Available | 1 |
| PathVQA: 30000+ Questions for Medical Visual Question Answering | Mar 7, 2020 | AI AgentMedical Visual Question Answering | CodeCode Available | 1 |
| Overcoming Data Limitation in Medical Visual Question Answering | Sep 26, 2019 | DenoisingMedical Visual Question Answering | CodeCode Available | 1 |
| Leveraging Medical Visual Question Answering with Supporting Facts | May 28, 2019 | DiversityMedical Visual Question Answering | —Unverified | 0 |
| A dataset of clinically generated visual questions and answers about radiology images | Nov 20, 2018 | Decision MakingMedical Visual Question Answering | —Unverified | 0 |