| Understanding Information Storage and Transfer in Multi-modal Large Language Models | Jun 6, 2024 | Factual Visual Question AnsweringModel Editing | —Unverified | 0 |
| A Survey on Interpretable Cross-modal Reasoning | Sep 5, 2023 | Cross-Modal RetrievalDecision Making | CodeCode Available | 1 |
| A survey on knowledge-enhanced multimodal learning | Nov 19, 2022 | Conditional Image GenerationFactual Visual Question Answering | —Unverified | 0 |
| Out of the Box: Reasoning with Graph Convolution Nets for Factual Visual Question Answering | Nov 1, 2018 | Factual Visual Question AnsweringGeneral Knowledge | —Unverified | 0 |
| Straight to the Facts: Learning Knowledge Base Retrieval for Factual Visual Question Answering | Sep 4, 2018 | Factual Visual Question AnsweringGeneral Knowledge | —Unverified | 0 |