| TAB-VCR: Tags and Attributes based VCR Baselines | Dec 1, 2019 | AttributeQuestion Answering | CodeCode Available | 0 |
| Think Visually: Question Answering through Virtual Imagery | May 25, 2018 | Question AnsweringVisual Commonsense Reasoning | CodeCode Available | 0 |
| Learning to Correction: Explainable Feedback Generation for Visual Commonsense Reasoning Distractor | Dec 8, 2024 | MisconceptionsMultiple-choice | CodeCode Available | 0 |
| ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts | Dec 1, 2023 | Visual Commonsense ReasoningVisual Prompting | CodeCode Available | 0 |
| Joint Answering and Explanation for Visual Commonsense Reasoning | Feb 25, 2022 | Knowledge DistillationQuestion Answering | CodeCode Available | 0 |
| Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory | Jul 4, 2021 | Question AnsweringScene Understanding | CodeCode Available | 0 |
| Connective Cognition Network for Directional Visual Commonsense Reasoning | Dec 1, 2019 | SentenceVisual Commonsense Reasoning | CodeCode Available | 0 |
| Interpretable Visual Understanding with Cognitive Attention Network | Aug 6, 2021 | Scene UnderstandingVisual Commonsense Reasoning | CodeCode Available | 0 |
| Heterogeneous Graph Learning for Visual Commonsense Reasoning | Oct 25, 2019 | Graph LearningVisual Commonsense Reasoning | CodeCode Available | 0 |
| ILLUME: Rationalizing Vision-Language Models through Human Interactions | Aug 17, 2022 | Image CaptioningQuestion Answering | CodeCode Available | 0 |