| Connective Cognition Network for Directional Visual Commonsense Reasoning | Dec 1, 2019 | Sentence, Visual Commonsense Reasoning | Code Available | 0 | 5 |
| Heterogeneous Graph Learning for Visual Commonsense Reasoning | Oct 25, 2019 | Graph Learning, Visual Commonsense Reasoning | Code Available | 0 | 5 |
| Compositional Image-Text Matching and Retrieval by Grounding Entities | May 4, 2025 | Image Captioning, Image-text matching | Code Available | 0 | 5 |
| TAB-VCR: Tags and Attributes based Visual Commonsense Reasoning Baselines | Oct 31, 2019 | Attribute, Question Answering | Code Available | 0 | 5 |
| VASR: Visual Analogies of Situation Recognition | Dec 8, 2022 | Common Sense Reasoning, Triplet | Code Available | 0 | 5 |
| ILLUME: Rationalizing Vision-Language Models through Human Interactions | Aug 17, 2022 | Image Captioning, Question Answering | Code Available | 0 | 5 |
| ViP-LLaVA: Making Large Multimodal Models Understand Arbitrary Visual Prompts | Dec 1, 2023 | Visual Commonsense Reasoning, Visual Prompting | Code Available | 0 | 5 |
| Fusion of Detected Objects in Text for Visual Question Answering | Aug 14, 2019 | Question Answering, Visual Commonsense Reasoning | Code Available | 0 | 5 |
| Cognitive Visual Commonsense Reasoning Using Dynamic Working Memory | Jul 4, 2021 | Question Answering, Scene Understanding | Code Available | 0 | 5 |
| From Recognition to Cognition: Visual Commonsense Reasoning | Nov 27, 2018 | Multiple-choice, Multiple Choice Question Answering (MCQA) | Code Available | 0 | 5 |