| Visual Question Answering in the Medical Domain | Sep 20, 2023 | Contrastive LearningMedical Visual Question Answering | —Unverified | 0 |
| Visual Question Answering on 360° Images | Jan 10, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Question Answering on Image Sets | Aug 27, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Question Answering on Multiple Remote Sensing Image Modalities | May 21, 2025 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Question Answering Using Semantic Information from Image Descriptions | Apr 23, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Question Answering (VQA) on Images with Superimposed Text | Jun 13, 2023 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Question Answering with Memory-Augmented Networks | Jul 17, 2017 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Question Answering with Prior Class Semantics | May 4, 2020 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Question Answering with Question Representation Update (QRU) | Dec 1, 2016 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Question Generation as Dual Task of Visual Question Answering | Sep 21, 2017 | Question AnsweringQuestion Generation | —Unverified | 0 |
| Visual Question: Predicting If a Crowd Will Agree on the Answer | Aug 29, 2016 | Question Answeringvalid | —Unverified | 0 |
| Visual Question Reasoning on General Dependency Tree | Mar 31, 2018 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| Visual Reference Resolution using Attention Memory for Visual Dialog | Sep 23, 2017 | Parameter PredictionQuestion Answering | —Unverified | 0 |
| Visual Relationship Detection using Scene Graphs: A Survey | May 16, 2020 | Graph GenerationImage Generation | —Unverified | 0 |
| Visual Superordinate Abstraction for Robust Concept Learning | May 28, 2022 | AttributeQuestion Answering | —Unverified | 0 |
| Visual TTR - Modelling Visual Question Answering in Type Theory with Records | May 1, 2019 | Question AnsweringVisual Question Answering | —Unverified | 0 |
| ViT3D Alignment of LLaMA3: 3D Medical Image Report Generation | Oct 11, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 |
| ViUniT: Visual Unit Tests for More Robust Visual Programming | Dec 12, 2024 | Image GenerationImage-text matching | —Unverified | 0 |
| VL-BEiT: Generative Vision-Language Pretraining | Jun 2, 2022 | image-classificationImage Classification | —Unverified | 0 |
| VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment | Oct 12, 2024 | DiversityHallucination | —Unverified | 0 |
| VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks | Oct 7, 2024 | Information RetrievalLanguage Modeling | —Unverified | 0 |
| VLMAE: Vision-Language Masked Autoencoder | Aug 19, 2022 | Image-text RetrievalLanguage Modeling | —Unverified | 0 |
| VL-Mamba: Exploring State Space Models for Multimodal Learning | Mar 20, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VLM-Assisted Continual learning for Visual Question Answering in Self-Driving | Feb 2, 2025 | Autonomous DrivingContinual Learning | —Unverified | 0 |
| VLR-Bench: Multilingual Benchmark Dataset for Vision-Language Retrieval Augmented Generation | Dec 13, 2024 | Instruction FollowingQuestion Answering | —Unverified | 0 |