| Ontology-based knowledge representation for bone disease diagnosis: a foundation for safe and sustainable medical artificial intelligence systems | Jun 5, 2025 | Diagnostic, Multimodal Deep Learning | Unverified | 0 | 0 |
| Assistive Image Annotation Systems with Deep Learning and Natural Language Capabilities: A Review | Jun 28, 2024 | Active Learning, Image Captioning | Unverified | 0 | 0 |
| Towards Models that Can See and Read | Jan 18, 2023 | Decoder, Image Captioning | Unverified | 0 | 0 |
| Active Data Curation Effectively Distills Large-Scale Multimodal Models | Nov 27, 2024 | Decoder, Image Captioning | Unverified | 0 | 0 |
| Assisting Scene Graph Generation with Self-Supervision | Aug 8, 2020 | Graph Generation, Image Captioning | Unverified | 0 | 0 |
| Towards Omnidirectional Reasoning with 360-R1: A Dataset, Benchmark, and GRPO-based Method | May 20, 2025 | Hallucination, Object Localization | Unverified | 0 | 0 |
| Towards Reasoning-Aware Explainable VQA | Nov 9, 2022 | Decoder, Explanation Generation | Unverified | 0 | 0 |
| Assessing the Robustness of Visual Question Answering Models | Nov 30, 2019 | Question Answering, Visual Question Answering | Unverified | 0 | 0 |
| Towards Semantic Equivalence of Tokenization in Multimodal LLM | Jun 7, 2024 | Visual Question Answering | Unverified | 0 | 0 |
| Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering | Nov 29, 2023 | Common Sense Reasoning, Question Answering | Unverified | 0 | 0 |
| Towards Transparent AI Systems: Interpreting Visual Question Answering Models | Aug 31, 2016 | Question Answering, Visual Question Answering | Unverified | 0 | 0 |
| Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers | Jan 3, 2024 | Question Answering, Visual Grounding | Unverified | 0 | 0 |
| Assessing Image Quality Issues for Real-World Problems | Mar 27, 2020 | Image Captioning, Question Answering | Unverified | 0 | 0 |
| Towards Unsupervised Visual Reasoning: Do Off-The-Shelf Features Know How to Reason? | Dec 20, 2022 | Question Answering, Representation Learning | Unverified | 0 | 0 |
| Ask Me Anything: Free-form Visual Question Answering Based on Knowledge from External Sources | Nov 22, 2015 | Form, General Knowledge | Unverified | 0 | 0 |
| Asking questions on handwritten document collections | Oct 2, 2021 | Optical Character Recognition (OCR), Question Answering | Unverified | 0 | 0 |
| Towards Visual Dialog for Radiology | Jul 1, 2020 | Question Answering, Visual Dialog | Unverified | 0 | 0 |
| A Corpus for Visual Question Answering Annotated with Frame Semantic Information | May 1, 2020 | Question Answering, Visual Question Answering | Unverified | 0 | 0 |
| Toward Unsupervised Realistic Visual Question Answering | Mar 9, 2023 | Question Answering, Visual Question Answering | Unverified | 0 | 0 |
| Tracking the Copyright of Large Vision-Language Models through Parameter Learning Adversarial Images | Feb 23, 2025 | Adversarial Attack, Question Answering | Unverified | 0 | 0 |
| VSA4VQA: Scaling a Vector Symbolic Architecture to Visual Question Answering on Natural Images | May 6, 2024 | Attribute, Language Modeling | Unverified | 0 | 0 |
| Training Recurrent Answering Units with Joint Loss Minimization for VQA | Jun 12, 2016 | Question Answering, Visual Question Answering | Unverified | 0 | 0 |
| Transferable Adversarial Attacks on Black-Box Vision-Language Models | May 2, 2025 | Image Captioning, Object Recognition | Unverified | 0 | 0 |
| A Confidence-Based Interface for Neuro-Symbolic Visual Question Answering | Nov 21, 2021 | Question Answering, Translation | Unverified | 0 | 0 |
| MMPKUBase: A Comprehensive and High-quality Chinese Multi-modal Knowledge Graph | Aug 3, 2024 | Attribute, Contrastive Learning | Unverified | 0 | 0 |