Knowledge Acquisition for Visual Question Answering via Iterative Querying Jul 1, 2017 Question Answering Visual Question Answering
Knowledge-Based Counterfactual Queries for Visual Question Answering Mar 5, 2023 counterfactual Decision Making
Knowledge-Based Visual Question Answering in Videos Apr 17, 2020 Question Answering Video Question Answering
Knowledge Condensation and Reasoning for Knowledge-based VQA Mar 15, 2024 Question Answering Reading Comprehension
Knowledge Detection by Relevant Question and Image Attributes in Visual Question Answering Jun 8, 2023 Question Answering Retrieval
KNVQA: A Benchmark for evaluation knowledge-based VQA Nov 21, 2023 Hallucination Object Hallucination
KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA Dec 20, 2020 Visual Question Answering (VQA)
KVL-BERT: Knowledge Enhanced Visual-and-Linguistic BERT for Visual Commonsense Reasoning Dec 13, 2020 Sentence Visual Commonsense Reasoning
KVQA: Knowledge-Aware Visual Question Answering Jul 17, 2019 Knowledge Graphs Question Answering
Language bias in Visual Question Answering: A Survey and Taxonomy Nov 16, 2021 Question Answering Visual Question Answering
Language Features Matter: Effective Language Representations for Vision-Language Tasks Aug 17, 2019 Image Captioning Language Modelling
Language Models are General-Purpose Interfaces Jun 13, 2022 Causal Language Modeling Few-Shot Learning
LAPDoc: Layout-Aware Prompting for Documents Feb 15, 2024 document understanding Key Information Extraction
Large Scale Scene Text Verification with Guided Attention Apr 23, 2018 Question Answering Scene Text Detection
Latent Image and Video Resolution Prediction using Convolutional Neural Networks Oct 17, 2024 Video Quality Assessment Visual Question Answering (VQA)
Latent Variable Models for Visual Question Answering Jan 16, 2021 Benchmarking Question Answering
LaVida Drive: Vision-Text Interaction VLM for Autonomous Driving with Token Selection, Recovery and Enhancement Nov 20, 2024 Autonomous Driving Computational Efficiency
LAVIS: A Library for Language-Vision Intelligence Sep 15, 2022 Benchmarking Image Captioning
LCV2: An Efficient Pretraining-Free Framework for Grounded Visual Question Answering Jan 29, 2024 Language Modeling Language Modelling
LEAF-QA: Locate, Encode & Attend for Figure Question Answering Jul 30, 2019 Chart Question Answering Question Answering
Learning Answer Embeddings for Visual Question Answering Jun 10, 2018 Question Answering Transfer Learning
Learning by Asking Questions Dec 4, 2017 Question Answering Visual Question Answering
Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision Oct 24, 2022 cross-modal alignment Cross-Modal Retrieval
Learning Compositional Representation for Few-shot Visual Question Answering Feb 21, 2021 Attribute Question Answering
Learning Models for Actions and Person-Object Interactions with Transfer to Question Answering Apr 16, 2016 General Classification Human-Object Interaction Detection
Learning Reasoning Paths over Semantic Graphs for Video-grounded Dialogues Mar 1, 2021 Question Answering Visual Question Answering
Learning Rich Image Region Representation for Visual Question Answering Oct 29, 2019 Language Modeling Language Modelling
Learning Sparse Mixture of Experts for Visual Question Answering Sep 19, 2019 Mixture-of-Experts Question Answering
Learning to Answer Multilingual and Code-Mixed Questions Nov 14, 2022 AI Agent Question Answering
Learning to Answer Questions From Image Using Convolutional Neural Network Jun 1, 2015 General Classification Question Answering
Learning to Collocate Neural Modules for Image Captioning Apr 18, 2019 Decoder Image Captioning
Learning to Compose Diversified Prompts for Image Emotion Classification Jan 26, 2022 Classification Emotion Classification
Learning to Compress Contexts for Efficient Knowledge-based Visual Question Answering Sep 11, 2024 Question Answering Visual Question Answering
Learning to Disambiguate by Asking Discriminative Questions Aug 9, 2017 Benchmarking Image Captioning
Learning to Reason Iteratively and Parallelly for Complex Visual Reasoning Scenarios Nov 20, 2024 Question Answering Visual Question Answering (VQA)
Neural Reasoning, Fast and Slow, for Video Question Answering Jul 10, 2019 Natural Questions Question Answering
Learning to Recognize the Unseen Visual Predicates Sep 25, 2019 Question Answering Visual Question Answering
Learning to Select Question-Relevant Relations for Visual Question Answering Jun 1, 2021 Graph Attention Question Answering
Learning to Specialize with Knowledge Distillation for Visual Question Answering Dec 1, 2018 General Classification General Knowledge
Learning Visual Knowledge Memory Networks for Visual Question Answering Jun 13, 2018 Question Answering Visual Question Answering
Learning What Makes a Difference from Counterfactual Examples and Gradient Supervision Apr 20, 2020 counterfactual image-classification
LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning? Mar 25, 2025 Autonomous Navigation Question Answering
Less Is More: Linear Layers on CLIP Features as Powerful VizWiz Model Jun 10, 2022 Question Answering Task 2
Let's ViCE! Mimicking Human Cognitive Behavior in Image Generation Evaluation Jul 18, 2023 Image Generation Question Answering
Leveraging Medical Visual Question Answering with Supporting Facts May 28, 2019 Diversity Medical Visual Question Answering
Leveraging Video Descriptions to Learn Video Question Answering Nov 12, 2016 Question Answering Video Question Answering
Leveraging Visual Question Answering for Image-Caption Ranking May 4, 2016 Image Retrieval Question Answering
Leveraging Visual Question Answering to Improve Text-to-Image Synthesis Oct 28, 2020 Auxiliary Learning Image Generation
Lingshu: A Generalist Foundation Model for Unified Multimodal Medical Understanding and Reasoning Jun 8, 2025 Medical Report Generation Question Answering
LinguaMark: Do Multimodal Models Speak Fairly? A Benchmark-Based Evaluation Jul 9, 2025 Question Answering Visual Question Answering