Transfer Learning via Unsupervised Task Discovery for Visual Question Answering Oct 3, 2018 Question Answering Transfer Learning
Code Code Available 0What's Different between Visual Question Answering for Machine "Understanding" Versus for Accessibility? Oct 26, 2022 Benchmarking Question Answering
Code Code Available 0Convincing Rationales for Visual Question Answering Reasoning Feb 6, 2024 Question Answering Visual Question Answering
Code Code Available 0Transformer Module Networks for Systematic Generalization in Visual Question Answering Jan 27, 2022 Question Answering Systematic Generalization
Code Code Available 0Robust Explanations for Visual Question Answering Jan 23, 2020 Question Answering Visual Question Answering
Code Code Available 0HAIBU-ReMUD: Reasoning Multimodal Ultrasound Dataset and Model Bridging to General Specific Domains Jun 9, 2025 Diagnostic Question Answering
Code Code Available 0Attribute Diversity Determines the Systematicity Gap in VQA Nov 15, 2023 Attribute Diagnostic
Code Code Available 0Visual Contexts Clarify Ambiguous Expressions: A Benchmark Dataset Nov 21, 2024 Question Answering Visual Grounding
Code Code Available 0Visual Coreference Resolution in Visual Dialog using Neural Module Networks Sep 6, 2018 Common Sense Reasoning coreference-resolution
Code Code Available 0Transparency by Design: Closing the Gap Between Performance and Interpretability in Visual Reasoning Mar 14, 2018 Question Answering Visual Question Answering
Code Code Available 0Hadamard Product for Low-rank Bilinear Pooling Oct 14, 2016 Visual Question Answering Visual Question Answering (VQA)
Code Code Available 0Routing Networks and the Challenges of Modular and Compositional Computation Apr 29, 2019 Language Modeling Language Modelling
Code Code Available 0RSAdapter: Adapting Multimodal Models for Remote Sensing Visual Question Answering Oct 19, 2023 Image Captioning Question Answering
Code Code Available 0Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types Sep 14, 2024 Language Modeling Language Modelling
Code Code Available 0Contrastive Visual-Linguistic Pretraining Jul 26, 2020 Contrastive Learning regression
Code Code Available 0Evaluating Point Cloud from Moving Camera Videos: A No-Reference Metric Aug 30, 2022 Image Quality Assessment Point Cloud Quality Assessment
Code Code Available 0Grounding Answers for Visual Questions Asked by Visually Impaired People Feb 4, 2022 Question Answering Visual Question Answering
Code Code Available 0RUBi: Reducing Unimodal Biases for Visual Question Answering Dec 1, 2019 Question Answering Visual Question Answering
Code Code Available 0RUBi: Reducing Unimodal Biases in Visual Question Answering Jun 24, 2019 Question Answering Visual Question Answering
Code Code Available 0Grad-CAM: Why did you say that? Nov 22, 2016 Image Captioning Visual Question Answering
Code Code Available 0GradBias: Unveiling Word Influence on Bias in Text-to-Image Generative Models Aug 29, 2024 Bias Detection Fairness
Code Code Available 0RVTBench: A Benchmark for Visual Reasoning Tasks May 17, 2025 Reasoning Segmentation Visual Question Answering (VQA)
Code Code Available 0Attention on Attention: Architectures for Visual Question Answering (VQA) Mar 21, 2018 GPU Question Answering
Code Code Available 0Ask Your Neurons: A Deep Learning Approach to Visual Question Answering May 9, 2016 Question Answering Visual Question Answering
Code Code Available 0Generalizing Visual Question Answering from Synthetic to Human-Written Questions via a Chain of QA with a Large Language Model Jan 12, 2024 Language Modeling Language Modelling
Code Code Available 0General Greedy De-bias Learning Dec 20, 2021 image-classification Image Classification
Code Code Available 0What's in a Question: Using Visual Questions as a Form of Supervision Apr 12, 2017 Data Augmentation Form
Code Code Available 0A Neuro-Symbolic ASP Pipeline for Visual Question Answering May 16, 2022 Question Answering Visual Question Answering
Code Code Available 0GAMIVAL: Video Quality Prediction on Mobile Cloud Gaming Content May 3, 2023 Video Quality Assessment Visual Question Answering (VQA)
Code Code Available 0An Efficient Modern Baseline for FloodNet VQA May 30, 2022 Management Visual Question Answering (VQA)
Code Code Available 0Black-box Model Ensembling for Textual and Visual Question Answering via Information Fusion Jul 4, 2024 Question Answering Visual Question Answering
Code Code Available 0Game of Sketches: Deep Recurrent Models of Pictionary-style Word Guessing Jan 29, 2018 Question Answering Visual Question Answering
Code Code Available 0Ask, Attend and Answer: Exploring Question-Guided Spatial Attention for Visual Question Answering Nov 17, 2015 Image Captioning Question Answering
Code Code Available 0TUBench: Benchmarking Large Vision-Language Models on Trustworthiness with Unanswerable Questions Oct 5, 2024 Benchmarking Hallucination
Code Code Available 0Tutorial on Answering Questions about Images with Deep Learning Oct 4, 2016 Deep Learning Natural Language Understanding
Code Code Available 0Scene Graph Prediction with Limited Labels Apr 25, 2019 Knowledge Base Completion Prediction
Code Code Available 0Continual VQA for Disaster Response Systems Sep 21, 2022 Disaster Response Management
Code Code Available 0What value do explicit high level concepts have in vision to language problems? Jun 3, 2015 Image Captioning Question Answering
Code Code Available 0FVQ: A Large-Scale Dataset and A LMM-based Method for Face Video Quality Assessment Apr 12, 2025 Video Quality Assessment Visual Question Answering (VQA)
Code Code Available 0Two-Level Approach for No-Reference Consumer Video Quality Assessment Jun 20, 2019 Video Quality Assessment Visual Question Answering (VQA)
Code Code Available 0Zero-shot Visual Question Answering with Language Model Feedback May 26, 2023 Language Modeling Language Modelling
Code Code Available 0Fully Authentic Visual Question Answering Dataset from Online Communities Nov 27, 2023 Question Answering Visual Question Answering
Code Code Available 0Analyzing the Behavior of Visual Question Answering Models Jun 23, 2016 Question Answering Visual Question Answering
Code Code Available 0Adapting Visual Question Answering Models for Enhancing Multimodal Community Q&A Platforms Aug 29, 2018 Community Question Answering General Classification
Code Code Available 0From Images to Textual Prompts: Zero-shot VQA with Frozen Large Language Models Dec 21, 2022 Question Answering Visual Question Answering
Code Code Available 0A simple neural network module for relational reasoning Jun 5, 2017 Image Retrieval with Multi-Modal Query Question Answering
Code Code Available 0Visually Grounded VQA by Lattice-based Retrieval Nov 15, 2022 Information Retrieval Question Answering
Code Code Available 0VQA4CIR: Boosting Composed Image Retrieval with Visual Question Answering Dec 19, 2023 Image Retrieval Question Answering
Code Code Available 0UGC Quality Assessment: Exploring the Impact of Saliency in Deep Feature-Based Quality Assessment Aug 13, 2023 Video Quality Assessment Visual Question Answering (VQA)
Code Code Available 0FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question Answering May 27, 2025 Benchmarking Question Answering
Code Code Available 0