What BERT Sees: Cross-Modal Transfer for Visual Question Generation Feb 25, 2020 Question Generation Question-Generation
— Unverified 0RankDVQA: Deep VQA based on Ranking-inspired Hybrid Training Feb 17, 2022 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 0An Empirical Evaluation of Visual Question Answering for Novel Objects Apr 8, 2017 Question Answering Visual Question Answering
— Unverified 0Generating Question Relevant Captions to Aid Visual Question Answering Jun 3, 2019 General Knowledge Image Captioning
— Unverified 0Deep Video Quality Assessor: From Spatio-temporal Visual Sensitivity to A Convolutional Neural Aggregation Network Sep 1, 2018 Sensitivity Video Quality Assessment
— Unverified 0Deep Quality Assessment of Compressed Videos: A Subjective and Objective Study May 7, 2022 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 0Benchmarking Multimodal Models for Ukrainian Language Understanding Across Academic and Cultural Domains Nov 22, 2024 Benchmarking Caption Generation
— Unverified 0An Empirical Comparison of Optimizers for Quantum Machine Learning with SPSA-based Gradients Apr 27, 2023 Quantum Machine Learning Visual Question Answering (VQA)
— Unverified 0Improving Vision-and-Language Reasoning via Spatial Relations Modeling Nov 9, 2023 Position regression Relation
— Unverified 0Deep learning evaluation using deep linguistic processing Jun 5, 2017 Deep Learning Multimodal Deep Learning
— Unverified 0Deep Exemplar Networks for VQA and VQG Dec 19, 2019 Decoder Question Answering
— Unverified 0Benchmarking Large Multimodal Models for Ophthalmic Visual Question Answering with OphthalWeChat May 26, 2025 Benchmarking Question Answering
— Unverified 0Deep Equilibrium Multimodal Fusion Jun 29, 2023 Visual Question Answering (VQA)
— Unverified 0Deep Bayesian Active Learning for Multiple Correct Outputs Dec 2, 2019 Active Learning Answer Generation
— Unverified 0Being Negative but Constructively: Lessons Learnt from Creating Better Visual Question Answering Datasets Apr 24, 2017 Multiple-choice Question Answering
— Unverified 0Deep Attention Neural Tensor Network for Visual Question Answering Sep 1, 2018 Deep Attention Question Answering
— Unverified 0Decoupled Box Proposal and Featurization with Ultrafine-Grained Semantic Labels Improve Image Captioning and Visual Question Answering Sep 4, 2019 Image Captioning Object
— Unverified 0Decouple Before Interact: Multi-Modal Prompt Learning for Continual Visual Question Answering Jan 1, 2023 Continual Learning Language Modelling
— Unverified 0@Bench: Benchmarking Vision-Language Models for Human-centered Assistive Technology Sep 21, 2024 Benchmarking Depth Estimation
— Unverified 0``A Distorted Skull Lies in the Bottom Center...'' Identifying Paintings from Text Descriptions Jun 1, 2016 Question Answering Visual Question Answering (VQA)
— Unverified 0Improving Visual Question Answering by Referring to Generated Paragraph Captions Jun 14, 2019 Decoder Image Captioning
— Unverified 0Integrating Frequency-Domain Representations with Low-Rank Adaptation in Vision-Language Models Mar 8, 2025 Caption Generation Question Answering
— Unverified 0Interpretable Visual Question Answering via Reasoning Supervision Sep 7, 2023 Common Sense Reasoning Question Answering
— Unverified 0Improving Cross-Modal Understanding in Visual Dialog via Contrastive Learning Apr 15, 2022 Contrastive Learning Question Answering
— Unverified 0Improving Data Augmentation for Robust Visual Question Answering with Effective Curriculum Learning Jan 28, 2024 Data Augmentation Question Answering
— Unverified 0Improving Generalization in Visual Reasoning via Self-Ensemble Oct 28, 2024 Visual Question Answering (VQA) Visual Reasoning
— Unverified 0Debating for Better Reasoning: An Unsupervised Multimodal Approach May 20, 2025 Question Answering Visual Question Answering
— Unverified 0Bayesian Attention Belief Networks Jun 9, 2021 Decoder Machine Translation
— Unverified 0Dealing with Missing Modalities in the Visual Question Answer-Difference Prediction Task through Knowledge Distillation Apr 13, 2021 Knowledge Distillation Triplet
— Unverified 0DDRprog: A CLEVR Differentiable Dynamic Reasoning Programmer Mar 30, 2018 Question Answering Visual Question Answering
— Unverified 0BARTPhoBEiT: Pre-trained Sequence-to-Sequence and Image Transformers Models for Vietnamese Visual Question Answering Jul 28, 2023 Question Answering Vietnamese Visual Question Answering
— Unverified 0An Analysis of Visual Question Answering Algorithms Mar 28, 2017 Question Answering Visual Question Answering
— Unverified 0Improving Medical Reasoning with Curriculum-Aware Reinforcement Learning May 25, 2025 Out-of-Distribution Generalization reinforcement-learning
— Unverified 0DCVQE: A Hierarchical Transformer for Video Quality Assessment Oct 10, 2022 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 0Davidsonian Scene Graph: Improving Reliability in Fine-grained Evaluation for Text-to-Image Generation Oct 27, 2023 Image Generation Question Answering
— Unverified 0Improved Few-Shot Image Classification Through Multiple-Choice Questions Jul 23, 2024 Articles Few-Shot Image Classification
— Unverified 0PlotQA: Reasoning over Scientific Plots Sep 3, 2019 Chart Question Answering Question Answering
— Unverified 0Improved Bilinear Pooling with CNNs Jul 21, 2017 GPU Question Answering
— Unverified 0Improving and Diagnosing Knowledge-Based Visual Question Answering via Entity Enhanced Knowledge Injection Dec 13, 2021 Common Sense Reasoning Knowledge Graph Embeddings
— Unverified 0Data-Driven Calibration of Prediction Sets in Large Vision-Language Models Based on Inductive Conformal Prediction Apr 24, 2025 Conformal Prediction Hallucination
— Unverified 0BACON: Improving Clarity of Image Captions via Bag-of-Concept Graphs Jul 3, 2024 Image Captioning Image Generation
— Unverified 0Data Augmentation for Visual Question Answering Sep 1, 2017 Data Augmentation General Classification
— Unverified 0DARE: Diverse Visual Question Answering with Robustness Evaluation Sep 26, 2024 image-classification Image Classification
— Unverified 0Backdooring Vision-Language Models with Out-Of-Distribution Data Oct 2, 2024 Image Captioning Image to text
— Unverified 0A Comparative Evaluation of Temporal Pooling Methods for Blind Video Quality Assessment Feb 25, 2020 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 0Improving Automatic VQA Evaluation Using Large Language Models Oct 4, 2023 In-Context Learning Question Answering
— Unverified 0Improving mitosis detection on histopathology images using large vision-language models Oct 11, 2023 Domain Generalization Image Captioning
— Unverified 0Achieving Human Parity on Visual Question Answering Nov 17, 2021 Question Answering Visual Question Answering
— Unverified 0Analysis on Image Set Visual Question Answering Mar 31, 2021 Question Answering Visual Question Answering
— Unverified 0Image Semantic Relation Generation Oct 19, 2022 Image Retrieval Image Segmentation
— Unverified 0