Accounting for Focus Ambiguity in Visual Questions Jan 4, 2025 Question Answering Visual Question Answering
— Unverified 00 Accuracy vs. Complexity: A Trade-off in Visual Question Answering Models Jan 20, 2020 Question Answering Visual Question Answering
— Unverified 00 Achieving Human Parity on Visual Question Answering Nov 17, 2021 Question Answering Visual Question Answering
— Unverified 00 A Comparative Evaluation of Temporal Pooling Methods for Blind Video Quality Assessment Feb 25, 2020 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 00 A Comprehensive Evaluation of Multi-Modal Large Language Models for Endoscopy Analysis May 29, 2025 Diagnostic Visual Prompting
— Unverified 00 A Systematic Evaluation of GPT-4V's Multimodal Capability for Medical Image Analysis Oct 31, 2023 Descriptive Medical Image Analysis
— Unverified 00 A Comprehensive Survey of Knowledge-Based Vision Question Answering Systems: The Lifecycle of Knowledge in Visual Reasoning Task Apr 24, 2025 Question Answering Retrieval
— Unverified 00 A Comprehensive Survey on Visual Question Answering Datasets and Algorithms Nov 17, 2024 Diagnostic Miscellaneous
— Unverified 00 A Confidence-Based Interface for Neuro-Symbolic Visual Question Answering Nov 21, 2021 Question Answering Translation
— Unverified 00 A Corpus for Visual Question Answering Annotated with Frame Semantic Information May 1, 2020 Question Answering Visual Question Answering
— Unverified 00 A Corpus of Natural Language for Visual Reasoning Jul 1, 2017 Question Answering Visual Question Answering (VQA)
— Unverified 00 Action Verb Corpus May 1, 2018 Action Classification Language Acquisition
— Unverified 00 Actively Seeking and Learning from Live Data Apr 5, 2019 Domain Adaptation Meta-Learning
— Unverified 00 Ada-DQA: Adaptive Diverse Quality-aware Feature Acquisition for Video Quality Assessment Aug 1, 2023 Diversity Knowledge Distillation
— Unverified 00 A Dataset for Multimodal Question Answering in the Cultural Heritage Domain Dec 1, 2016 Question Answering Speech Recognition
— Unverified 00 A dataset of clinically generated visual questions and answers about radiology images Nov 20, 2018 Decision Making Medical Visual Question Answering
— Unverified 00 ``A Distorted Skull Lies in the Bottom Center...'' Identifying Paintings from Text Descriptions Jun 1, 2016 Question Answering Visual Question Answering (VQA)
— Unverified 00 Advancing Large Multi-modal Models with Explicit Chain-of-Reasoning and Visual Question Generation Jan 18, 2024 Caption Generation Language Modeling
— Unverified 00 Advancing Multimodal Medical Capabilities of Gemini May 6, 2024 Computed Tomography (CT) image-classification
— Unverified 00 Advancing Surgical VQA with Scene Graph Knowledge Dec 15, 2023 Question Answering Visual Question Answering
— Unverified 00 Advancing Video Quality Assessment for AIGC Sep 23, 2024 Image Generation Text Generation
— Unverified 00 AdvDreamer Unveils: Are Vision-Language Models Truly Ready for Real-World 3D Variations? Dec 4, 2024 Benchmarking Visual Question Answering (VQA)
— Unverified 00 Adventurer's Treasure Hunt: A Transparent System for Visually Grounded Compositional Visual Question Answering based on Scene Graphs Jun 28, 2021 Question Answering Task 2
— Unverified 00 Adversarial Attacks Beyond the Image Space Nov 20, 2017 Question Answering Visual Question Answering
— Unverified 00 Adversarial Multimodal Network for Movie Question Answering Jun 24, 2019 Question Answering Video Question Answering
— Unverified 00 Adversarial Regularization for Visual Question Answering: Strengths, Shortcomings, and Side Effects Jun 20, 2019 Question Answering Visual Question Answering
— Unverified 00 Adversarial Representation Learning for Text-to-Image Matching Aug 28, 2019 Image Captioning Language Modeling
— Unverified 00 Adversarial VQA: A New Benchmark for Evaluating the Robustness of VQA Models Jun 1, 2021 Data Augmentation Question Answering
— Unverified 00 Aesthetic Visual Question Answering of Photographs Aug 10, 2022 Question Answering Sentiment Analysis
— Unverified 00 A Focused Dynamic Attention Model for Visual Question Answering Apr 6, 2016 Question Answering Visual Question Answering
— Unverified 00 A Framework to Map VMAF with the Probability of Just Noticeable Difference between Video Encoding Recipes May 16, 2022 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 00 A Free Lunch in Generating Datasets: Building a VQG and VQA System with Attention and Humans in the Loop Nov 30, 2019 Question Answering Question Generation
— Unverified 00 A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions Mar 26, 2024 Gaze Target Estimation Question Answering
— Unverified 00 AGFSync: Leveraging AI-Generated Feedback for Preference Optimization in Text-to-Image Generation Mar 20, 2024 Image Generation Text to Image Generation
— Unverified 00 Aggregate-and-Adapt Natural Language Prompts for Downstream Generalization of CLIP Oct 31, 2024 Image Captioning Prompt Learning
— Unverified 00 A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models Nov 16, 2021 Language Modeling Language Modelling
— Unverified 00 AI2D-RST: A multimodal corpus of 1000 primary school science diagrams Dec 9, 2019 Question Answering Visual Question Answering
— Unverified 00 Aligned Dual Channel Graph Convolutional Network for Visual Question Answering Jul 1, 2020 Question Answering Visual Question Answering
— Unverified 00 Aligned Image-Word Representations Improve Inductive Transfer Across Vision-Language Tasks Apr 2, 2017 Multi-Task Learning Question Answering
— Unverified 00 Aligned Vector Quantization for Edge-Cloud Collabrative Vision-Language Models Nov 8, 2024 Quantization Question Answering
— Unverified 00 Aligning MAGMA by Few-Shot Learning and Finetuning Oct 18, 2022 Few-Shot Learning Image Captioning
— Unverified 00 Alignment, Mining and Fusion: Representation Alignment with Hard Negative Mining and Selective Knowledge Fusion for Medical Visual Question Answering Jan 1, 2025 Contrastive Learning Medical Visual Question Answering
— Unverified 00 AlignVE: Visual Entailment Recognition Based on Alignment Relations Nov 16, 2022 Question Answering Relation
— Unverified 00 All You May Need for VQA are Image Captions Jan 16, 2022 All Image Captioning
— Unverified 00 ``Look, some Green Circles!'': Learning to Quantify from Images Aug 1, 2016 Question Answering Visual Question Answering (VQA)
— Unverified 00 American == White in Multimodal Language-and-Image AI Jul 1, 2022 Image Captioning Question Answering
— Unverified 00 A Multimodal Memes Classification: A Survey and Open Research Issues Sep 17, 2020 Classification General Classification
— Unverified 00 Analysis of Visual Question Answering Algorithms with attention model May 4, 2023 Question Answering Visual Question Answering
— Unverified 00 Analysis on Image Set Visual Question Answering Mar 31, 2021 Question Answering Visual Question Answering
— Unverified 00 An Analysis of Visual Question Answering Algorithms Mar 28, 2017 Question Answering Visual Question Answering
— Unverified 00