Overcoming Language Bias in Remote Sensing Visual Question Answering via Adversarial Training Jun 1, 2023 Question Answering Visual Question Answering
— Unverified 00 Overcoming Language Priors for Visual Question Answering Based on Knowledge Distillation Jan 10, 2025 Knowledge Distillation Question Answering
— Unverified 00 Overcoming Language Priors in Visual Question Answering with Adversarial Regularization Oct 8, 2018 Question Answering Visual Grounding
— Unverified 00 OVQA: A Clinically Generated Visual Question Answering Dataset Jul 7, 2022 Benchmarking Medical Visual Question Answering
— Unverified 00 PaLI: A Jointly-Scaled Multilingual Language-Image Model Sep 14, 2022 Decoder Few-Shot Image Classification
— Unverified 00 PaLM2-VAdapter: Progressively Aligned Language Model Makes a Strong Vision-language Adapter Feb 16, 2024 Language Modeling Language Modelling
— Unverified 00 PAM: Understanding Product Images in Cross Product Category Attribute Extraction Jun 8, 2021 Attribute Attribute Extraction
— Unverified 00 NAPA: Intermediate-level Variational Native-pulse Ansatz for Variational Quantum Algorithms Aug 2, 2022 Neural Architecture Search Visual Question Answering (VQA)
— Unverified 00 Parameter-Parallel Distributed Variational Quantum Algorithm Jul 31, 2022 Visual Question Answering (VQA)
— Unverified 00 ParsVQA-Caps: A Benchmark for Visual Question Answering and Image Captioning in Persian Dec 7, 2022 Image Captioning Question Answering
— Unverified 00 Pathological Visual Question Answering Oct 6, 2020 AI Agent Question Answering
— Unverified 00 PathVLM-R1: A Reinforcement Learning-Driven Reasoning Model for Pathology Visual-Language Tasks Apr 12, 2025 Computed Tomography (CT) Question Answering
— Unverified 00 PDF-MVQA: A Dataset for Multimodal Information Retrieval in PDF-based Visual Question Answering Apr 19, 2024 Articles Information Retrieval
— Unverified 00 PDFVQA: A New Dataset for Real-World VQA on PDF Documents Apr 13, 2023 document understanding Key Information Extraction
— Unverified 00 Perception Test 2024: Challenge Summary and a Novel Hour-Long VideoQA Benchmark Nov 29, 2024 Benchmarking Grounded Video Question Answering
— Unverified 00 Perceptual Quality Assessment of UGC Gaming Videos Mar 31, 2022 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 00 Performance Analysis of Traditional VQA Models Under Limited Computational Resources Feb 9, 2025 Question Answering Visual Question Answering
— Unverified 00 PhyBlock: A Progressive Benchmark for Physical Understanding and Planning via 3D Block Assembly Jun 10, 2025 Question Answering Scene Understanding
— Unverified 00 PiggyBack: Pretrained Visual Question Answering Environment for Backing up Non-deep Learning Professionals Nov 29, 2022 Deep Learning Question Answering
— Unverified 00 PlanGPT-VL: Enhancing Urban Planning with Domain-Specific Vision-Language Models May 20, 2025 Visual Question Answering (VQA)
— Unverified 00 Playing Lottery Tickets with Vision and Language Apr 23, 2021 Image-text Retrieval Question Answering
— Unverified 00 Polar-VQA: Visual Question Answering on Remote Sensed Ice sheet Imagery from Polar Region Mar 13, 2023 Question Answering Visual Question Answering
— Unverified 00 Precision Empowers, Excess Distracts: Visual Question Answering With Dynamically Infused Knowledge In Language Models Jun 14, 2024 Decoder Knowledge Graphs
— Unverified 00 Predicting Relative Depth between Objects from Semantic Features Jan 12, 2021 Question Answering Visual Question Answering
— Unverified 00 PreSTU: Pre-Training for Scene-Text Understanding Sep 12, 2022 Decoder Image Captioning
— Unverified 00 Pre-training image-language transformers for open-vocabulary tasks Sep 9, 2022 Question Answering Visual Entailment
— Unverified 00 Priorformer: A UGC-VQA Method with content and distortion priors Jun 24, 2024 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 00 Privacy-Aware Visual Language Models May 27, 2024 Visual Question Answering (VQA)
— Unverified 00 Privacy Preserving Visual Question Answering Feb 15, 2022 Privacy Preserving Question Answering
— Unverified 00 PRNet: A Progressive Regression Network for No-Reference User-Generated-Content Video Quality Assessment Sep 29, 2021 regression Video Quality Assessment
— Unverified 00 Probabilistic Neural-symbolic Models for Interpretable Visual Question Answering Feb 21, 2019 counterfactual Question Answering
— Unverified 00 Probing Inter-modality: Visual Parsing with Self-Attention for Vision-Language Pre-training Jun 25, 2021 Image-text Retrieval Question Answering
— Unverified 00 Probing Inter-modality: Visual Parsing with Self-Attention for Vision-and-Language Pre-training May 21, 2021 Question Answering Relation
— Unverified 00 Probing the Role of Positional Information in Vision-Language Models Jan 16, 2022 Contrastive Learning Image-text matching
— Unverified 00 Probing Visual Language Priors in VLMs Dec 31, 2024 Question Answering Visual Question Answering
— Unverified 00 ProcTag: Process Tagging for Assessing the Efficacy of Document Instruction Data Jul 17, 2024 Question Answering Visual Question Answering
— Unverified 00 Progressive Attention Memory Network for Movie Story Question Answering Apr 18, 2019 Question Answering Video Story QA
— Unverified 00 Prolonged Reasoning Is Not All You Need: Certainty-Based Adaptive Routing for Efficient LLM/MLLM Reasoning May 21, 2025 All Visual Question Answering (VQA)
— Unverified 00 Prompt-based Personalized Federated Learning for Medical Visual Question Answering Feb 15, 2024 Federated Learning Medical Visual Question Answering
— Unverified 00 PromptCap: Prompt-Guided Image Captioning for VQA with GPT-3 Jan 1, 2023 Image Captioning Question Answering
— Unverified 00 Prompting Large Language Models with Rationale Heuristics for Knowledge-based Visual Question Answering Dec 22, 2024 Question Answering Visual Question Answering
— Unverified 00 Prompt Tuning for Generative Multimodal Pretrained Models Aug 4, 2022 Image Captioning Visual Entailment
— Unverified 00 Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention May 5, 2021 Question Answering Referring Expression
— Unverified 00 Proposing Plausible Answers for Open-ended Visual Question Answering Oct 20, 2016 Graph Matching Open-Ended Question Answering
— Unverified 00 Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning Jun 11, 2025 In-Context Learning Question Answering
— Unverified 00 Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting May 30, 2025 image-classification Image Classification
— Unverified 00 Psycholinguistics meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering Jun 10, 2019 Continual Learning Question Answering
— Unverified 00 PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild May 28, 2024 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 00 Pushing the Limits of Radiology with Joint Modeling of Visual and Textual Information Jul 1, 2018 Image Classification Machine Translation
— Unverified 00 PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving Apr 15, 2025 Logical Reasoning Visual Question Answering (VQA)
— Unverified 00