Proposal-free One-stage Referring Expression via Grid-Word Cross-Attention May 5, 2021 Question Answering Referring Expression
— Unverified 0Proposing Plausible Answers for Open-ended Visual Question Answering Oct 20, 2016 Graph Matching Open-Ended Question Answering
— Unverified 0Provoking Multi-modal Few-Shot LVLM via Exploration-Exploitation In-Context Learning Jun 11, 2025 In-Context Learning Question Answering
— Unverified 0Proxy-FDA: Proxy-based Feature Distribution Alignment for Fine-tuning Vision Foundation Models without Forgetting May 30, 2025 image-classification Image Classification
— Unverified 0Psycholinguistics meets Continual Learning: Measuring Catastrophic Forgetting in Visual Question Answering Jun 10, 2019 Continual Learning Question Answering
— Unverified 0PTM-VQA: Efficient Video Quality Assessment Leveraging Diverse PreTrained Models from the Wild May 28, 2024 Video Quality Assessment Visual Question Answering (VQA)
— Unverified 0Pushing the Limits of Radiology with Joint Modeling of Visual and Textual Information Jul 1, 2018 Image Classification Machine Translation
— Unverified 0PuzzleBench: A Fully Dynamic Evaluation Framework for Large Multimodal Models on Puzzle Solving Apr 15, 2025 Logical Reasoning Visual Question Answering (VQA)
— Unverified 0Pyramid Coder: Hierarchical Code Generator for Compositional Visual Question Answering Jul 30, 2024 Code Generation Question Answering
— Unverified 0Q2ATransformer: Improving Medical VQA via an Answer Querying Decoder Apr 4, 2023 Classification Decoder
— Unverified 0Q-Boost: On Visual Quality Assessment Ability of Low-level Multi-Modality Foundation Models Dec 23, 2023 Image Quality Assessment Video Quality Assessment
— Unverified 0QIRL: Boosting Visual Question Answering via Optimized Question-Image Relation Learning Apr 4, 2025 Data Augmentation Image Generation
— Unverified 0QSAN: A Near-term Achievable Quantum Self-Attention Network Jul 14, 2022 Binary Classification image-classification
— Unverified 0QTG-VQA: Question-Type-Guided Architectural for VideoQA Systems Sep 14, 2024 Question Answering Video Question Answering
— Unverified 0Quality Prediction of AI Generated Images and Videos: Emerging Trends and Opportunities Oct 11, 2024 Denoising Image Quality Assessment
— Unverified 0Question-Agnostic Attention for Visual Question Answering Aug 9, 2019 Question Answering Visual Question Answering
— Unverified 0Question-Conditioned Counterfactual Image Generation for VQA Nov 14, 2019 counterfactual Image Generation
— Unverified 0Question-Driven Graph Fusion Network For Visual Question Answering Apr 3, 2022 Graph Attention Object
— Unverified 0Question Generation for Evaluating Cross-Dataset Shifts in Multi-modal Grounding Jan 24, 2022 Question Answering Question Generation
— Unverified 0Question-Guided Hybrid Convolution for Visual Question Answering Aug 8, 2018 Question Answering Visual Question Answering
— Unverified 0Question Guided Modular Routing Networks for Visual Question Answering Apr 17, 2019 Question Answering Visual Question Answering
— Unverified 0Question-Led Semantic Structure Enhanced Attentions for VQA Nov 16, 2021 Question Answering Visual Question Answering
— Unverified 0Question Modifiers in Visual Question Answering Jun 1, 2022 Natural Language Understanding Question Answering
— Unverified 0Question Relevance in Visual Question Answering Jul 23, 2018 Question Answering Visual Question Answering
— Unverified 0Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions Jun 21, 2016 Question Answering Question Similarity
— Unverified 0Question Type Guided Attention in Visual Question Answering Apr 6, 2018 Activity Recognition Question Answering
— Unverified 0R^3-VQA: "Read the Room" by Video Social Reasoning May 7, 2025 State Estimation Visual Question Answering (VQA)
— Unverified 0RankDVQA-mini: Knowledge Distillation-Driven Deep Video Quality Assessment Dec 14, 2023 Knowledge Distillation Model Compression
— Unverified 0RAVEN: A Dataset for Relational and Analogical Visual rEasoNing Mar 7, 2019 Object Recognition Question Answering
— Unverified 0RAVEN: Multitask Retrieval Augmented Vision-Language Learning Jun 27, 2024 Image Captioning RAG
— Unverified 0Reactive Multi-Stage Feature Fusion for Multimodal Dialogue Modeling Aug 14, 2019 Question Answering Scene-Aware Dialogue
— Unverified 0Realizing Visual Question Answering for Education: GPT-4V as a Multimodal AI May 12, 2024 Question Answering Visual Question Answering
— Unverified 0Reasoning LLMs for User-Aware Multimodal Conversational Agents Apr 2, 2025 RAG Retrieval-augmented Generation
— Unverified 0Reasoning Over History: Context Aware Visual Dialog Nov 2, 2020 coreference-resolution Coreference Resolution
— Unverified 0Reasoning over Vision and Language: Exploring the Benefits of Supplemental Knowledge Jan 15, 2021 Question Answering Visual Question Answering (VQA)
— Unverified 0Recent Advances in Video Question Answering: A Review of Datasets and Methods Jan 15, 2021 Information Retrieval Machine Translation
— Unverified 0Recent, rapid advancement in visual question answering architecture: a review Mar 2, 2022 Question Answering Visual Question Answering
— Unverified 0Reciprocal Attention Fusion for Visual Question Answering May 11, 2018 Object Question Answering
— Unverified 0Recurrent and Contextual Models for Visual Question Answering Mar 23, 2017 Diversity Multiple-choice
— Unverified 0Reducing Hallucinations: Enhancing VQA for Flood Disaster Damage Assessment with Visual Contexts Dec 21, 2023 Hallucination Question Answering
— Unverified 0Reducing Language Biases in Visual Question Answering with Visually-Grounded Question Encoder Jul 13, 2020 Question Answering Visual Grounding
— Unverified 0Regularizing Attention Networks for Anomaly Detection in Visual Question Answering Sep 21, 2020 Anomaly Detection Question Answering
— Unverified 0Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models Apr 18, 2024 GSM8K MMLU
— Unverified 0Rephrasing visual questions by specifying the entropy of the answer distribution Apr 10, 2020 Question Answering Visual Question Answering
— Unverified 0Representation, Learning and Reasoning on Spatial Language for Downstream NLP Tasks Nov 1, 2020 Common Sense Reasoning Question Answering
— Unverified 0Representing Movie Characters in Dialogues Nov 1, 2019 Question Answering Relation Classification
— Unverified 0Reproducibility Report for "Learning To Count Objects In Natural Images For Visual Question Answering" May 21, 2018 Question Answering Visual Question Answering
— Unverified 0RepsNet: Combining Vision with Language for Automated Medical Reports Sep 27, 2022 Contrastive Learning Decoder
— Unverified 0RescueADI: Adaptive Disaster Interpretation in Remote Sensing Images with Autonomous Agents Oct 17, 2024 Question Answering Task Planning
— Unverified 0Reassessing Evaluation Practices in Visual Question Answering: A Case Study on Out-of-Distribution Generalization May 24, 2022 Image Captioning Out-of-Distribution Generalization
— Unverified 0