Instruction-augmented Multimodal Alignment for Image-Text and Element Matching Apr 16, 2025 Image Augmentation Image Generation
— Unverified 00 Integrating Frequency-Domain Representations with Low-Rank Adaptation in Vision-Language Models Mar 8, 2025 Caption Generation Question Answering
— Unverified 00 Integrating Knowledge and Reasoning in Image Understanding Jun 24, 2019 Object Recognition Question Answering
— Unverified 00 Interactive Attention AI to translate low light photos to captions for night scene understanding in women safety Jan 4, 2022 Decoder Deep Learning
— Unverified 00 Interactive Visual Task Learning for Robots Dec 20, 2023 Continual Learning Novel Concepts
— Unverified 00 Dynamic Clue Bottlenecks: Towards Interpretable-by-Design Visual Question Answering May 24, 2023 Question Answering Visual Question Answering
— Unverified 00 Interpretable Counting for Visual Question Answering Dec 23, 2017 Question Answering Visual Question Answering
— Unverified 00 Interpretable Face Anti-Spoofing: Enhancing Generalization with Multimodal Large Language Models Jan 3, 2025 Binary Classification Face Anti-Spoofing
— Unverified 00 Interpretable Medical Image Visual Question Answering via Multi-Modal Relationship Graph Learning Feb 19, 2023 Graph Learning Medical Visual Question Answering
— Unverified 00 Interpretable Neural Computation for Real-World Compositional Visual Question Answering Oct 10, 2020 Question Answering Visual Question Answering
— Unverified 00 Interpretable Visual Question Answering Referring to Outside Knowledge Mar 8, 2023 Diversity Image Captioning
— Unverified 00 Interpretable Visual Question Answering by Reasoning on Dependency Trees Sep 6, 2018 Question Answering valid
— Unverified 00 Interpretable Visual Question Answering by Visual Grounding from Attention Supervision Mining Aug 1, 2018 Question Answering Visual Grounding
— Unverified 00 Interpretable Visual Question Answering via Reasoning Supervision Sep 7, 2023 Common Sense Reasoning Question Answering
— Unverified 00 Interpretable Visual Reasoning via Probabilistic Formulation under Natural Supervision Aug 1, 2020 Question Answering Visual Question Answering
— Unverified 00 Inverse Visual Question Answering: A New Benchmark and VQA Diagnosis Tool Mar 16, 2018 Question Answering Reinforcement Learning
— Unverified 00 Inverse Visual Question Answering with Multi-Level Attentions Sep 17, 2019 Question Answering Visual Question Answering
— Unverified 00 Investigating Biases in Textual Entailment Datasets Jun 23, 2019 BIG-bench Machine Learning Natural Language Inference
— Unverified 00 Investigating layer-selective transfer learning of QAOA parameters for Max-Cut problem Dec 30, 2024 Combinatorial Optimization Transfer Learning
— Unverified 00 ISAAQ -- Mastering Textbook Questions with Pre-trained Transformers and Bottom-Up and Top-Down Attention Oct 1, 2020 Multiple-choice Question Answering
— Unverified 00 ISAAQ - Mastering Textbook Questions with Pre-trained Transformers and Bottom-Up and Top-Down Attention Nov 1, 2020 Multiple-choice Question Answering
— Unverified 00 Is Cognition consistent with Perception? Assessing and Mitigating Multimodal Knowledge Conflicts in Document Understanding Nov 12, 2024 document understanding Optical Character Recognition (OCR)
— Unverified 00 Is GPT-3 all you need for Visual Question Answering in Cultural Heritage? Jul 25, 2022 All Question Answering
— Unverified 00 Iterated learning for emergent systematicity in VQA May 3, 2021 Question Answering Systematic Generalization
— Unverified 00 It Takes Two to Tango: Towards Theory of AI's Mind Apr 3, 2017 Attribute Question Answering
— Unverified 00 iVQA: Inverse Visual Question Answering Oct 10, 2017 Question Answering Question Generation
— Unverified 00 Jaeger: A Concatenation-Based Multi-Transformer VQA Model Oct 11, 2023 Dimensionality Reduction model
— Unverified 00 Joint Image Captioning and Question Answering May 22, 2018 Image Captioning Question Answering
— Unverified 00 Joint learning of object graph and relation graph for visual question answering May 9, 2022 Attribute Graph Neural Network
— Unverified 00 Jointly Learning Truth-Conditional Denotations and Groundings using Parallel Attention Apr 14, 2021 Question Answering Visual Question Answering
— Unverified 00 JTD-UAV: MLLM-Enhanced Joint Tracking and Description Framework for Anti-UAV Systems Jan 1, 2025 Question Answering Visual Question Answering
— Unverified 00 `Just because you are right, doesn't mean I am wrong': Overcoming a bottleneck in development and evaluation of Open-Ended VQA tasks Apr 1, 2021 Question Answering Visual Question Answering
— Unverified 00 KAT: A Knowledge Augmented Transformer for Vision-and-Language Jan 16, 2022 Answer Generation Decoder
— Unverified 00 Kernel Pooling for Convolutional Neural Networks Jul 1, 2017 Face Recognition Fine-Grained Visual Categorization
— Unverified 00 Generating and Evaluating Explanations of Attended and Error-Inducing Input Regions for VQA Models Mar 26, 2021 Question Answering Visual Question Answering
— Unverified 00 Knowing Where to Look? Analysis on Attention of Visual Question Answering System Oct 9, 2018 Question Answering Visual Question Answering
— Unverified 00 KnowIT VQA: Answering Knowledge-Based Questions about Videos Oct 23, 2019 Question Answering Video Question Answering
— Unverified 00 Knowledge Acquisition for Visual Question Answering via Iterative Querying Jul 1, 2017 Question Answering Visual Question Answering
— Unverified 00 Knowledge-Based Counterfactual Queries for Visual Question Answering Mar 5, 2023 counterfactual Decision Making
— Unverified 00 Knowledge-Based Visual Question Answering in Videos Apr 17, 2020 Question Answering Video Question Answering
— Unverified 00 Knowledge Condensation and Reasoning for Knowledge-based VQA Mar 15, 2024 Question Answering Reading Comprehension
— Unverified 00 Knowledge Detection by Relevant Question and Image Attributes in Visual Question Answering Jun 8, 2023 Question Answering Retrieval
— Unverified 00 KNVQA: A Benchmark for evaluation knowledge-based VQA Nov 21, 2023 Hallucination Object Hallucination
— Unverified 00 KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA Dec 20, 2020 Visual Question Answering (VQA)
— Unverified 00 KVL-BERT: Knowledge Enhanced Visual-and-Linguistic BERT for Visual Commonsense Reasoning Dec 13, 2020 Sentence Visual Commonsense Reasoning
— Unverified 00 KVQA: Knowledge-Aware Visual Question Answering Jul 17, 2019 Knowledge Graphs Question Answering
— Unverified 00 Language bias in Visual Question Answering: A Survey and Taxonomy Nov 16, 2021 Question Answering Visual Question Answering
— Unverified 00 Language Features Matter: Effective Language Representations for Vision-Language Tasks Aug 17, 2019 Image Captioning Language Modelling
— Unverified 00 Language Models are General-Purpose Interfaces Jun 13, 2022 Causal Language Modeling Few-Shot Learning
— Unverified 00 LAPDoc: Layout-Aware Prompting for Documents Feb 15, 2024 document understanding Key Information Extraction
— Unverified 00