Talking to the brain: Using Large Language Models as Proxies to Model Brain Semantic Representation Feb 26, 2025 Question Answering valid
— Unverified 00 Task-driven Visual Saliency and Attention-based Visual Question Answering Feb 22, 2017 Question Answering Visual Question Answering
— Unverified 00 Task Formulation Matters When Learning Continuously: A Case Study in Visual Question Answering Jan 16, 2022 Continual Learning Incremental Learning
— Unverified 00 Task-Oriented Multi-User Semantic Communications Dec 19, 2021 Image Retrieval Machine Translation
— Unverified 00 Task-Oriented Semantic Communication in Large Multimodal Models-based Vehicle Networks May 5, 2025 Question Answering Semantic Communication
— Unverified 00 Task Progressive Curriculum Learning for Robust Visual Question Answering Nov 26, 2024 Data Augmentation Ensemble Learning
— Unverified 00 TA-Student VQA: Multi-Agents Training by Self-Questioning Jun 1, 2020 Diversity Question Answering
— Unverified 00 TDVE-Assessor: Benchmarking and Evaluating the Quality of Text-Driven Video Editing with LMMs May 26, 2025 Benchmarking Large Language Model
— Unverified 00 Tell-and-Answer: Towards Explainable Visual Question Answering using Attributes and Captions Jan 27, 2018 Attribute Image Captioning
— Unverified 00 Tell Me the Evidence? Dual Visual-Linguistic Interaction for Answer Grounding Jun 21, 2022 Decoder Question Answering
— Unverified 00 Test-Time Adaptation for Visual Document Understanding Jun 15, 2022 document understanding Domain Adaptation
— Unverified 00 Text-Aware Dual Routing Network for Visual Question Answering Nov 17, 2022 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 00 Text-Conditioned Generative Model of 3D Strand-based Human Hairstyles Jan 1, 2024 Question Answering Visual Question Answering
— Unverified 00 Text Guided Person Image Synthesis Apr 10, 2019 Attribute Image Generation
— Unverified 00 TextMatch: Enhancing Image-Text Consistency Through Multimodal Optimization Dec 24, 2024 In-Context Learning Question Answering
— Unverified 00 DuReader_vis: A Chinese Dataset for Open-domain Document Visual Question Answering May 1, 2022 document understanding Open-Domain Question Answering
— Unverified 00 TextSquare: Scaling up Text-Centric Visual Instruction Tuning Apr 19, 2024 Hallucination Hallucination Evaluation
— Unverified 00 Textually Enriched Neural Module Networks for Visual Question Answering Sep 23, 2018 Image Captioning Question Answering
— Unverified 00 TG-VQA: Ternary Game of Video Question Answering May 17, 2023 Contrastive Learning Question Answering
— Unverified 00 The Color of the Cat is Gray: 1 Million Full-Sentences Visual Question Answering (FSVQA) Sep 21, 2016 Question Answering Sentence
— Unverified 00 The curse of language biases in remote sensing VQA: the role of spatial attributes, language diversity, and the need for clear evaluation Nov 28, 2023 Diversity Question Answering
— Unverified 00 The Development of Multimodal Lexical Resources Dec 1, 2016 Question Answering Visual Question Answering (VQA)
— Unverified 00 The Forgettable-Watcher Model for Video Question Answering May 3, 2017 model Question Answering
— Unverified 00 The Impact of Explanations on AI Competency Prediction in VQA Jul 2, 2020 AI Agent Language Modeling
— Unverified 00 The meaning of "most" for visual question answering models Dec 31, 2018 Question Answering Visual Question Answering
— Unverified 00 The Meaning of ``Most'' for Visual Question Answering Models Aug 1, 2019 Question Answering Visual Question Answering
— Unverified 00 The Quest for Visual Understanding: A Journey Through the Evolution of Visual Question Answering Jan 13, 2025 Common Sense Reasoning Question Answering
— Unverified 00 The VQA-Machine: Learning How to Use Existing Vision Algorithms to Answer New Questions Dec 16, 2016 BIG-bench Machine Learning Question Answering
— Unverified 00 The Wisdom of MaSSeS: Majority, Subjectivity, and Semantic Similarity in the Evaluation of VQA Sep 12, 2018 Question Answering Semantic Similarity
— Unverified 00 TinyDrive: Multiscale Visual Question Answering with Selective Token Routing for Autonomous Driving May 21, 2025 Autonomous Driving Question Answering
— Unverified 00 TinyRS-R1: Compact Multimodal Language Model for Remote Sensing May 17, 2025 Language Modeling Language Modelling
— Unverified 00 TinyVQA: Compact Multimodal Deep Neural Network for Visual Question Answering on Resource-Constrained Devices Apr 4, 2024 Quantization Question Answering
— Unverified 00 TM-PATHVQA:90000+ Textless Multilingual Questions for Medical Visual Question Answering Jul 16, 2024 Medical Visual Question Answering Question Answering
— Unverified 00 TokenFocus-VQA: Enhancing Text-to-Image Alignment with Position-Aware Focus and Multi-Perspective Aggregations on LVLMs Apr 10, 2025 Ensemble Learning Position
— Unverified 00 Toward 3D Spatial Reasoning for Human-like Text-based Visual Question Answering Sep 21, 2022 Image Captioning Optical Character Recognition (OCR)
— Unverified 00 Toward Effective Reinforcement Learning Fine-Tuning for Medical VQA in Vision-Language Models May 20, 2025 Medical Visual Question Answering Question Answering
— Unverified 00 Toward Robust Hyper-Detailed Image Captioning: A Multiagent Approach and Dual Evaluation Metrics for Factuality and Coverage Dec 20, 2024 Attribute Benchmarking
— Unverified 00 Towards a Unified Model for Generating Answers and Explanations in Visual Question Answering Jan 25, 2023 Decoder Explanation Generation
— Unverified 00 Towards Automated Error Analysis: Learning to Characterize Errors Jan 13, 2022 Common Sense Reasoning Meta-Learning
— Unverified 00 Towards Causal VQA: Revealing and Reducing Spurious Correlations by Invariant and Covariant Semantic Editing Dec 16, 2019 Question Answering Visual Question Answering
— Unverified 00 Towards Complex Document Understanding By Discrete Reasoning Jul 25, 2022 document understanding Question Answering
— Unverified 00 Towards Developing a Multilingual and Code-Mixed Visual Question Answering System by Knowledge Distillation Sep 10, 2021 Knowledge Distillation Question Answering
— Unverified 00 Towards Escaping from Language Bias and OCR Error: Semantics-Centered Text Visual Question Answering Mar 24, 2022 Optical Character Recognition Optical Character Recognition (OCR)
— Unverified 00 Towards General Purpose Vision Systems: An End-to-End Task-Agnostic Vision-Language Architecture Jan 1, 2022 Question Answering Visual Question Answering
— Unverified 00 Towards Human-Level Understanding of Complex Process Engineering Schematics: A Pedagogical, Introspective Multi-Agent Framework for Open-Domain Question Answering Aug 24, 2024 knowledge editing Open-Domain Question Answering
— Unverified 00 Towards Models that Can See and Read Jan 18, 2023 Decoder Image Captioning
— Unverified 00 Towards Reasoning-Aware Explainable VQA Nov 9, 2022 Decoder Explanation Generation
— Unverified 00 Towards Top-Down Reasoning: An Explainable Multi-Agent Approach for Visual Question Answering Nov 29, 2023 Common Sense Reasoning Question Answering
— Unverified 00 Towards Transparent AI Systems: Interpreting Visual Question Answering Models Aug 31, 2016 Question Answering Visual Question Answering
— Unverified 00 Towards Unsupervised Visual Reasoning: Do Off-The-Shelf Features Know How to Reason? Dec 20, 2022 Question Answering Representation Learning
— Unverified 00