Video Question Generation via Cross-Modal Self-Attention Networks Learning Jul 5, 2019 Diversity Question Answering
— Unverified 00 VideoCoCa: Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners Dec 9, 2022 Question Answering Retrieval
— Unverified 00 Video Understanding as Machine Translation Jun 12, 2020 Machine Translation Metric Learning
— Unverified 00 Vietnamese Legal Information Retrieval in Question-Answering System Sep 5, 2024 Hallucination Information Retrieval
— Unverified 00 ViLMedic: a framework for research at the intersection of vision and language in medical AI May 1, 2022 Medical Visual Question Answering Question Answering
— Unverified 00 VilNMN: A Neural Module Network approach to Video-Grounded Language Tasks Jan 1, 2021 Information Retrieval Question Answering
— Unverified 00 Vi-Mistral-X: Building a Vietnamese Language Model with Advanced Continual Pre-training Mar 20, 2024 Language Modeling Language Modelling
— Unverified 00 Vintern-1B: An Efficient Multimodal Large Language Model for Vietnamese Aug 22, 2024 Language Modeling Language Modelling
— Unverified 00 VisDoM: Multi-Document QA with Visually Rich Elements Using Multimodal Retrieval-Augmented Generation Dec 14, 2024 Question Answering RAG
— Unverified 00 Vision-Amplified Semantic Entropy for Hallucination Detection in Medical Visual Question Answering Mar 26, 2025 Diagnostic Hallucination
— Unverified 00 Vision and Language: from Visual Perception to Content Creation Dec 26, 2019 Decoder Question Answering
— Unverified 00 Vision and Language Integration: Moving beyond Objects Jan 1, 2017 Action Classification Image Captioning
— Unverified 00 Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It Jul 17, 2025 Question Answering
— Unverified 00 VisionGPT: Vision-Language Understanding Agent Using Generalized Multimodal Framework Mar 14, 2024 Language Modeling Language Modelling
— Unverified 00 Vision Language Model for Interpretable and Fine-grained Detection of Safety Compliance in Diverse Workplaces Aug 13, 2024 Attribute Language Modeling
— Unverified 00 Vision-Language Models as Success Detectors Mar 13, 2023 Question Answering Visual Question Answering
— Unverified 00 Vision Language Models Can Parse Floor Plan Maps Sep 19, 2024 Image Captioning Question Answering
— Unverified 00 Vision-Language Models for Edge Networks: A Comprehensive Survey Feb 11, 2025 Autonomous Vehicles Image Captioning
— Unverified 00 Vision-Language Models Struggle to Align Entities across Modalities Mar 5, 2025 Attribute Code Generation
— Unverified 00 Vision-Language Pretraining: Current Trends and the Future May 1, 2022 Question Answering Representation Learning
— Unverified 00 Vision LLMs Are Bad at Hierarchical Visual Understanding, and LLMs Are the Bottleneck May 30, 2025 Question Answering Visual Question Answering
— Unverified 00 Vision-to-Language Tasks Based on Attributes and Attention Mechanism May 29, 2019 Image Captioning Question Answering
— Unverified 00 VisKE: Visual Knowledge Extraction and Question Answering by Visual Verification of Relation Phrases Jun 1, 2015 Question Answering Relation
— Unverified 00 VisKoP: Visual Knowledge oriented Programming for Interactive Knowledge Base Question Answering Jul 6, 2023 Knowledge Base Question Answering Program induction
— Unverified 00 VISREAS: Complex Visual Reasoning with Unanswerable Questions Feb 23, 2024 Question Answering Visual Question Answering
— Unverified 00 VisScience: An Extensive Benchmark for Evaluating K12 Educational Multi-modal Scientific Reasoning Sep 10, 2024 Question Answering Visual Question Answering
— Unverified 00 VISTA-LLAMA: Reducing Hallucination in Video Language Models via Equal Distance to Visual Tokens Jan 1, 2024 Hallucination Position
— Unverified 00 Vista-LLaMA: Reliable Video Narrator via Equal Distance to Visual Tokens Dec 12, 2023 Hallucination Position
— Unverified 00 Visual7W: Grounded Question Answering in Images Nov 11, 2015 Multiple-choice Multiple Choice Question Answering (MCQA)
— Unverified 00 Visual Attention Model for Name Tagging in Multimodal Social Media Jul 1, 2018 Natural Language Understanding Question Answering
— Unverified 00 Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings May 3, 2023 Data Augmentation Question Answering
— Unverified 00 Visual Commonsense based Heterogeneous Graph Contrastive Learning Nov 11, 2023 Contrastive Learning Question Answering
— Unverified 00 Visual Entailment: A Novel Task for Fine-Grained Image Understanding Jan 20, 2019 Natural Language Inference Question Answering
— Unverified 00 Visual Entailment Task for Visually-Grounded Language Learning Nov 26, 2018 Grounded language learning Natural Language Inference
— Unverified 00 Visual Environment-Interactive Planning for Embodied Complex-Question Answering Apr 1, 2025 Question Answering Task Planning
— Unverified 00 Visual Explanations from Hadamard Product in Multimodal Deep Networks Dec 18, 2017 Question Answering Visual Question Answering
— Unverified 00 Visual Graph Question Answering with ASP and LLMs for Language Parsing Feb 13, 2025 Graph Question Answering Optical Character Recognition
— Unverified 00 Visual Grounding Strategies for Text-Only Natural Language Processing Mar 25, 2021 Image Retrieval Language Modeling
— Unverified 00 Visual Hallucination: Definition, Quantification, and Prescriptive Remediations Mar 26, 2024 Hallucination Image Captioning
— Unverified 00 Visual Instruction Bottleneck Tuning May 20, 2025 Hallucination Object Hallucination
— Unverified 00 Visualizing Sentiment Analysis on a User Forum May 1, 2012 Opinion Mining Question Answering
— Unverified 00 Visually Guided Spatial Relation Extraction from Text Jun 1, 2018 Activity Recognition Image Captioning
— Unverified 00 Visual Madlibs: Fill in the Blank Description Generation and Question Answering Dec 1, 2015 Multiple-choice Question Answering
— Unverified 00 Visual Madlibs: Fill in the blank Image Generation and Question Answering May 31, 2015 Image Generation Multiple-choice
— Unverified 00 Visual Perturbation-aware Collaborative Learning for Overcoming the Language Prior Problem Jul 24, 2022 Diagnostic Question Answering
— Unverified 00 Visual Question Answering as a Meta Learning Task Nov 22, 2017 Meta-Learning Question Answering
— Unverified 00 Visual Question Answering as a Multi-Task Problem Jul 3, 2020 Question Answering Visual Question Answering
— Unverified 00 Visual Question Answering as Reading Comprehension Nov 29, 2018 Common Sense Reasoning General Knowledge
— Unverified 00 Visual Question Answering: A Survey on Techniques and Common Trends in Recent Literature May 18, 2023 Question Answering Visual Question Answering
— Unverified 00 Visual question answering based evaluation metrics for text-to-image generation Nov 15, 2024 Image Generation Image Manipulation
— Unverified 00