Towards Visual Dialog for Radiology Jul 1, 2020 Question Answering Visual Dialog
— Unverified 00 Toward Unsupervised Realistic Visual Question Answering Mar 9, 2023 Question Answering Visual Question Answering
— Unverified 00 Training Recurrent Answering Units with Joint Loss Minimization for VQA Jun 12, 2016 Question Answering Visual Question Answering
— Unverified 00 Transfer Learning in Visual and Relational Reasoning Nov 27, 2019 Question Answering Relational Reasoning
— Unverified 00 Transferring Visual Attributes from Natural Language to Verified Image Generation May 24, 2023 Image Generation Text to Image Generation
— Unverified 00 Transformers in Vision: A Survey Jan 4, 2021 Action Recognition Activity Recognition
— Unverified 00 Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge Visual Question Answering Jan 1, 2022 Generative Question Answering Image to text
— Unverified 00 Translation Deserves Better: Analyzing Translation Artifacts in Cross-lingual Visual Question Answering Jun 4, 2024 Data Augmentation Machine Translation
— Unverified 00 Tree Memory Networks for Modelling Long-term Temporal Dependencies Mar 12, 2017 Machine Translation Part-Of-Speech Tagging
— Unverified 00 Triplet-Aware Scene Graph Embeddings Sep 19, 2019 Data Augmentation Graph Embedding
— Unverified 00 Tri-VQA: Triangular Reasoning Medical Visual Question Answering for Multi-Attribute Analysis Jun 21, 2024 Attribute Medical Visual Question Answering
— Unverified 00 TrojVLM: Backdoor Attack Against Vision Language Models Sep 28, 2024 Backdoor Attack Image Captioning
— Unverified 00 TRRNet: Tiered Relation Reasoning for Compositional Visual Question Answering Aug 1, 2020 Object Question Answering
— Unverified 00 TruthLens:A Training-Free Paradigm for DeepFake Detection Mar 19, 2025 Binary Classification DeepFake Detection
— Unverified 00 Trying Bilinear Pooling in Video-QA Dec 18, 2020 Question Answering Video Question Answering
— Unverified 00 Two can play this Game: Visual Dialog with Discriminative Question Generation and Answering Mar 29, 2018 Image Captioning Question Answering
— Unverified 00 TxT: Crossmodal End-to-End Learning with Transformers Sep 9, 2021 Multimodal Reasoning Question Answering
— Unverified 00 UC2: Universal Cross-lingual Cross-modal Vision-and-Language Pre-training Apr 1, 2021 Image-text matching Image-text Retrieval
— Unverified 00 U-CAM: Visual Explanation using Uncertainty based Class Activation Maps Aug 17, 2019 Deep Learning Probabilistic Deep Learning
— Unverified 00 SearchLVLMs: A Plug-and-Play Framework for Augmenting Large Vision-Language Models by Searching Up-to-Date Internet Knowledge May 23, 2024 Question Answering RAG
— Unverified 00 UFO: A UniFied TransfOrmer for Vision-Language Representation Learning Nov 19, 2021 Image Captioning Image-text matching
— Unverified 00 UIT-Saviors at MEDVQA-GI 2023: Improving Multimodal Learning with Image Enhancement for Gastrointestinal Visual Question Answering Jul 6, 2023 Diagnostic Image Enhancement
— Unverified 00 Unanswerable Questions about Images and Texts Jan 25, 2021 Question Answering Visual Question Answering
— Unverified 00 Uncertainty based Class Activation Maps for Visual Question Answering Jan 23, 2020 Deep Learning Probabilistic Deep Learning
— Unverified 00 Uncertainty-based Visual Question Answering: Estimating Semantic Inconsistency between Image and Knowledge Base Nov 16, 2021 Question Answering Semantic Similarity
— Unverified 00 Uncertainty-based Visual Question Answering: Estimating Semantic Inconsistency between Image and Knowledge Base Jul 27, 2022 Question Answering Semantic Similarity
— Unverified 00 Understanding and Constructing Latent Modality Structures in Multi-modal Representation Learning Mar 10, 2023 Few-Shot Image Classification image-classification
— Unverified 00 Understanding and Mitigating Classification Errors Through Interpretable Token Patterns Nov 18, 2023 Classification NER
— Unverified 00 Understanding Attention for Vision-and-Language Tasks Dec 17, 2021 Image Generation Image Retrieval
— Unverified 00 Understanding in Artificial Intelligence Jan 17, 2021 Natural Language Understanding Question Answering
— Unverified 00 Understanding Information Storage and Transfer in Multi-modal Large Language Models Jun 6, 2024 Factual Visual Question Answering Model Editing
— Unverified 00 Understanding Knowledge Gaps in Visual Question Answering: Implications for Gap Identification and Testing Apr 8, 2020 Diversity Question Answering
— Unverified 00 Understanding the Role of Scene Graphs in Visual Question Answering Jan 14, 2021 Graph Generation Question Answering
— Unverified 00 UnICLAM:Contrastive Representation Learning with Adversarial Masking for Unified and Interpretable Medical Vision Question Answering Dec 21, 2022 Data Augmentation Decision Making
— Unverified 00 UniCode: Learning a Unified Codebook for Multimodal Large Language Models Mar 14, 2024 Quantization Visual Question Answering (VQA)
— Unverified 00 Uni-EDEN: Universal Encoder-Decoder Network by Multi-Granular Vision-Language Pre-training Jan 11, 2022 Decoder Image Captioning
— Unverified 00 Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision Language Audio and Action Jan 1, 2024 Image Generation Instruction Following
— Unverified 00 Unified-IO: A Unified Model for Vision, Language, and Multi-Modal Tasks Jun 17, 2022 Depth Estimation Image Generation
— Unverified 00 Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation Dec 10, 2021 Image-text matching Image-text Retrieval
— Unverified 00 Unified Scene Representation and Reconstruction for 3D Large Language Models Apr 19, 2024 3D Reconstruction Scene Understanding
— Unverified 00 Uni-Mlip: Unified Self-supervision for Medical Vision Language Pre-training Nov 20, 2024 Contrastive Learning image-classification
— Unverified 00 UniRVQA: A Unified Framework for Retrieval-Augmented Vision Question Answering via Self-Reflective Joint Training Apr 5, 2025 Articles Question Answering
— Unverified 00 UNITER: Learning UNiversal Image-TExt Representations Sep 25, 2019 Image-text matching Image-text Retrieval
— Unverified 00 Un jeu de données pour répondre à des questions visuelles à propos d’entités nommées en utilisant des bases de connaissances (ViQuAE, a Dataset for Knowledge-based Visual Question Answering about Named Entities) Jun 1, 2022 Question Answering Visual Question Answering
— Unverified 00 Unleashing the Potential of Large Language Model: Zero-shot VQA for Flood Disaster Scenario Dec 4, 2023 Language Modeling Language Modelling
— Unverified 00 Unshuffling Data for Improved Generalization Feb 27, 2020 Clustering Data Augmentation
— Unverified 00 Unshuffling Data for Improved Generalization in Visual Question Answering Jan 1, 2021 Out-of-Distribution Generalization Question Answering
— Unverified 00 Unsupervised Keyword Extraction for Full-sentence VQA Nov 23, 2019 Keyword Extraction Question Answering
— Unverified 00 Unsupervised Vision-and-Language Pre-training via Retrieval-based Multi-Granular Alignment Mar 1, 2022 Retrieval Sentence
— Unverified 00 Unveiling Cross Modality Bias in Visual Question Answering: A Causal View with Possible Worlds VQA May 31, 2023 counterfactual Counterfactual Inference
— Unverified 00