Validity-Based Sampling and Smoothing Methods for Multiple Reference Image Captioning Jun 1, 2021 Image Captioning valid
— Unverified 00 Variance-Based Membership Inference Attacks Against Large-Scale Image Captioning Models Jan 1, 2025 Image Captioning Memorization
— Unverified 00 Variational Distribution Learning for Unsupervised Text-to-Image Generation Mar 28, 2023 Image Captioning Image Generation
— Unverified 00 Variational Structured Semantic Inference for Diverse Image Captioning Dec 1, 2019 Decoder Diversity
— Unverified 00 A Frustratingly Simple Approach for End-to-End Image Captioning Jan 30, 2022 Decoder Image Captioning
— Unverified 00 VCRScore: Image captioning metric based on V\&L Transformers, CLIP, and precision-recall Jan 15, 2025 Image Captioning
— Unverified 00 Vector Learning for Cross Domain Representations Sep 27, 2018 Decoder Image Captioning
— Unverified 00 ViCor: Bridging Visual Understanding and Commonsense Reasoning with Large Language Models Oct 9, 2023 Image Captioning Visual Commonsense Reasoning
— Unverified 00 Video Event Detection by Exploiting Word Dependencies from Image Captions Dec 1, 2016 Action Detection Event Detection
— Unverified 00 VideoGameBunny: Towards vision assistants for video games Jul 21, 2024 Image Captioning Scene Understanding
— Unverified 00 VieCap4H-VLSP 2021: ObjectAoA-Enhancing performance of Object Relation Transformer with Attention on Attention for Vietnamese image captioning Nov 10, 2022 Image Captioning Vietnamese Image Captioning
— Unverified 00 vieCap4H-VLSP 2021: Vietnamese Image Captioning for Healthcare Domain using Swin Transformer and Attention-based LSTM Sep 3, 2022 Decoder Image Captioning
— Unverified 00 Violet: A Vision-Language Model for Arabic Image Captioning with Gemini Decoder Nov 15, 2023 Decoder Image Captioning
— Unverified 00 VipAct: Visual-Perception Enhancement via Specialized VLM Agent Collaboration and Tool-use Oct 21, 2024 Image Captioning Task Planning
— Unverified 00 ViP-CNN: Visual Phrase Guided Convolutional Neural Network Feb 23, 2017 Descriptive Image Captioning
— Unverified 00 VisBuddy -- A Smart Wearable Assistant for the Visually Challenged Aug 17, 2021 Image Captioning object-detection
— Unverified 00 VisCon-100K: Leveraging Contextual Web Data for Fine-tuning Vision Language Models Feb 14, 2025 Image Captioning Large Language Model
— Unverified 00 Vision and Language Integration: Moving beyond Objects Jan 1, 2017 Action Classification Image Captioning
— Unverified 00 Vision Language Model-based Caption Evaluation Method Leveraging Visual Context Extraction Feb 28, 2024 Image Captioning Language Modeling
— Unverified 00 Vision Language Models Can Parse Floor Plan Maps Sep 19, 2024 Image Captioning Question Answering
— Unverified 00 Vision-Language Models for Edge Networks: A Comprehensive Survey Feb 11, 2025 Autonomous Vehicles Image Captioning
— Unverified 00 Vision-Language Models Represent Darker-Skinned Black Individuals as More Homogeneous than Lighter-Skinned Black Individuals Dec 12, 2024 Image Captioning Image Generation
— Unverified 00 Vision-to-Language Tasks Based on Attributes and Attention Mechanism May 29, 2019 Image Captioning Question Answering
— Unverified 00 Vispi: Automatic Visual Perception and Interpretation of Chest X-rays Jun 12, 2019 Diagnostic Image Captioning
— Unverified 00 Visual Analytics for Efficient Image Exploration and User-Guided Image Captioning Nov 2, 2023 Caption Generation Efficient Exploration
— Unverified 00 Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences Jul 31, 2023 Decoder Image Captioning
— Unverified 00 Visual Classifier Prediction by Distributional Semantic Embedding of Text Descriptions Sep 1, 2015 Domain Adaptation Image Captioning
— Unverified 00 Visual Hallucination: Definition, Quantification, and Prescriptive Remediations Mar 26, 2024 Hallucination Image Captioning
— Unverified 00 Visual Information Matters for ASR Error Correction Mar 16, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Visually Guided Spatial Relation Extraction from Text Jun 1, 2018 Activity Recognition Image Captioning
— Unverified 00 Visual Question Answering Dataset for Bilingual Image Understanding: A Study of Cross-Lingual Transfer Using Attention Maps Aug 1, 2018 Cross-Lingual Transfer Image Captioning
— Unverified 00 Visual representation of negation: Real world data analysis on comic image design May 21, 2021 Image Captioning image-classification
— Unverified 00 Visual Transformer for Object Detection Jun 1, 2022 Image Captioning Machine Translation
— Unverified 00 ViTOC: Vision Transformer and Object-aware Captioner Nov 9, 2024 Diversity Image Captioning
— Unverified 00 VIVO: Visual Vocabulary Pre-Training for Novel Object Captioning Sep 28, 2020 Image Captioning Object
— Unverified 00 VLRM: Vision-Language Models act as Reward Models for Image Captioning Apr 2, 2024 Image Captioning reinforcement-learning
— Unverified 00 VolDoGer: LLM-assisted Datasets for Domain Generalization in Vision-Language Tasks Jul 29, 2024 Deep Learning Domain Generalization
— Unverified 00 Wasserstein Barycenter Model Ensembling May 1, 2019 Attribute General Classification
— Unverified 00 Weakly Supervised Annotations for Multi-modal Greeting Cards Dataset Dec 1, 2022 Image Captioning Image Generation
— Unverified 00 Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models Sep 10, 2020 Caption Generation Denoising
— Unverified 00 WEmbSim: A Simple yet Effective Metric for Image Captioning Dec 24, 2020 Image Captioning Word Embeddings
— Unverified 00 What a Whole Slide Image Can Tell? Subtype-guided Masked Transformer for Pathological Image Captioning Oct 31, 2023 Image Captioning Sentence
— Unverified 00 What Else Would I Like? A User Simulator using Alternatives for Improved Evaluation of Fashion Conversational Recommendation Systems Jan 11, 2024 Conversational Recommendation Image Captioning
— Unverified 00 What Is a Good Caption? A Comprehensive Visual Caption Benchmark for Evaluating Both Correctness and Thoroughness Feb 19, 2025 Image Captioning Keyword Extraction
— Unverified 00 What is not where: the challenge of integrating spatial representations into deep learning architectures Jul 21, 2018 Caption Generation Deep Learning
— Unverified 00 When Radiology Report Generation Meets Knowledge Graph Feb 19, 2020 Graph Embedding Image Captioning
— Unverified 00 When to Finish? Optimal Beam Search for Neural Text Generation (modulo beam size) Aug 31, 2018 Image Captioning Machine Translation
— Unverified 00 Where to Play: Retrieval of Video Segments using Natural-Language Queries Jul 2, 2017 Image Captioning Natural Language Queries
— Unverified 00 “Wikily” Supervised Neural Translation Tailored to Cross-Lingual Tasks Nov 1, 2021 Cross-Lingual Transfer Cross-Lingual Word Embeddings
— Unverified 00 WMT 2016 Multimodal Translation System Description based on Bidirectional Recurrent Neural Networks with Double-Embeddings Aug 1, 2016 Image Captioning Language Modeling
— Unverified 00