Learning to Relate from Captions and Bounding Boxes Dec 1, 2019 Image Captioning Relation Classification
— Unverified 00 Learning to Select: A Fully Attentive Approach for Novel Object Captioning Jun 2, 2021 Image Captioning Language Modeling
— Unverified 00 Learning Visual-Linguistic Adequacy, Fidelity, and Fluency for Novel Object Captioning Sep 29, 2021 Image Captioning
— Unverified 00 Learning Visual Representations with Caption Annotations Aug 4, 2020 Image Captioning Language Modeling
— Unverified 00 Learning Word Embeddings for Low-Resource Languages by PU Learning Jun 1, 2018 Document Ranking Image Captioning
— Unverified 00 Let's Go Shopping (LGS) -- Web-Scale Image-Text Dataset for Visual Concept Understanding Jan 9, 2024 Image Captioning image-classification
— Unverified 00 "Let's not Quote out of Context": Unified Vision-Language Pretraining for Context Assisted Image Captioning Jun 1, 2023 Image Captioning Keyword Extraction
— Unverified 00 Leveraging Partial Dependency Trees to Control Image Captions Jun 1, 2021 Image Captioning
— Unverified 00 Leveraging Sentence Similarity in Natural Language Generation: Improving Beam Search using Range Voting Aug 17, 2019 Image Captioning Language Modeling
— Unverified 00 Leveraging Visual Knowledge in Language Tasks: An Empirical Study on Intermediate Pre-training for Cross-Modal Knowledge Transfer Nov 16, 2021 Image Captioning Language Modeling
— Unverified 00 Leveraging Visual Knowledge in Language Tasks: An Empirical Study on Intermediate Pre-training for Cross-modal Knowledge Transfer Mar 14, 2022 Image Captioning Language Modeling
— Unverified 00 Lexical Simplification with the Deep Structured Similarity Model Nov 1, 2017 Image Captioning Learning Word Embeddings
— Unverified 00 LG-VQ: Language-Guided Codebook Learning May 23, 2024 Image Captioning Image Generation
— Unverified 00 Light as Deception: GPT-driven Natural Relighting Against Vision-Language Pre-training Models May 30, 2025 Image Captioning Question Answering
— Unverified 00 Lightweight In-Context Tuning for Multimodal Unified Models Oct 8, 2023 Image Captioning In-Context Learning
— Unverified 00 Linguistically-aware Attention for Reducing the Semantic-Gap in Vision-Language Tasks Aug 18, 2020 Image Captioning Visual Question Answering (VQA)
— Unverified 00 利用图像描述与知识图谱增强表示的视觉问答(Exploiting Image Captions and External Knowledge as Representation Enhancement for Visual Question Answering) Aug 1, 2021 Image Captioning Question Answering
— Unverified 00 LLaMA-Excitor: General Instruction Tuning via Indirect Feature Interaction Apr 1, 2024 Image Captioning Instruction Following
— Unverified 00 LLARVA: Vision-Action Instruction Tuning Enhances Robot Learning Jun 17, 2024 Image Captioning Question Answering
— Unverified 00 LLM4VG: Large Language Models Evaluation for Video Grounding Dec 21, 2023 Image Captioning Video Grounding
— Unverified 00 LLMs Can Check Their Own Results to Mitigate Hallucinations in Traffic Understanding Tasks Sep 19, 2024 Autonomous Driving Hallucination
— Unverified 00 LocCa: Visual Pretraining with Location-aware Captioners Mar 28, 2024 Decoder Image Captioning
— Unverified 00 Longer Version for "Deep Context-Encoding Network for Retinal Image Captioning" May 30, 2021 Avg Decoder
— Unverified 00 Long-Tail Classification for Distinctive Image Captioning: A Simple yet Effective Remedy for Side Effects of Reinforcement Learning Jan 16, 2022 Image Captioning Reinforcement Learning (RL)
— Unverified 00 Look Back and Predict Forward in Image Captioning Jun 1, 2019 Decoder Image Captioning
— Unverified 00 Look Deeper See Richer: Depth-aware Image Paragraph Captioning Oct 15, 2018 Decoder Image Captioning
— Unverified 00 LookupViT: Compressing visual information to a limited number of tokens Jul 17, 2024 Image Captioning image-classification
— Unverified 00 Lost in Translation: When GPT-4V(ision) Can't See Eye to Eye with Text. A Vision-Language-Consistency Analysis of VLLMs and Beyond Oct 19, 2023 Image Captioning Language Modeling
— Unverified 00 LVLM_CSP: Accelerating Large Vision Language Models via Clustering, Scattering, and Pruning for Reasoning Segmentation Apr 15, 2025 Image Captioning Question Answering
— Unverified 00 Lyrics: Boosting Fine-grained Language-Vision Alignment and Comprehension via Semantic-aware Visual Objects Dec 8, 2023 Image Captioning object-detection
— Unverified 00 M3D-GAN: Multi-Modal Multi-Domain Translation with Universal Attention Jul 9, 2019 Dialogue Generation Image Captioning
— Unverified 00 Macroscopic Control of Text Generation for Image Captioning Jan 20, 2021 Diversity Image Captioning
— Unverified 00 MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning Dec 13, 2021 Caption Generation Descriptive
— Unverified 00 MAGNet: Multi-Region Attention-Assisted Grounding of Natural Language Queries at Phrase Level Jun 6, 2020 Attribute Image Captioning
— Unverified 00 Making the Most of What You Have: Adapting Pre-trained Visual Language Models in the Low-data Regime May 3, 2023 Image Captioning Question Answering
— Unverified 00 Making Use of Latent Space in Language GANs for Generating Diverse Text without Pre-training Apr 1, 2021 Diversity Image Captioning
— Unverified 00 MAMI: Multi-Attentional Mutual-Information for Long Sequence Neuron Captioning Jan 5, 2024 Decoder Image Captioning
— Unverified 00 Mapping Images to Sentiment Adjective Noun Pairs with Factorized Neural Nets Nov 21, 2015 Image Captioning
— Unverified 00 Mask-aware Text-to-Image Retrieval: Referring Expression Segmentation Meets Cross-modal Retrieval Jun 28, 2025 Cross-Modal Retrieval Image Captioning
— Unverified 00 Mask-free OVIS: Open-Vocabulary Instance Segmentation without Manual Mask Annotations Mar 29, 2023 Image Captioning Instance Segmentation
— Unverified 00 MAT: A Multimodal Attentive Translator for Image Captioning Feb 18, 2017 Caption Generation Image Captioning
— Unverified 00 Maybe you are looking for CroQS: Cross-modal Query Suggestion for Text-to-Image Retrieval Dec 18, 2024 Cross-Modal Retrieval Image Captioning
— Unverified 00 Measuring directional bias amplification in image captions using predictability Mar 10, 2025 Image Captioning image-classification
— Unverified 00 Measuring Machine Intelligence Through Visual Question Answering Aug 31, 2016 Image Captioning Question Answering
— Unverified 00 Measuring Representational Harms in Image Captioning Jun 14, 2022 Fairness Image Captioning
— Unverified 00 MedBLIP: Fine-tuning BLIP for Medical Image Captioning May 20, 2025 Decoder Image Captioning
— Unverified 00 Medical Image Captioning via Generative Pretrained Transformers Sep 28, 2022 Caption Generation Descriptive
— Unverified 00 MedXpertQA: Benchmarking Expert-Level Medical Reasoning and Understanding Jan 30, 2025 Benchmarking Decision Making
— Unverified 00 MetaToken: Detecting Hallucination in Image Descriptions by Meta Classification May 29, 2024 Hallucination Image Captioning
— Unverified 00 Metropolis-Hastings Captioning Game: Knowledge Fusion of Vision Language Models via Decentralized Bayesian Inference Apr 13, 2025 Bayesian Inference Image Captioning
— Unverified 00