| Paper | Date | Tasks | Code |
| --- | --- | --- | --- |
| Multimodal Transformer with Multi-View Visual Representation for Image Captioning | May 20, 2019 | Decoder, Image Captioning | Unverified |
| Multi-Reference Training with Pseudo-References for Neural Translation and Text Generation | Aug 28, 2018 | Image Captioning, Machine Translation | Unverified |
| Multi-view and Cross-view Brain Decoding | Oct 1, 2022 | Brain Decoding, Image Captioning | Unverified |
| MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data | Jun 26, 2024 | Decoder, GPU | Unverified |
| MUTT: Metric Unit TesTing for Language Generation Tasks | Aug 1, 2016 | Image Captioning, Machine Translation | Unverified |
| MyVLM: Personalizing VLMs for User-Specific Queries | Mar 21, 2024 | Image Captioning, Language Modelling | Unverified |
| Natural Language Generation | Mar 20, 2025 | Image Captioning, Image to text | Unverified |
| Natural Language Statistical Features of LSTM-generated Texts | Apr 10, 2018 | Image Captioning, Text Generation | Unverified |
| Nemesis: Neural Mean Teacher Learning-Based Emotion-Centric Speaker | Feb 9, 2023 | Image Captioning | Unverified |
| Netizen-Style Commenting on Fashion Photos: Dataset and Diversity Measures | Jan 31, 2018 | Cultural Vocal Bursts Intensity Prediction, Diversity | Unverified |
| Neural Attention for Image Captioning: Review of Outstanding Methods | Nov 29, 2021 | Decoder, Deep Learning | Unverified |
| Neural Caption Generation for News Images | May 1, 2018 | Caption Generation, Image Captioning | Unverified |
| Neural Headline Generation on Abstract Meaning Representation | Nov 1, 2016 | Abstract Meaning Representation, Dependency Parsing | Unverified |
| Neural Image Captioning | Jul 2, 2019 | Image Captioning, Machine Translation | Unverified |
| Neural Joking Machine : Humorous image captioning | May 30, 2018 | Image Captioning | Unverified |
| Neural Machine Translation: Basics, Practical Aspects and Recent Trends | Nov 1, 2017 | Image Captioning, Machine Translation | Unverified |
| Learning to Collocate Visual-Linguistic Neural Modules for Image Captioning | Oct 4, 2022 | Image Captioning, Sentence | Code available |
| Rethinking the Reference-based Distinctive Image Captioning | Jul 22, 2022 | Attribute, Benchmarking | Code available |
| Learning to Evaluate Image Captioning | Jun 17, 2018 | 8k, Data Augmentation | Code available |
| ADS-Cap: A Framework for Accurate and Diverse Stylized Captioning with Unpaired Stylistic Corpora | Aug 2, 2023 | Contrastive Learning, Diversity | Code available |
| Learning like a Child: Fast Novel Visual Concept Learning from Sentence Descriptions of Images | Apr 25, 2015 | Image Captioning, Novel Concepts | Code available |
| Bangla Image Caption Generation through CNN-Transformer based Encoder-Decoder Network | Oct 24, 2021 | Caption Generation, Decoder | Code available |
| Retrieval Augmentation for Deep Neural Networks | Feb 25, 2021 | Image Captioning, Retrieval | Code available |
| Evaluating and interpreting caption prediction for histopathology images | Jul 8, 2020 | Caption Generation, Image Captioning | Code available |
| Learning by Correction: Efficient Tuning Task for Zero-Shot Generative Vision-Language Reasoning | Apr 1, 2024 | Image Captioning, Instruction Following | Code available |
| The Power of Many: Multi-Agent Multimodal Models for Cultural Image Captioning | Nov 18, 2024 | Image Captioning | Code available |
| Learning Visually-Grounded Semantics from Contrastive Adversarial Samples | Jun 27, 2018 | Adversarial Attack, Image Captioning | Code available |
| BAN-Cap: A Multi-Purpose English-Bangla Image Descriptions Dataset | May 28, 2022 | Image Captioning, Machine Translation | Code available |
| REVEAL: Retrieval-Augmented Visual-Language Pre-Training with Multi-Source Multimodal Knowledge Memory | Dec 10, 2022 | Image Captioning, Language Modeling | Code available |
| Learning a Deep Embedding Model for Zero-Shot Learning | Nov 15, 2016 | Image Captioning, Sentence | Code available |
| AlphaX: eXploring Neural Architectures with Deep Neural Networks and Monte Carlo Tree Search | Mar 26, 2019 | GPU, Image Captioning | Code available |
| The Role of Syntactic Planning in Compositional Image Captioning | Jan 28, 2021 | Image Captioning | Code available |
| Learn from Downstream and Be Yourself in Multimodal Large Language Model Fine-Tuning | Nov 17, 2024 | Image Captioning, Language Modeling | Code available |
| Leveraging Human Attention in Novel Object Captioning | Aug 19, 2021 | Image Captioning, Object | Code available |
| Leveraging image captions for selective whole slide image annotation | Jul 8, 2024 | Diversity, Image Captioning | Code available |
| Women Wearing Lipstick: Measuring the Bias Between an Object and Its Related Gender | Oct 29, 2023 | Image Captioning | Code available |
| Aligning where to see and what to tell: image caption with region-based attention and scene factorization | Jun 20, 2015 | Image Captioning | Code available |
| Variational Transformer: A Framework Beyond the Trade-off between Accuracy and Diversity for Image Captioning | May 28, 2022 | Diversity, Image Captioning | Code available |
| Enhancing Descriptive Image Captioning with Natural Language Inference | Aug 1, 2021 | Descriptive, Image Captioning | Code available |
| End-to-End Instance Segmentation with Recurrent Attention | May 30, 2016 | Autonomous Driving, Image Captioning | Code available |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT | Dec 3, 2023 | Caption Generation, Decoder | Code available |
| Learning to Caption Images through a Lifetime by Asking Questions | Dec 1, 2018 | Active Learning, Image Captioning | Code available |
| End-to-end Image Captioning Exploits Distributional Similarity in Multimodal Space | Nov 1, 2018 | Image Captioning, Text Generation | Code available |
| End-to-End Attention-based Image Captioning | Apr 30, 2021 | Image Captioning, Translation | Code available |
| LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation | Sep 4, 2021 | Caption Generation, Image Captioning | Code available |
| Large Vision-Language Models for Knowledge-Grounded Data Annotation of Memes | Jan 23, 2025 | Emotion Classification, Image Captioning | Code available |
| LineCap: Line Charts for Data Visualization Captioning Models | Jul 15, 2022 | Data Visualization, Deep Learning | Code available |
| ELIP: Efficient Language-Image Pre-training with Fewer Vision Tokens | Sep 28, 2023 | Cross-Modal Retrieval, GPU | Code available |
| Language Models as Knowledge Bases for Visual Word Sense Disambiguation | Oct 3, 2023 | Image Captioning, Multiple-choice | Code available |
| TIGEr: Text-to-Image Grounding for Image Caption Evaluation | Sep 4, 2019 | Image Captioning, Text Matching | Code available |