| Describing Multimedia Content using Attention-based Encoder--Decoder Networks | Jul 4, 2015 | Caption GenerationDecoder | —Unverified | 0 |
| Describing Natural Images Containing Novel Objects with Knowledge Guided Assitance | Oct 17, 2017 | Caption Generation | —Unverified | 0 |
| Caption Generation on Scenes with Seen and Unseen Object Categories | Aug 13, 2021 | Caption GenerationLanguage Modelling | —Unverified | 0 |
| DiffCap: Exploring Continuous Diffusion on Image Captioning | May 20, 2023 | Caption GenerationDiversity | —Unverified | 0 |
| DIR: Retrieval-Augmented Image Captioning with Comprehensive Understanding | Dec 2, 2024 | Caption GenerationDomain Generalization | —Unverified | 0 |
| Diverse and Accurate Image Description Using a Variational Auto-Encoder with an Additive Gaussian Encoding Space | Nov 19, 2017 | Caption GenerationImage Description | —Unverified | 0 |
| Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? | Jun 20, 2024 | Caption GenerationHallucination | —Unverified | 0 |
| Do Large Multimodal Models Solve Caption Generation for Scientific Figures? Lessons Learned from SCICAP Challenge 2023 | Jan 31, 2025 | ArticlesCaption Generation | —Unverified | 0 |
| Domain Adaptation for Neural Networks by Parameter Augmentation | Jul 1, 2016 | Caption GenerationDomain Adaptation | —Unverified | 0 |
| DS@BioMed at ImageCLEFmedical Caption 2024: Enhanced Attention Mechanisms in Medical Caption Generation through Concept Detection Integration | Jun 1, 2024 | Caption GenerationImage Captioning | —Unverified | 0 |