| Journalistic Guidelines Aware News Image Captioning | Sep 7, 2021 | Caption Generation, Descriptive | Code Available | 0 |
| Event and Entity Extraction from Generated Video Captions | Nov 5, 2022 | Caption Generation, Dense Video Captioning | Code Available | 0 |
| Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning | Dec 6, 2017 | Caption Generation, Decoder | Code Available | 0 |
| Memeify: A Large-Scale Meme Generation System | Oct 27, 2019 | Caption Generation, Decoder | Code Available | 0 |
| Sequence to Sequence -- Video to Text | May 3, 2015 | Caption Generation, Language Modeling | Code Available | 0 |
| Image Captioning with Deep Bidirectional LSTMs | Apr 4, 2016 | Caption Generation, Data Augmentation | Code Available | 0 |
| ViPE: Visualise Pretty-much Everything | Oct 16, 2023 | Caption Generation, Figurative Language Visualization | Code Available | 0 |
| Mol2Lang-VLM: Vision- and Text-Guided Generative Pre-trained Language Models for Advancing Molecule Captioning through Multimodal Fusion | Aug 15, 2024 | Caption Generation, Decoder | Code Available | 0 |
| Image Caption Generation for News Articles | Dec 1, 2020 | Articles, Caption Generation | Code Available | 0 |
| 3D CoCa: Contrastive Learners are 3D Captioners | Apr 13, 2025 | 3D dense captioning, Caption Generation | Code Available | 0 |