| RealignDiff: Boosting Text-to-Image Diffusion Model with Coarse-to-fine Semantic Re-alignment | May 31, 2023 | Caption Generation, Language Modelling | —Unverified | 0 | 0 |
| Bringing back simplicity and lightliness into neural image captioning | Oct 15, 2018 | Caption Generation, Image Captioning | —Unverified | 0 | 0 |
| CapText: Large Language Model-based Caption Generation From Image Context and Description | Jun 1, 2023 | Caption Generation, Image to text | —Unverified | 0 | 0 |
| Caption Generation of Robot Behaviors based on Unsupervised Learning of Action Segments | Mar 23, 2020 | Caption Generation, Chunking | —Unverified | 0 | 0 |
| Chittron: An Automatic Bangla Image Captioning System | Sep 2, 2018 | Caption Generation, Image Captioning | —Unverified | 0 | 0 |
| Clue: Cross-modal Coherence Modeling for Caption Generation | May 2, 2020 | Caption Generation, controllable image captioning | —Unverified | 0 | 0 |
| Common Subspace for Model and Similarity: Phrase Learning for Caption Generation From Images | Dec 1, 2015 | Caption Generation, Descriptive | —Unverified | 0 | 0 |
| Controlled Caption Generation for Images Through Adversarial Attacks | Jul 7, 2021 | Caption Generation, Image Captioning | —Unverified | 0 | 0 |
| Cortico-cerebellar networks as decoupled neural interfaces | Jan 1, 2021 | Caption Generation | —Unverified | 0 | 0 |
| CoVLA: Comprehensive Vision-Language-Action Dataset for Autonomous Driving | Aug 19, 2024 | Autonomous Driving, Caption Generation | —Unverified | 0 | 0 |