| Multi-LLM Collaborative Caption Generation in Scientific Documents | Jan 5, 2025 | Caption GenerationImage to text | CodeCode Available | 0 |
| DSD: Dense-Sparse-Dense Training for Deep Neural Networks | Jul 15, 2016 | 8kCaption Generation | CodeCode Available | 0 |
| Using Artificial Tokens to Control Languages for Multilingual Image Caption Generation | Jun 20, 2017 | Caption Generation | CodeCode Available | 0 |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT | Dec 3, 2023 | Caption GenerationDecoder | CodeCode Available | 0 |
| Multimodal Preference Data Synthetic Alignment with Reward Model | Dec 23, 2024 | 2kCaption Generation | CodeCode Available | 0 |
| Where to put the Image in an Image Caption Generator | Mar 27, 2017 | Caption GenerationLanguage Modeling | CodeCode Available | 0 |
| Discriminability objective for training descriptive captions | Mar 12, 2018 | Caption GenerationDescriptive | CodeCode Available | 0 |
| Multi-source weak supervision for saliency detection | Apr 1, 2019 | Caption GenerationSaliency Detection | CodeCode Available | 0 |
| An Empirical Study of Language CNN for Image Captioning | Dec 21, 2016 | Caption GenerationImage Captioning | CodeCode Available | 0 |
| Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning | Jun 15, 2024 | Caption Generation | CodeCode Available | 0 |