| Neural Caption Generation for News Images | May 1, 2018 | Caption GenerationImage Captioning | —Unverified | 0 |
| Local Information Assisted Attention-free Decoder for Audio Captioning | Jan 10, 2022 | Audio captioningCaption Generation | CodeCode Available | 0 |
| Comparative evaluation of CNN architectures for Image Caption Generation | Feb 23, 2021 | Caption GenerationObject Recognition | CodeCode Available | 0 |
| An Actor-Critic Algorithm for Sequence Prediction | Jul 24, 2016 | Caption GenerationMachine Translation | CodeCode Available | 0 |
| LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation | Sep 4, 2021 | Caption GenerationImage Captioning | CodeCode Available | 0 |
| AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGS | Nov 15, 2021 | AudioCapsAudio captioning | CodeCode Available | 0 |
| SciCap+: A Knowledge Augmented Dataset to Study the Challenges of Scientific Figure Captioning | Jun 6, 2023 | Caption GenerationImage Captioning | CodeCode Available | 0 |
| Tensor Product Generation Networks for Deep NLP Modeling | Sep 26, 2017 | Caption Generation | CodeCode Available | 0 |
| Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens | Jun 19, 2024 | Caption Generationimage-classification | CodeCode Available | 0 |
| Efficient Urdu Caption Generation using Attention based LSTM | Aug 2, 2020 | Caption Generation | CodeCode Available | 0 |
| Journalistic Guidelines Aware News Image Captioning | Sep 7, 2021 | Caption GenerationDescriptive | CodeCode Available | 0 |
| Event and Entity Extraction from Generated Video Captions | Nov 5, 2022 | Caption GenerationDense Video Captioning | CodeCode Available | 0 |
| Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning | Dec 6, 2017 | Caption GenerationDecoder | CodeCode Available | 0 |
| Memeify: A Large-Scale Meme Generation System | Oct 27, 2019 | Caption GenerationDecoder | CodeCode Available | 0 |
| Sequence to Sequence -- Video to Text | May 3, 2015 | Caption GenerationLanguage Modeling | CodeCode Available | 0 |
| Image Captioning with Deep Bidirectional LSTMs | Apr 4, 2016 | Caption GenerationData Augmentation | CodeCode Available | 0 |
| ViPE: Visualise Pretty-much Everything | Oct 16, 2023 | Caption GenerationFigurative Language Visualization | CodeCode Available | 0 |
| Mol2Lang-VLM: Vision- and Text-Guided Generative Pre-trained Language Models for Advancing Molecule Captioning through Multimodal Fusion | Aug 15, 2024 | Caption GenerationDecoder | CodeCode Available | 0 |
| Image Caption Generation for News Articles | Dec 1, 2020 | ArticlesCaption Generation | CodeCode Available | 0 |
| 3D CoCa: Contrastive Learners are 3D Captioners | Apr 13, 2025 | 3D dense captioningCaption Generation | CodeCode Available | 0 |
| Multi-LLM Collaborative Caption Generation in Scientific Documents | Jan 5, 2025 | Caption GenerationImage to text | CodeCode Available | 0 |
| DSD: Dense-Sparse-Dense Training for Deep Neural Networks | Jul 15, 2016 | 8kCaption Generation | CodeCode Available | 0 |
| Using Artificial Tokens to Control Languages for Multilingual Image Caption Generation | Jun 20, 2017 | Caption Generation | CodeCode Available | 0 |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT | Dec 3, 2023 | Caption GenerationDecoder | CodeCode Available | 0 |
| Multimodal Preference Data Synthetic Alignment with Reward Model | Dec 23, 2024 | 2kCaption Generation | CodeCode Available | 0 |