| Local Information Assisted Attention-free Decoder for Audio Captioning | Jan 10, 2022 | Audio captioningCaption Generation | CodeCode Available | 0 | 5 |
| Mol2Lang-VLM: Vision- and Text-Guided Generative Pre-trained Language Models for Advancing Molecule Captioning through Multimodal Fusion | Aug 15, 2024 | Caption GenerationDecoder | CodeCode Available | 0 | 5 |
| Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network | Aug 27, 2019 | Caption GenerationDecoder | CodeCode Available | 0 | 5 |
| Evaluating and interpreting caption prediction for histopathology images | Jul 8, 2020 | Caption GenerationImage Captioning | CodeCode Available | 0 | 5 |
| DSD: Dense-Sparse-Dense Training for Deep Neural Networks | Jul 15, 2016 | 8kCaption Generation | CodeCode Available | 0 | 5 |
| CNN Fixations: An unraveling approach to visualize the discriminative image regions | Aug 22, 2017 | Caption GenerationImage Captioning | CodeCode Available | 0 | 5 |
| Journalistic Guidelines Aware News Image Captioning | Sep 7, 2021 | Caption GenerationDescriptive | CodeCode Available | 0 | 5 |
| Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning | Jun 15, 2024 | Caption Generation | CodeCode Available | 0 | 5 |
| CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter | Nov 30, 2021 | Caption GenerationRepresentation Learning | CodeCode Available | 0 | 5 |
| Image Caption Generation for News Articles | Dec 1, 2020 | ArticlesCaption Generation | CodeCode Available | 0 | 5 |