| Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer Network | Dec 13, 2020 | Caption GenerationDecoder | CodeCode Available | 1 |
| TAP: Text-Aware Pre-training for Text-VQA and Text-Caption | Dec 8, 2020 | Caption GenerationLanguage Modeling | CodeCode Available | 1 |
| Improving Image Captioning with Better Use of Captions | Jun 21, 2020 | Caption GenerationImage Captioning | CodeCode Available | 1 |
| Say As You Wish: Fine-grained Control of Image Caption Generation with Abstract Scene Graphs | Mar 1, 2020 | AttributeCaption Generation | CodeCode Available | 1 |
| Deep Reinforcement Learning For Sequence to Sequence Models | May 24, 2018 | Abstractive Text SummarizationCaption Generation | CodeCode Available | 1 |
| Grad-CAM++: Improved Visual Explanations for Deep Convolutional Networks | Oct 30, 2017 | 3D Action RecognitionAction Recognition | CodeCode Available | 1 |
| Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation | Aug 17, 2016 | Caption GenerationDecoder | CodeCode Available | 1 |
| Video captioning with recurrent networks based on frame- and video-level features and visual content classification | Dec 9, 2015 | Caption GenerationGeneral Classification | CodeCode Available | 1 |
| Microsoft COCO Captions: Data Collection and Evaluation Server | Apr 1, 2015 | Caption Generation | CodeCode Available | 1 |
| Show, Attend and Tell: Neural Image Caption Generation with Visual Attention | Feb 10, 2015 | Caption GenerationImage Captioning | CodeCode Available | 1 |