| Clue: Cross-modal Coherence Modeling for Caption Generation | May 2, 2020 | Caption Generation, controllable image captioning | —Unverified | 0 |
| Image Position Prediction in Multimodal Documents | May 1, 2020 | Articles, Caption Generation | —Unverified | 0 |
| Caption Generation of Robot Behaviors based on Unsupervised Learning of Action Segments | Mar 23, 2020 | Caption Generation, Chunking | —Unverified | 0 |
| Video Caption Dataset for Describing Human Actions in Japanese | Mar 10, 2020 | Caption Generation | —Unverified | 0 |
| Fast Image Caption Generation with Position Alignment | Dec 13, 2019 | Caption Generation, Decoder | —Unverified | 0 |
| Injecting Prior Knowledge into Image Caption Generation | Nov 22, 2019 | Caption Generation, Image Captioning | —Unverified | 0 |
| TPsgtR: Neural-Symbolic Tensor Product Scene-Graph-Triplet Representation for Image Captioning | Nov 22, 2019 | Caption Generation, Image Captioning | —Unverified | 0 |
| Deep Verifier Networks: Verification of Deep Discriminative Models with Deep Generative Models | Nov 18, 2019 | Anomaly Detection, Autonomous Driving | —Unverified | 0 |
| Multimodal Intelligence: Representation Learning, Information Fusion, and Applications | Nov 10, 2019 | Caption Generation, Image Generation | —Unverified | 0 |
| WAT2019: English-Hindi Translation on Hindi Visual Genome Dataset | Nov 1, 2019 | Caption Generation, Translation | —Unverified | 0 |