| MICap: A Unified Model for Identity-aware Movie Descriptions | May 19, 2024 | Caption Generation, Decoder | —Unverified | 0 |
| Mind's Eye: A Recurrent Visual Representation for Image Caption Generation | Jun 1, 2015 | Caption Generation, Image Description | —Unverified | 0 |
| Multilingual Image Corpus – Towards a Multimodal and Multilingual Dataset | Jun 1, 2022 | Caption Generation, image-classification | —Unverified | 0 |
| Multi-modal Dependency Tree for Video Captioning | Dec 1, 2021 | Caption Generation, Dependency Parsing | —Unverified | 0 |
| Multi-Modal Generative Embedding Model | May 29, 2024 | Caption Generation, Cross-Modal Retrieval | —Unverified | 0 |
| Multimodal Intelligence: Representation Learning, Information Fusion, and Applications | Nov 10, 2019 | Caption Generation, Image Generation | —Unverified | 0 |
| Multi-modal reward for visual relationships-based image captioning | Mar 19, 2023 | Caption Generation, Deep Reinforcement Learning | —Unverified | 0 |
| Multi-Similarity Contrastive Learning | Jul 6, 2023 | Caption Generation, Contrastive Learning | —Unverified | 0 |
| Multi-task Sequence to Sequence Learning | Nov 19, 2015 | Caption Generation, Decoder | —Unverified | 0 |
| Neural Attention Models for Sequence Classification: Analysis and Application to Key Term Extraction and Dialogue Act Detection | Mar 31, 2016 | Caption Generation, Classification | —Unverified | 0 |