| Injecting Prior Knowledge into Image Caption Generation | Nov 22, 2019 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| Learning Wake-Sleep Recurrent Attention Models | Sep 22, 2015 | Caption GenerationComputational Efficiency | —Unverified | 0 | 0 |
| LLMs in Political Science: Heralding a New Era of Visual Analysis | Feb 29, 2024 | Caption GenerationFace Identification | —Unverified | 0 | 0 |
| LoHoRavens: A Long-Horizon Language-Conditioned Benchmark for Robotic Tabletop Manipulation | Oct 18, 2023 | Caption GenerationInstruction Following | —Unverified | 0 | 0 |
| LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models | Feb 21, 2025 | Caption GenerationVideo Captioning | —Unverified | 0 | 0 |
| Low-hallucination Synthetic Captions for Large-Scale Vision-Language Model Pre-training | Apr 17, 2025 | Caption GenerationHallucination | —Unverified | 0 | 0 |
| LuoJiaHOG: A Hierarchy Oriented Geo-aware Image Caption Dataset for Remote Sensing Image-Text Retrival | Mar 16, 2024 | Caption GenerationImage-text Retrieval | —Unverified | 0 | 0 |
| MAGIC: Multimodal relAtional Graph adversarIal inferenCe for Diverse and Unpaired Text-based Image Captioning | Dec 13, 2021 | Caption GenerationDescriptive | —Unverified | 0 | 0 |
| MAMS: Model-Agnostic Module Selection Framework for Video Captioning | Jan 30, 2025 | Caption GenerationVideo Captioning | —Unverified | 0 | 0 |
| MAT: A Multimodal Attentive Translator for Image Captioning | Feb 18, 2017 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |