| Image Representations and New Domains in Neural Image Captioning | Aug 9, 2015 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| Image to Bengali Caption Generation Using Deep CNN and Bidirectional Gated Recurrent Unit | Dec 22, 2020 | Caption GenerationDecoder | —Unverified | 0 | 0 |
| Improving Image Captioning with Better Use of Caption | Jul 1, 2020 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| Integrating Frequency-Domain Representations with Low-Rank Adaptation in Vision-Language Models | Mar 8, 2025 | Caption GenerationQuestion Answering | —Unverified | 0 | 0 |
| Knowledge Distillation for Efficient Audio-Visual Video Captioning | Jun 16, 2023 | Audio-Visual Video CaptioningCaption Generation | —Unverified | 0 | 0 |
| Knowledge driven Description Synthesis for Floor Plan Interpretation | Mar 15, 2021 | Caption GenerationDescriptive | —Unverified | 0 | 0 |
| Language Production Dynamics with Recurrent Neural Networks | Jul 1, 2018 | Caption GenerationLanguage Modeling | —Unverified | 0 | 0 |
| LaPIG: Cross-Modal Generation of Paired Thermal and Visible Facial Images | Mar 20, 2025 | Caption GenerationDiversity | —Unverified | 0 | 0 |
| Learning a Recurrent Visual Representation for Image Caption Generation | Nov 20, 2014 | Caption GenerationImage Retrieval | —Unverified | 0 | 0 |
| Learning from Massive Human Videos for Universal Humanoid Pose Control | Dec 18, 2024 | Caption GenerationHumanoid Control | —Unverified | 0 | 0 |