| Sequence to Sequence - Video to Text | Dec 1, 2015 | Caption GenerationLanguage Modeling | —Unverified | 0 | 0 |
| Set Prediction Guided by Semantic Concepts for Diverse Video Captioning | Dec 25, 2023 | Caption GenerationDiversity | —Unverified | 0 | 0 |
| Simultaneous Segmentation and Recognition: Towards more accurate Ego Gesture Recognition | Sep 18, 2019 | Activity RecognitionCaption Generation | —Unverified | 0 | 0 |
| Skip-Gram − Zipf + Uniform = Vector Additivity | Jul 1, 2017 | Caption GenerationDimensionality Reduction | —Unverified | 0 | 0 |
| SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | Oct 12, 2024 | AudioCapsAudio captioning | —Unverified | 0 | 0 |
| Social Media Ready Caption Generation for Brands | Jan 3, 2024 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| Soft + Hardwired Attention: An LSTM Framework for Human Trajectory Prediction and Abnormal Event Detection | Feb 18, 2017 | Caption GenerationEvent Detection | —Unverified | 0 | 0 |
| Spatio-Temporal Dynamics and Semantic Attribute Enriched Visual Encoding for Video Captioning | Feb 27, 2019 | AttributeCaption Generation | —Unverified | 0 | 0 |
| Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning | Feb 8, 2023 | Caption GenerationDecoder | —Unverified | 0 | 0 |
| Stack-VS: Stacked Visual-Semantic Attention for Image Caption Generation | Sep 5, 2019 | AttributeCaption Generation | —Unverified | 0 | 0 |
| Structural and Functional Decomposition for Personality Image Captioning in a Communication Game | Nov 17, 2020 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| StyleNet: Generating Attractive Visual Captions With Styles | Jul 1, 2017 | Caption Generation | —Unverified | 0 | 0 |
| Taming Encoder for Zero Fine-tuning Image Customization with Text-to-Image Diffusion Models | Apr 5, 2023 | Caption GenerationImage Generation | —Unverified | 0 | 0 |
| Temporal Knowledge-Aware Image Captioning | Nov 16, 2021 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| Temporal Object Captioning for Street Scene Videos from LiDAR Tracks | May 22, 2025 | Caption GenerationVideo Captioning | —Unverified | 0 | 0 |
| THE DCASE 2021 CHALLENGE TASK 6 SYSTEM: AUTOMATED AUDIO CAPTIONING WITH WEAKLY SUPERVISED PRE-TRAING AND WORD SELECTION METHODS | Jul 6, 2021 | Audio captioningCaption Generation | —Unverified | 0 | 0 |
| The NTT DCASE2020 Challenge Task 6 system: Automated Audio Captioning with Keywords and Sentence Length Estimation | Jul 1, 2020 | Audio captioningCaption Generation | —Unverified | 0 | 0 |
| The Solution for the ICCV 2023 1st Scientific Figure Captioning Challenge | Mar 26, 2024 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| The Use of Object Labels and Spatial Prepositions as Keywords in a Web-Retrieval-Based Image Caption Generation System | Apr 1, 2017 | Caption GenerationImage Retrieval | —Unverified | 0 | 0 |
| Time Series Language Model for Descriptive Caption Generation | Jan 3, 2025 | Caption GenerationDenoising | —Unverified | 0 | 0 |
| TimeSoccer: An End-to-End Multimodal Large Language Model for Soccer Commentary Generation | Apr 24, 2025 | Caption GenerationDense Video Captioning | —Unverified | 0 | 0 |
| Topic Scene Graph Generation by Attention Distillation From Caption | Jan 1, 2021 | Caption GenerationGraph Generation | —Unverified | 0 | 0 |
| TPsgtR: Neural-Symbolic Tensor Product Scene-Graph-Triplet Representation for Image Captioning | Nov 22, 2019 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| Uncertainty-Aware Image Captioning | Nov 30, 2022 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| Understanding How Paper Writers Use AI-Generated Captions in Figure Caption Writing | Jan 10, 2025 | Caption Generation | —Unverified | 0 | 0 |