| Recurrent Neural Network Regularization | Sep 8, 2014 | Caption GenerationImage Captioning | CodeCode Available | 0 | 5 |
| DSD: Dense-Sparse-Dense Training for Deep Neural Networks | Jul 15, 2016 | 8kCaption Generation | CodeCode Available | 0 | 5 |
| CNN Fixations: An unraveling approach to visualize the discriminative image regions | Aug 22, 2017 | Caption GenerationImage Captioning | CodeCode Available | 0 | 5 |
| Memeify: A Large-Scale Meme Generation System | Oct 27, 2019 | Caption GenerationDecoder | CodeCode Available | 0 | 5 |
| LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation | Sep 4, 2021 | Caption GenerationImage Captioning | CodeCode Available | 0 | 5 |
| CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter | Nov 30, 2021 | Caption GenerationRepresentation Learning | CodeCode Available | 0 | 5 |
| Discriminability objective for training descriptive captions | Mar 12, 2018 | Caption GenerationDescriptive | CodeCode Available | 0 | 5 |
| Local Information Assisted Attention-free Decoder for Audio Captioning | Jan 10, 2022 | Audio captioningCaption Generation | CodeCode Available | 0 | 5 |
| Mol2Lang-VLM: Vision- and Text-Guided Generative Pre-trained Language Models for Advancing Molecule Captioning through Multimodal Fusion | Aug 15, 2024 | Caption GenerationDecoder | CodeCode Available | 0 | 5 |
| Image Captioning with Deep Bidirectional LSTMs | Apr 4, 2016 | Caption GenerationData Augmentation | CodeCode Available | 0 | 5 |
| Deterministic Non-Autoregressive Neural Sequence Modeling by Iterative Refinement | Feb 19, 2018 | Caption GenerationDenoising | CodeCode Available | 0 | 5 |
| Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning | Dec 6, 2017 | Caption GenerationDecoder | CodeCode Available | 0 | 5 |
| Image Caption Generation for News Articles | Dec 1, 2020 | ArticlesCaption Generation | CodeCode Available | 0 | 5 |
| Guiding Long-Short Term Memory for Image Caption Generation | Sep 16, 2015 | Caption Generation | CodeCode Available | 0 | 5 |
| 3D CoCa: Contrastive Learners are 3D Captioners | Apr 13, 2025 | 3D dense captioningCaption Generation | CodeCode Available | 0 | 5 |
| Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning | Jun 15, 2024 | Caption Generation | CodeCode Available | 0 | 5 |
| DeepDiary: Automatic Caption Generation for Lifelogging Image Streams | Aug 12, 2016 | Caption GenerationImage Captioning | CodeCode Available | 0 | 5 |
| Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning | Feb 4, 2023 | Caption GenerationCoherence Evaluation | CodeCode Available | 0 | 5 |
| Journalistic Guidelines Aware News Image Captioning | Sep 7, 2021 | Caption GenerationDescriptive | CodeCode Available | 0 | 5 |
| Referring Expression Object Segmentation with Caption-Aware Consistency | Oct 10, 2019 | Caption GenerationObject | CodeCode Available | 0 | 5 |
| Denoising Large-Scale Image Captioning from Alt-text Data using Content Selection Models | Sep 17, 2021 | Caption GenerationDenoising | —Unverified | 0 | 0 |
| Deep Verifier Networks: Verification of Deep Discriminative Models with Deep Generative Models | Nov 18, 2019 | Anomaly DetectionAutonomous Driving | —Unverified | 0 | 0 |
| Geometry-Entangled Visual Semantic Transformer for Image Captioning | Sep 29, 2021 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| Geo-Aware Image Caption Generation | Dec 1, 2020 | Caption GenerationImage Captioning | —Unverified | 0 | 0 |
| Generating Video Description using Sequence-to-sequence Model with Temporal Attention | Dec 1, 2016 | Caption GenerationSentence | —Unverified | 0 | 0 |