| Local Information Assisted Attention-free Decoder for Audio Captioning | Jan 10, 2022 | Audio captioningCaption Generation | CodeCode Available | 0 |
| AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGS | Nov 15, 2021 | AudioCapsAudio captioning | CodeCode Available | 0 |
| Tensor Product Generation Networks for Deep NLP Modeling | Sep 26, 2017 | Caption Generation | CodeCode Available | 0 |
| LAViTeR: Learning Aligned Visual and Textual Representations Assisted by Image and Caption Generation | Sep 4, 2021 | Caption GenerationImage Captioning | CodeCode Available | 0 |
| SciCap+: A Knowledge Augmented Dataset to Study the Challenges of Scientific Figure Captioning | Jun 6, 2023 | Caption GenerationImage Captioning | CodeCode Available | 0 |
| Attacking Visual Language Grounding with Adversarial Examples: A Case Study on Neural Image Captioning | Dec 6, 2017 | Caption GenerationDecoder | CodeCode Available | 0 |
| Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens | Jun 19, 2024 | Caption Generationimage-classification | CodeCode Available | 0 |
| Efficient Urdu Caption Generation using Attention based LSTM | Aug 2, 2020 | Caption Generation | CodeCode Available | 0 |
| Event and Entity Extraction from Generated Video Captions | Nov 5, 2022 | Caption GenerationDense Video Captioning | CodeCode Available | 0 |
| Journalistic Guidelines Aware News Image Captioning | Sep 7, 2021 | Caption GenerationDescriptive | CodeCode Available | 0 |
| ViPE: Visualise Pretty-much Everything | Oct 16, 2023 | Caption GenerationFigurative Language Visualization | CodeCode Available | 0 |
| Sequence to Sequence -- Video to Text | May 3, 2015 | Caption GenerationLanguage Modeling | CodeCode Available | 0 |
| Memeify: A Large-Scale Meme Generation System | Oct 27, 2019 | Caption GenerationDecoder | CodeCode Available | 0 |
| 3D CoCa: Contrastive Learners are 3D Captioners | Apr 13, 2025 | 3D dense captioningCaption Generation | CodeCode Available | 0 |
| Image Captioning with Deep Bidirectional LSTMs | Apr 4, 2016 | Caption GenerationData Augmentation | CodeCode Available | 0 |
| Using Artificial Tokens to Control Languages for Multilingual Image Caption Generation | Jun 20, 2017 | Caption Generation | CodeCode Available | 0 |
| Mol2Lang-VLM: Vision- and Text-Guided Generative Pre-trained Language Models for Advancing Molecule Captioning through Multimodal Fusion | Aug 15, 2024 | Caption GenerationDecoder | CodeCode Available | 0 |
| Image Caption Generation for News Articles | Dec 1, 2020 | ArticlesCaption Generation | CodeCode Available | 0 |
| DSD: Dense-Sparse-Dense Training for Deep Neural Networks | Jul 15, 2016 | 8kCaption Generation | CodeCode Available | 0 |
| Multi-LLM Collaborative Caption Generation in Scientific Documents | Jan 5, 2025 | Caption GenerationImage to text | CodeCode Available | 0 |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT | Dec 3, 2023 | Caption GenerationDecoder | CodeCode Available | 0 |
| Where to put the Image in an Image Caption Generator | Mar 27, 2017 | Caption GenerationLanguage Modeling | CodeCode Available | 0 |
| SLAM-AAC: Enhancing Audio Captioning with Paraphrasing Augmentation and CLAP-Refine through LLMs | Oct 12, 2024 | AudioCapsAudio captioning | CodeCode Available | 0 |
| Multimodal Preference Data Synthetic Alignment with Reward Model | Dec 23, 2024 | 2kCaption Generation | CodeCode Available | 0 |
| Discriminability objective for training descriptive captions | Mar 12, 2018 | Caption GenerationDescriptive | CodeCode Available | 0 |
| An Empirical Study of Language CNN for Image Captioning | Dec 21, 2016 | Caption GenerationImage Captioning | CodeCode Available | 0 |
| Multi-source weak supervision for saliency detection | Apr 1, 2019 | Caption GenerationSaliency Detection | CodeCode Available | 0 |
| CNN Fixations: An unraveling approach to visualize the discriminative image regions | Aug 22, 2017 | Caption GenerationImage Captioning | CodeCode Available | 0 |
| Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning | Jun 15, 2024 | Caption Generation | CodeCode Available | 0 |
| Guiding Long-Short Term Memory for Image Caption Generation | Sep 16, 2015 | Caption Generation | CodeCode Available | 0 |
| Deterministic Non-Autoregressive Neural Sequence Modeling by Iterative Refinement | Feb 19, 2018 | Caption GenerationDenoising | CodeCode Available | 0 |
| DeepDiary: Automatic Caption Generation for Lifelogging Image Streams | Aug 12, 2016 | Caption GenerationImage Captioning | CodeCode Available | 0 |
| Global Object Proposals for Improving Multi-Sentence Video Descriptions | Jul 18, 2021 | Caption GenerationDense Video Captioning | CodeCode Available | 0 |
| CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter | Nov 30, 2021 | Caption GenerationRepresentation Learning | CodeCode Available | 0 |
| NICGSlowDown: Evaluating the Efficiency Robustness of Neural Image Caption Generation Models | Mar 29, 2022 | Caption Generation | CodeCode Available | 0 |
| Bivariate Beta-LSTM | May 25, 2019 | Caption GenerationDensity Estimation | CodeCode Available | 0 |
| Dual-path Collaborative Generation Network for Emotional Video Captioning | Aug 6, 2024 | Caption GenerationVideo Captioning | CodeCode Available | 0 |
| Bangla Image Caption Generation through CNN-Transformer based Encoder-Decoder Network | Oct 24, 2021 | Caption GenerationDecoder | CodeCode Available | 0 |
| Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image Captioning | Feb 4, 2023 | Caption GenerationCoherence Evaluation | CodeCode Available | 0 |
| Summaries as Captions: Generating Figure Captions for Scientific Documents with Automated Text Summarization | Feb 23, 2023 | Abstractive Text SummarizationCaption Generation | CodeCode Available | 0 |
| From Simple to Professional: A Combinatorial Controllable Image Captioning Agent | Dec 15, 2024 | Caption Generationcontrollable image captioning | CodeCode Available | 0 |
| FigCaps-HF: A Figure-to-Caption Generative Framework and Benchmark with Human Feedback | Jul 20, 2023 | Caption Generation | CodeCode Available | 0 |
| Cortico-cerebellar networks as decoupling neural interfaces | Oct 21, 2021 | Caption Generation | CodeCode Available | 0 |
| Pre-gen metrics: Predicting caption quality metrics without generating captions | Oct 12, 2018 | Caption Generation | CodeCode Available | 0 |
| R^3Net:Relation-embedded Representation Reconstruction Network for Change Captioning | Oct 20, 2021 | Caption GenerationRelation | CodeCode Available | 0 |
| Rˆ3Net:Relation-embedded Representation Reconstruction Network for Change Captioning | Nov 1, 2021 | Caption GenerationRelation | CodeCode Available | 0 |
| Exploring Models and Data for Remote Sensing Image Caption Generation | Dec 21, 2017 | Caption GenerationImage-to-Text Retrieval | CodeCode Available | 0 |
| Recurrent Neural Network Regularization | Sep 8, 2014 | Caption GenerationImage Captioning | CodeCode Available | 0 |
| Referring Expression Object Segmentation with Caption-Aware Consistency | Oct 10, 2019 | Caption GenerationObject | CodeCode Available | 0 |
| Regularizing RNNs for Caption Generation by Reconstructing The Past with The Present | Mar 30, 2018 | Caption GenerationDecoder | CodeCode Available | 0 |