| Dual-path Collaborative Generation Network for Emotional Video Captioning | Aug 6, 2024 | Caption Generation, Video Captioning | Code Available | 0 | 5 |
| Pre-gen metrics: Predicting caption quality metrics without generating captions | Oct 12, 2018 | Caption Generation | Code Available | 0 | 5 |
| R^3Net: Relation-embedded Representation Reconstruction Network for Change Captioning | Oct 20, 2021 | Caption Generation, Relation | Code Available | 0 | 5 |
| Multi-source weak supervision for saliency detection | Apr 1, 2019 | Caption Generation, Saliency Detection | Code Available | 0 | 5 |
| Bangla Image Caption Generation through CNN-Transformer based Encoder-Decoder Network | Oct 24, 2021 | Caption Generation, Decoder | Code Available | 0 | 5 |
| Multimodal Preference Data Synthetic Alignment with Reward Model | Dec 23, 2024 | 2k, Caption Generation | Code Available | 0 | 5 |
| Enhancing Cross-Prompt Transferability in Vision-Language Models through Contextual Injection of Target Tokens | Jun 19, 2024 | Caption Generation, image-classification | Code Available | 0 | 5 |
| Automatic Report Generation for Histopathology images using pre-trained Vision Transformers and BERT | Dec 3, 2023 | Caption Generation, Decoder | Code Available | 0 | 5 |
| An Actor-Critic Algorithm for Sequence Prediction | Jul 24, 2016 | Caption Generation, Machine Translation | Code Available | 0 | 5 |
| Multi-LLM Collaborative Caption Generation in Scientific Documents | Jan 5, 2025 | Caption Generation, Image to text | Code Available | 0 | 5 |
| AUTOMATED AUDIO CAPTIONING BY FINE-TUNING BART WITH AUDIOSET TAGS | Nov 15, 2021 | AudioCaps, Audio captioning | Code Available | 0 | 5 |
| Compositional Generalization in Image Captioning | Sep 10, 2019 | Caption Generation, Image Captioning | Code Available | 0 | 5 |
| Memeify: A Large-Scale Meme Generation System | Oct 27, 2019 | Caption Generation, Decoder | Code Available | 0 | 5 |
| Efficient Urdu Caption Generation using Attention based LSTM | Aug 2, 2020 | Caption Generation | Code Available | 0 | 5 |
| Continual Panoptic Perception: Towards Multi-modal Incremental Interpretation of Remote Sensing Images | Jul 19, 2024 | Caption Generation, Continual Learning | Code Available | 0 | 5 |
| Comparative evaluation of CNN architectures for Image Caption Generation | Feb 23, 2021 | Caption Generation, Object Recognition | Code Available | 0 | 5 |
| Local Information Assisted Attention-free Decoder for Audio Captioning | Jan 10, 2022 | Audio captioning, Caption Generation | Code Available | 0 | 5 |
| Controllable Video Captioning with POS Sequence Guidance Based on Gated Fusion Network | Aug 27, 2019 | Caption Generation, Decoder | Code Available | 0 | 5 |
| Evaluating and interpreting caption prediction for histopathology images | Jul 8, 2020 | Caption Generation, Image Captioning | Code Available | 0 | 5 |
| Mol2Lang-VLM: Vision- and Text-Guided Generative Pre-trained Language Models for Advancing Molecule Captioning through Multimodal Fusion | Aug 15, 2024 | Caption Generation, Decoder | Code Available | 0 | 5 |
| DSD: Dense-Sparse-Dense Training for Deep Neural Networks | Jul 15, 2016 | 8k, Caption Generation | Code Available | 0 | 5 |
| CNN Fixations: An unraveling approach to visualize the discriminative image regions | Aug 22, 2017 | Caption Generation, Image Captioning | Code Available | 0 | 5 |
| Journalistic Guidelines Aware News Image Captioning | Sep 7, 2021 | Caption Generation, Descriptive | Code Available | 0 | 5 |
| Humor in AI: Massive Scale Crowd-Sourced Preferences and Benchmarks for Cartoon Captioning | Jun 15, 2024 | Caption Generation | Code Available | 0 | 5 |
| CLIP Meets Video Captioning: Concept-Aware Representation Learning Does Matter | Nov 30, 2021 | Caption Generation, Representation Learning | Code Available | 0 | 5 |