| AOG-LSTM: An adaptive attention neural network for visual storytelling | Jun 26, 2023 | DecoderVisual Storytelling | —Unverified | 0 |
| Visual Transformation Telling | May 3, 2023 | Dense Video CaptioningVideo Captioning | CodeCode Available | 0 |
| Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings | May 3, 2023 | Data AugmentationQuestion Answering | —Unverified | 0 |
| A-CAP: Anticipation Captioning with Commonsense Knowledge | Apr 13, 2023 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Detecting and Grounding Important Characters in Visual Stories | Mar 30, 2023 | Visual Storytelling | CodeCode Available | 0 |
| Visual Writing Prompts: Character-Grounded Story Generation with Curated Image Sequences | Jan 20, 2023 | Coherence EvaluationGrounded language learning | —Unverified | 0 |
| A survey on knowledge-enhanced multimodal learning | Nov 19, 2022 | Conditional Image GenerationFactual Visual Question Answering | —Unverified | 0 |
| DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention | Oct 28, 2022 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks | Oct 26, 2022 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| Vision Transformer Based Model for Describing a Set of Images as a Story | Oct 6, 2022 | Language ModellingSentence | —Unverified | 0 |
| Coherent Visual Storytelling via Parallel Top-Down Visual and Topic Attention | Aug 17, 2022 | DiversitySentence | —Unverified | 0 |
| RoViST: Learning Robust Metrics for Visual Storytelling | Jul 1, 2022 | SentenceText Generation | CodeCode Available | 0 |
| SentiStory: A Multi-Layered Sentiment-Aware Generative Model for Visual Storytelling | Jun 16, 2022 | Visual Storytelling | —Unverified | 0 |
| RoViST:Learning Robust Metrics for Visual Storytelling | May 8, 2022 | SentenceText Generation | CodeCode Available | 0 |
| Learning to Rank Visual Stories From Human Ranking Data | May 1, 2022 | Learning-To-RankText Generation | CodeCode Available | 0 |
| Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling | Mar 10, 2022 | DecoderStory Generation | —Unverified | 0 |
| A System for Image Understanding using Sensemaking and Narrative | Jan 21, 2022 | Visual Storytelling | —Unverified | 0 |
| Discourse Analysis for Evaluating Coherence in Video Paragraph Captions | Jan 17, 2022 | Video CaptioningVisual Dialog | —Unverified | 0 |
| Visual Storytelling with Hierarchical BERT Semantic Guidance | Jan 10, 2022 | SentenceText Generation | —Unverified | 0 |
| RoViST: Learning Robust Metrics for Visual Storytelling | Dec 17, 2021 | SentenceText Generation | —Unverified | 0 |
| Learning to Rank Visual Stories From Human Ranking Data | Nov 16, 2021 | Learning-To-RankText Generation | —Unverified | 0 |
| Towards Coherent Visual Storytelling with Ordered Image Attention | Nov 16, 2021 | PositionSentence | —Unverified | 0 |
| Graph Similarities and Dual Approach for Sequential Text-to-Image Retrieval | Sep 29, 2021 | Graph EmbeddingImage Retrieval | —Unverified | 0 |
| Ordered Attention for Coherent Visual Storytelling | Aug 4, 2021 | SentenceVisual Storytelling | —Unverified | 0 |
| Stretch-VST: Getting Flexible With Visual Stories | Aug 1, 2021 | SentenceVisual Storytelling | —Unverified | 0 |
| Two Heads are Better Than One: Hypergraph-Enhanced Graph Reasoning for Visual Event Ratiocination | Jul 18, 2021 | Visual Storytelling | —Unverified | 0 |
| Transitional Adaptation of Pretrained Models for Visual Storytelling | Jun 19, 2021 | Image CaptioningLanguage Modelling | —Unverified | 0 |
| Imagine, Reason and Write: Visual Storytelling with Graph Knowledge and Relational Reasoning | May 18, 2021 | DiversityInformativeness | —Unverified | 0 |
| Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling | Feb 5, 2021 | DiversityInformativeness | —Unverified | 0 |
| AESOP: Abstract Encoding of Stories, Objects, and Pictures | Jan 1, 2021 | Story CompletionVisual Storytelling | CodeCode Available | 0 |
| BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling | Dec 3, 2020 | SentenceVisual Storytelling | —Unverified | 0 |
| Diverse and Relevant Visual Storytelling with Scene Graph Embeddings | Nov 1, 2020 | DiversityStory Generation | —Unverified | 0 |
| Reading Between the Lines: Exploring Infilling in Visual Narratives | Oct 26, 2020 | Visual Storytelling | —Unverified | 0 |
| Hierarchical memory decoder for visual narrating | Sep 1, 2020 | DecoderImage Captioning | —Unverified | 0 |
| Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling | Aug 11, 2020 | Meta-LearningVisual Storytelling | —Unverified | 0 |
| Storytelling from an Image Stream Using Scene Graphs | Apr 3, 2020 | Story GenerationVisual Storytelling | —Unverified | 0 |
| Hide-and-Tell: Learning to Bridge Photo Streams for Visual Storytelling | Feb 3, 2020 | Image CaptioningVisual Storytelling | —Unverified | 0 |
| Visual Storytelling via Predicting Anchor Word Embeddings in the Stories | Jan 13, 2020 | Visual StorytellingWord Embeddings | —Unverified | 0 |
| Knowledge-Enriched Visual Storytelling | Dec 3, 2019 | Knowledge GraphsStory Generation | CodeCode Available | 0 |
| Incorporating Textual Evidence in Visual Storytelling | Nov 21, 2019 | Object RecognitionStory Generation | —Unverified | 0 |
| Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication | Nov 11, 2019 | Image CaptioningQuestion Generation | —Unverified | 0 |
| A Hierarchical Approach for Visual Storytelling Using Image Description | Sep 26, 2019 | DecoderImage Description | —Unverified | 0 |
| Character-Centric Storytelling | Sep 17, 2019 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Induction and Reference of Entities in a Visual Story | Sep 15, 2019 | SentenceVisual Storytelling | —Unverified | 0 |
| What Makes A Good Story? Designing Composite Rewards for Visual Storytelling | Sep 11, 2019 | Reinforcement LearningVisual Storytelling | CodeCode Available | 0 |
| ``My Way of Telling a Story'': Persona based Grounded Story Generation | Aug 1, 2019 | DecoderStory Generation | —Unverified | 0 |
| Informative Visual Storytelling with Cross-modal Rules | Jul 7, 2019 | DecoderStory Generation | CodeCode Available | 0 |
| "My Way of Telling a Story": Persona based Grounded Story Generation | Jun 14, 2019 | DecoderStory Generation | —Unverified | 0 |
| Visual Story Post-Editing | Jun 5, 2019 | Visual Storytelling | CodeCode Available | 0 |
| Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling | May 4, 2019 | AI AgentKnowledge Graphs | CodeCode Available | 0 |