| Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models | Aug 21, 2024 | Logical Reasoning, Motion Synthesis | —Unverified | 0 |
| Storybooth: Training-free Multi-Subject Consistency for Improved Visual Storytelling | Apr 8, 2025 | Image Generation, Text to Image Generation | —Unverified | 0 |
| Storytelling from an Image Stream Using Scene Graphs | Apr 3, 2020 | Story Generation, Visual Storytelling | —Unverified | 0 |
| Storytelling of Photo Stream with Bidirectional Multi-thread Recurrent Neural Network | Jun 2, 2016 | Video Captioning, Visual Storytelling | —Unverified | 0 |
| Stretch-VST: Getting Flexible With Visual Stories | Aug 1, 2021 | Sentence, Visual Storytelling | —Unverified | 0 |
| TARN-VIST: Topic Aware Reinforcement Network for Visual Storytelling | Mar 18, 2024 | Image Captioning, Visual Storytelling | —Unverified | 0 |
| Text-Only Training for Visual Storytelling | Aug 17, 2023 | Diversity, Informativeness | —Unverified | 0 |
| The Steep Road to Happily Ever After: An Analysis of Current Visual Storytelling Models | Apr 6, 2019 | Survey, Visual Storytelling | —Unverified | 0 |
| Topic Adaptation and Prototype Encoding for Few-Shot Visual Storytelling | Aug 11, 2020 | Meta-Learning, Visual Storytelling | —Unverified | 0 |
| Ordered Attention for Coherent Visual Storytelling | Aug 4, 2021 | Sentence, Visual Storytelling | —Unverified | 0 |
| Towards Coherent Visual Storytelling with Ordered Image Attention | Nov 16, 2021 | Position, Sentence | —Unverified | 0 |
| Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols | Jan 23, 2025 | Motion Generation, Text Generation | —Unverified | 0 |
| Transitional Adaptation of Pretrained Models for Visual Storytelling | Jun 19, 2021 | Image Captioning, Language Modelling | —Unverified | 0 |
| Two Heads are Better Than One: Hypergraph-Enhanced Graph Reasoning for Visual Event Ratiocination | Jul 18, 2021 | Visual Storytelling | —Unverified | 0 |
| Using Inter-Sentence Diverse Beam Search to Reduce Redundancy in Visual Storytelling | May 30, 2018 | Image to text, Sentence | —Unverified | 0 |
| VinaBench: Benchmark for Faithful and Consistent Visual Narratives | Mar 26, 2025 | Visual Storytelling | —Unverified | 0 |
| Vision Transformer Based Model for Describing a Set of Images as a Story | Oct 6, 2022 | Language Modelling, Sentence | —Unverified | 0 |
| VIST-GPT: Ushering in the Era of Visual Storytelling with LLMs? | Apr 27, 2025 | Visual Grounding, Visual Storytelling | —Unverified | 0 |
| Visual Chain of Thought: Bridging Logical Gaps with Multimodal Infillings | May 3, 2023 | Data Augmentation, Question Answering | —Unverified | 0 |
| Visual Storytelling via Predicting Anchor Word Embeddings in the Stories | Jan 13, 2020 | Visual Storytelling, Word Embeddings | —Unverified | 0 |
| Visual Storytelling with Hierarchical BERT Semantic Guidance | Jan 10, 2022 | Sentence, Text Generation | —Unverified | 0 |
| Visual Storytelling with Question-Answer Plans | Oct 8, 2023 | Visual Storytelling | —Unverified | 0 |
| Visual Writing Prompts: Character-Grounded Story Generation with Curated Image Sequences | Jan 20, 2023 | Coherence Evaluation, Grounded language learning | —Unverified | 0 |
| JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent | Jun 21, 2025 | Instruction Following, Large Language Model | —Unverified | 0 |
| KAHANI: Culturally-Nuanced Visual Storytelling Pipeline for Non-Western Cultures | Oct 25, 2024 | Story Generation, Visual Storytelling | —Unverified | 0 |