| Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication | Nov 11, 2019 | Image CaptioningQuestion Generation | —Unverified | 0 |
| Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling | Mar 10, 2022 | DecoderStory Generation | —Unverified | 0 |
| Learning to Rank Visual Stories From Human Ranking Data | Nov 16, 2021 | Learning-To-RankText Generation | —Unverified | 0 |
| LoRAShop: Training-Free Multi-Concept Image Generation and Editing with Rectified Flow Transformers | May 29, 2025 | DenoisingImage Generation | —Unverified | 0 |
| MagicScroll: Nontypical Aspect-Ratio Image Generation for Visual Storytelling via Multi-Layered Semantic-Aware Denoising | Dec 18, 2023 | DenoisingImage Generation | —Unverified | 0 |
| Metamorpheus: Interactive, Affective, and Creative Dream Narration Through Metaphorical Visual Storytelling | Mar 1, 2024 | ARCVisual Storytelling | —Unverified | 0 |
| MIRAGE: Multimodal Immersive Reasoning and Guided Exploration for Red-Team Jailbreak Attacks | Mar 24, 2025 | Visual Storytelling | —Unverified | 0 |
| "My Way of Telling a Story": Persona based Grounded Story Generation | Jun 14, 2019 | DecoderStory Generation | —Unverified | 0 |
| ``My Way of Telling a Story'': Persona based Grounded Story Generation | Aug 1, 2019 | DecoderStory Generation | —Unverified | 0 |
| Neural Event Extraction from Movies Description | Jun 1, 2018 | Event ExtractionMachine Translation | —Unverified | 0 |
| On How Users Edit Computer-Generated Visual Stories | Feb 22, 2019 | ArticlesDiversity | —Unverified | 0 |
| Reading Between the Lines: Exploring Infilling in Visual Narratives | Oct 26, 2020 | Visual Storytelling | —Unverified | 0 |
| RoViST: Learning Robust Metrics for Visual Storytelling | Dec 17, 2021 | SentenceText Generation | —Unverified | 0 |
| SCO-VIST: Social Interaction Commonsense Knowledge-based Visual Storytelling | Feb 1, 2024 | DiversityImage Captioning | —Unverified | 0 |
| Semantic Alignment for Multimodal Large Language Models | Aug 23, 2024 | Large Language ModelVisual Storytelling | —Unverified | 0 |
| SentiStory: A Multi-Layered Sentiment-Aware Generative Model for Visual Storytelling | Jun 16, 2022 | Visual Storytelling | —Unverified | 0 |
| Shape2Animal: Creative Animal Generation from Natural Silhouettes | Jun 25, 2025 | Visual Storytelling | —Unverified | 0 |
| Informative Visual Storytelling with Cross-modal Rules | Jul 7, 2019 | DecoderStory Generation | CodeCode Available | 0 |
| GROOViST: A Metric for Grounding Objects in Visual Storytelling | Oct 26, 2023 | Visual GroundingVisual Storytelling | CodeCode Available | 0 |
| GLAC Net: GLocal Attention Cascading Networks for Multi-image Cued Story Generation | May 28, 2018 | SentenceStory Generation | CodeCode Available | 0 |
| FLIP Reasoning Challenge | Apr 16, 2025 | Common Sense Reasoningimage-classification | CodeCode Available | 0 |
| Envisioning Narrative Intelligence: A Creative Visual Storytelling Anthology | Oct 6, 2023 | Story GenerationVisual Storytelling | CodeCode Available | 0 |
| Detecting and Grounding Important Characters in Visual Stories | Mar 30, 2023 | Visual Storytelling | CodeCode Available | 0 |
| Contextualize, Show and Tell: A Neural Visual Storyteller | Jun 3, 2018 | DecoderImage Description | CodeCode Available | 0 |
| Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual Storytelling | May 4, 2019 | AI AgentKnowledge Graphs | CodeCode Available | 0 |