SOTAVerified

Visual Storytelling

(Image credit: No Metrics Are Perfect)

Papers

Showing 1-50 of 115 papers

Title | Status | Hype
Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion Models | Code | 3
FlipSketch: Flipping Static Drawings to Text-Guided Sketch Animations | Code | 3
Alfie: Democratising RGBA Image Generation With No $ | Code | 2
Animate-A-Story: Storytelling with Retrieval-Augmented Video Generation | Code | 2
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion Models | Code | 2
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and Generation | Code | 2
inkn'hue: Enhancing Manga Colorization from Multiple Priors with Alignment Multi-Encoder VAE | Code | 1
Gorgeous: Create Your Desired Character Facial Makeup from Any Ideas | Code | 1
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story Generation | Code | 1
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual Storytelling | Code | 1
TouchStone: Evaluating Vision-Language Models by Language Models | Code | 1
Plot and Rework: Modeling Storylines for Visual Storytelling | Code | 1
Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and Reasoning | Code | 1
Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic Models | Code | 1
Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks | - | 0
Discourse Analysis for Evaluating Coherence in Video Paragraph Captions | - | 0
Keep it Consistent: Topic-Aware Storytelling from an Image Stream via Iterative Multi-agent Communication | - | 0
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling | - | 0
A Hierarchical Approach for Visual Storytelling Using Image Description | - | 0
Action2Dialogue: Generating Character-Centric Narratives from Scene-Level Prompts | - | 0
Knowledge-enriched Attention Network with Group-wise Semantic for Visual Storytelling | - | 0
A survey on knowledge-enhanced multimodal learning | - | 0
DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description | - | 0
A System for Image Understanding using Sensemaking and Narrative | - | 0
DiffuVST: Narrating Fictional Scenes with Global-History-Guided Denoising Models | - | 0
DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention | - | 0
Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning | - | 0
Diverse and Relevant Visual Storytelling with Scene Graph Embeddings | - | 0
Dixit: Interactive Visual Storytelling via Term Manipulation | - | 0
Camera Trajectory Generation: A Comprehensive Survey of Methods, Metrics, and Future Directions | - | 0
A-CAP: Anticipation Captioning with Commonsense Knowledge | - | 0
A Pipeline for Creative Visual Storytelling | - | 0
Graph Similarities and Dual Approach for Sequential Text-to-Image Retrieval | - | 0
AesopAgent: Agent-driven Evolutionary System on Story-to-Video Production | - | 0
JarvisArt: Liberating Human Artistic Creativity via an Intelligent Photo Retouching Agent | - | 0
AOG-LSTM: An adaptive attention neural network for visual storytelling | - | 0
Generative Visual Communication in the Era of Vision-Language Models | - | 0
Generating Visual Stories with Grounded and Coreferent Characters | - | 0
Comics for Everyone: Generating Accessible Text Descriptions for Comic Strips | - | 0
Commonsense Knowledge Aware Concept Selection For Diverse and Informative Visual Storytelling | - | 0
Hide-and-Tell: Learning to Bridge Photo Streams for Visual Storytelling | - | 0
Hierarchically-Attentive RNN for Album Summarization and Storytelling | - | 0
Hierarchically Structured Reinforcement Learning for Topically Coherent Visual Story Generation | - | 0
Hierarchical memory decoder for visual narrating | - | 0
Hierarchical Photo-Scene Encoder for Album Storytelling | - | 0
Imagine, Reason and Write: Visual Storytelling with Graph Knowledge and Relational Reasoning | - | 0
Improving Visual Storytelling with Multimodal Large Language Models | - | 0
Incorporating Textual Evidence in Visual Storytelling | - | 0
Induction and Reference of Entities in a Visual Story | - | 0
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography | - | 0
Page 1 of 3

Benchmark Results

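# | Model | Metric | Claimed | Verified | Status
1 | GLAC Net | METEOR | 30.14 | - | Unverified
2 | HEGR | BLEU-4 | 16.7 | - | Unverified
3 | HBSG | BLEU-4 | 15.4 | - | Unverified
4 | IRW | BLEU-4 | 15.4 | - | Unverified
5 | CoVS | BLEU-4 | 15.2 | - | Unverified
6 | SGEmb | BLEU-4 | 14.8 | - | Unverified
7 | SentiStory | BLEU-4 | 14.8 | - | Unverified
8 | SGVST | BLEU-4 | 14.7 | - | Unverified
9 | INet | BLEU-4 | 14.7 | - | Unverified
10 | TAVST (RL) | BLEU-4 | 14.6 | - | Unverified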