SOTAVerified

Visual Storytelling

( Image credit: No Metrics Are Perfect )

Papers

Showing 125 of 115 papers

TitleStatusHype
Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion ModelsCode3
FlipSketch: Flipping Static Drawings to Text-Guided Sketch AnimationsCode3
Alfie: Democratising RGBA Image Generation With No $Code2
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and GenerationCode2
Intelligent Grimm -- Open-ended Visual Storytelling via Latent Diffusion ModelsCode2
Animate-A-Story: Storytelling with Retrieval-Augmented Video GenerationCode2
Gorgeous: Create Your Desired Character Facial Makeup from Any IdeasCode1
inkn'hue: Enhancing Manga Colorization from Multiple Priors with Alignment Multi-Encoder VAECode1
StoryReasoning Dataset: Using Chain-of-Thought for Scene Understanding and Grounded Story GenerationCode1
Positional Diffusion: Ordering Unordered Sets with Diffusion Probabilistic ModelsCode1
TouchStone: Evaluating Vision-Language Models by Language ModelsCode1
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual StorytellingCode1
Expressive Scene Graph Generation Using Commonsense Knowledge Infusion for Visual Understanding and ReasoningCode1
Plot and Rework: Modeling Storylines for Visual StorytellingCode1
Multimodal Large Language Models and Tunings: Vision, Language, Sensors, Audio, and BeyondCode0
No Metrics Are Perfect: Adversarial Reward Learning for Visual StorytellingCode0
Knowledge-Enriched Visual StorytellingCode0
Knowledgeable Storyteller: A Commonsense-Driven Generative Model for Visual StorytellingCode0
Learning to Rank Visual Stories From Human Ranking DataCode0
Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and RepetitionCode0
Contextualize, Show and Tell: A Neural Visual StorytellerCode0
GROOViST: A Metric for Grounding Objects in Visual StorytellingCode0
Informative Visual Storytelling with Cross-modal RulesCode0
Detecting and Grounding Important Characters in Visual StoriesCode0
Consistent Story Generation with Asymmetry Zigzag SamplingCode0
Show:102550
← PrevPage 1 of 5Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GLAC NetMETEOR30.14Unverified
2HEGRBLEU-416.7Unverified
3HBSGBLEU-415.4Unverified
4IRWBLEU-415.4Unverified
5CoVSBLEU-415.2Unverified
6SGEmbBLEU-414.8Unverified
7SentiStoryBLEU-414.8Unverified
8SGVSTBLEU-414.7Unverified
9INetBLEU-414.7Unverified
10TAVST (RL)BLEU-414.6Unverified