SOTAVerified

Visual Storytelling

( Image credit: No Metrics Are Perfect )

Papers

Showing 2130 of 115 papers

TitleStatusHype
Generating Visual Stories with Grounded and Coreferent Characters0
Alfie: Democratising RGBA Image Generation With No $Code2
Semantic Alignment for Multimodal Large Language Models0
Story3D-Agent: Exploring 3D Storytelling Visualization with Large Language Models0
Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning0
Openstory++: A Large-scale Dataset and Benchmark for Instance-aware Open-domain Visual StorytellingCode1
ContextualStory: Consistent Visual Storytelling with Spatially-Enhanced and Storyline ContextCode0
Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and RepetitionCode0
Improving Visual Storytelling with Multimodal Large Language Models0
CoMM: A Coherent Interleaved Image-Text Dataset for Multimodal Understanding and GenerationCode2
Show:102550
← PrevPage 3 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GLAC NetMETEOR30.14Unverified
2HEGRBLEU-416.7Unverified
3HBSGBLEU-415.4Unverified
4IRWBLEU-415.4Unverified
5CoVSBLEU-415.2Unverified
6SGEmbBLEU-414.8Unverified
7SentiStoryBLEU-414.8Unverified
8SGVSTBLEU-414.7Unverified
9INetBLEU-414.7Unverified
10TAVST (RL)BLEU-414.6Unverified