Visual Storytelling

(Image credit: No Metrics Are Perfect)

Papers

Showing 101–115 of 115 papers

Title	Date	Tasks	Status	Hype	Score
A Comprehensive Survey and Guide to Multimodal Large Language Models in Vision-Language Tasks	Nov 9, 2024	Visual Storytelling	Unverified	0	0
DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description	Mar 31, 2025	Video Description, Video Understanding	Unverified	0	0
Ordered Attention for Coherent Visual Storytelling	Aug 4, 2021	Sentence, Visual Storytelling	Unverified	0	0
DiffuVST: Narrating Fictional Scenes with Global-History-Guided Denoising Models	Dec 12, 2023	Denoising, Diversity	Unverified	0	0
DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention	Oct 28, 2022	Image Captioning, Language Modeling	Unverified	0	0
Discourse Analysis for Evaluating Coherence in Video Paragraph Captions	Jan 17, 2022	Video Captioning, Visual Dialog	Unverified	0	0
Diverse and Relevant Visual Storytelling with Scene Graph Embeddings	Nov 1, 2020	Diversity, Story Generation	Unverified	0	0
Dixit: Interactive Visual Storytelling via Term Manipulation	Mar 6, 2019	Decoder, Visual Storytelling	Unverified	0	0
Towards Coherent Visual Storytelling with Ordered Image Attention	Nov 16, 2021	Position, Sentence	Unverified	0	0
Camera Trajectory Generation: A Comprehensive Survey of Methods, Metrics, and Future Directions	Jun 1, 2025	Visual Storytelling	Unverified	0	0
Toyteller: AI-powered Visual Storytelling Through Toy-Playing with Character Symbols	Jan 23, 2025	Motion Generation, Text Generation	Unverified	0	0
Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks	Oct 26, 2022	Image Captioning, Language Modeling	Unverified	0	0
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography	Apr 9, 2025	Visual Storytelling	Unverified	0	0
Generating Visual Stories with Grounded and Coreferent Characters	Sep 20, 2024	Story Generation, Visual Storytelling	Unverified	0	0
Generative Visual Communication in the Era of Vision-Language Models	Nov 27, 2024	Visual Storytelling	Unverified	0	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GLAC Net	METEOR	30.14	—	Unverified
2	HEGR	BLEU-4	16.7	—	Unverified
3	HBSG	BLEU-4	15.4	—	Unverified
4	IRW	BLEU-4	15.4	—	Unverified
5	CoVS	BLEU-4	15.2	—	Unverified
6	SGEmb	BLEU-4	14.8	—	Unverified
7	SentiStory	BLEU-4	14.8	—	Unverified
8	SGVST	BLEU-4	14.7	—	Unverified
9	INet	BLEU-4	14.7	—	Unverified
10	TAVST (RL)	BLEU-4	14.6	—	Unverified