SOTAVerified|Agents Browse Leaderboard About Blog

Visual Storytelling

( Image credit: No Metrics Are Perfect )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 41–50 of 115 papers

Title	Date	Tasks	Status	Hype	Score
Discourse Analysis for Evaluating Coherence in Video Paragraph Captions	Jan 17, 2022	Video CaptioningVisual Dialog	—Unverified	0	0
Bloom Library: Multimodal Datasets in 300+ Languages for a Variety of Downstream Tasks	Oct 26, 2022	Image CaptioningLanguage Modeling	—Unverified	0	0
DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention	Oct 28, 2022	Image CaptioningLanguage Modeling	—Unverified	0	0
DiffuVST: Narrating Fictional Scenes with Global-History-Guided Denoising Models	Dec 12, 2023	DenoisingDiversity	—Unverified	0	0
BERT-hLSTMs: BERT and Hierarchical LSTMs for Visual Storytelling	Dec 3, 2020	SentenceVisual Storytelling	—Unverified	0	0
Induction and Reference of Entities in a Visual Story	Sep 15, 2019	SentenceVisual Storytelling	—Unverified	0	0
Incorporating Textual Evidence in Visual Storytelling	Nov 21, 2019	Object RecognitionStory Generation	—Unverified	0	0
Improving Visual Storytelling with Multimodal Large Language Models	Jul 2, 2024	Visual Storytelling	—Unverified	0	0
DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description	Mar 31, 2025	Video DescriptionVideo Understanding	—Unverified	0	0
A System for Image Understanding using Sensemaking and Narrative	Jan 21, 2022	Visual Storytelling	—Unverified	0	0

Show:10 25 50

← PrevPage 5 of 12Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GLAC Net	METEOR	30.14	—	Unverified
2	HEGR	BLEU-4	16.7	—	Unverified
3	HBSG	BLEU-4	15.4	—	Unverified
4	IRW	BLEU-4	15.4	—	Unverified
5	CoVS	BLEU-4	15.2	—	Unverified
6	SGEmb	BLEU-4	14.8	—	Unverified
7	SentiStory	BLEU-4	14.8	—	Unverified
8	SGVST	BLEU-4	14.7	—	Unverified
9	INet	BLEU-4	14.7	—	Unverified
10	TAVST (RL)	BLEU-4	14.6	—	Unverified