SOTAVerified

Visual Storytelling

( Image credit: No Metrics Are Perfect )

Papers

Showing 2130 of 115 papers

TitleStatusHype
GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography0
A survey on knowledge-enhanced multimodal learning0
DANTE-AD: Dual-Vision Attention Network for Long-Term Audio Description0
A System for Image Understanding using Sensemaking and Narrative0
DiffuVST: Narrating Fictional Scenes with Global-History-Guided Denoising Models0
DiMBERT: Learning Vision-Language Grounded Representations with Disentangled Multimodal-Attention0
Context-aware Visual Storytelling with Visual Prefix Tuning and Contrastive Learning0
Diverse and Relevant Visual Storytelling with Scene Graph Embeddings0
Dixit: Interactive Visual Storytelling via Term Manipulation0
A-CAP: Anticipation Captioning with Commonsense Knowledge0
Show:102550
← PrevPage 3 of 12Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GLAC NetMETEOR30.14Unverified
2HEGRBLEU-416.7Unverified
3HBSGBLEU-415.4Unverified
4IRWBLEU-415.4Unverified
5CoVSBLEU-415.2Unverified
6SGEmbBLEU-414.8Unverified
7SentiStoryBLEU-414.8Unverified
8SGVSTBLEU-414.7Unverified
9INetBLEU-414.7Unverified
10TAVST (RL)BLEU-414.6Unverified