SOTAVerified

Dense Captioning

Papers

Showing 1120 of 69 papers

TitleStatusHype
ComiCap: A VLMs pipeline for dense captioning of Comic PanelsCode1
Integrating Visuospatial, Linguistic, and Commonsense Structure into Story VisualizationCode1
End-to-End 3D Dense Captioning with Vote2Cap-DETRCode1
Integrating Visuospatial, Linguistic and Commonsense Structure into Story VisualizationCode1
MORE: Multi-Order RElation Mining for Dense Captioning in 3D ScenesCode1
Dense-Captioning Events in VideosCode1
Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense CaptionerCode1
Context-Aware Alignment and Mutual Masking for 3D-Language Pre-TrainingCode1
3D Vision and Language Pretraining with Large-Scale Synthetic DataCode1
Dense-Captioning Events in Videos: SYSU Submission to ActivityNet Challenge 2020Code1
Show:102550
← PrevPage 2 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ControlCapmAP18.2Unverified
2GRiT (ViT-B)mAP15.5Unverified
3CAG-NetmAP10.5Unverified
4FCLNmAP5.4Unverified