SOTAVerified

Dense Captioning

Papers

Showing 41-50 of 69 papers

| Title | Status | Hype |
| --- | --- | --- |
| Dense Procedure Captioning in Narrated Instructional Videos | | 0 |
| D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding | | 0 |
| xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations | | 0 |
| Contextual Modeling for 3D Dense Captioning on Point Clouds | | 0 |
| Context and Attribute Grounded Dense Captioning | | 0 |
| Complete 3d relationships extraction modality alignment network for 3d dense captioning | | 0 |
| 3D Scene Graph Guided Vision-Language Pre-training | | 0 |
| CapOnImage: Context-driven Dense-Captioning on Image | | 0 |
| CapDet: Unifying Dense Captioning and Open-World Detection Pretraining | | 0 |
| YH Technologies at ActivityNet Challenge 2018 | | 0 |

Benchmark Results

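| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ControlCap | mAP | 18.2 | | Unverified |
| 2 | GRiT (ViT-B) | mAP | 15.5 | | Unverified |
| 3 | CAG-Net | mAP | 10.5 | | Unverified |
| 4 | FCLN | mAP | 5.4 | | Unverified |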