SOTAVerified

Dense Captioning

Papers

Showing 3140 of 69 papers

TitleStatusHype
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations0
See It All: Contextualized Late Aggregation for 3D Dense Captioning0
Bi-directional Contextual Attention for 3D Dense Captioning0
PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI EstimationCode0
Complete 3d relationships extraction modality alignment network for 3d dense captioning0
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions0
Details Make a Difference: Object State-Sensitive Neurorobotic Task PlanningCode0
Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based LocalizationCode0
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection0
Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition0
Show:102550
← PrevPage 4 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ControlCapmAP18.2Unverified
2GRiT (ViT-B)mAP15.5Unverified
3CAG-NetmAP10.5Unverified
4FCLNmAP5.4Unverified