SOTAVerified

Dense Captioning

Papers

Showing 1120 of 69 papers

TitleStatusHype
Bi-directional Contextual Attention for 3D Dense Captioning0
PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI EstimationCode0
Complete 3d relationships extraction modality alignment network for 3d dense captioning0
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions0
3D Vision and Language Pretraining with Large-Scale Synthetic DataCode1
Details Make a Difference: Object State-Sensitive Neurorobotic Task PlanningCode0
Grounded 3D-LLM with Referent TokensCode2
PLLaVA : Parameter-free LLaVA Extension from Images to Videos for Video Dense CaptioningCode4
Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based LocalizationCode0
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection0
Show:102550
← PrevPage 2 of 7Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ControlCapmAP18.2Unverified
2GRiT (ViT-B)mAP15.5Unverified
3CAG-NetmAP10.5Unverified
4FCLNmAP5.4Unverified