SOTAVerified

Dense Captioning

Papers

Showing 61–69 of 69 papers

| Title | Status | Hype |
| --- | --- | --- |
| Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning | | 0 |
| Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization | Code | 0 |
| Joint Event Detection and Description in Continuous Video Streams | Code | 0 |
| DenseCap: Fully Convolutional Localization Networks for Dense Captioning | Code | 0 |
| IIITD-20K: Dense captioning for Text-Image ReID | Code | 0 |
| Details Make a Difference: Object State-Sensitive Neurorobotic Task Planning | Code | 0 |
| Dense Captioning with Joint Inference and Visual Context | Code | 0 |
| A Hierarchical Approach for Generating Descriptive Image Paragraphs | Code | 0 |
| PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation | Code | 0 |

Benchmark Results

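| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ControlCap | mAP | 18.2 | | Unverified |
| 2 | GRiT (ViT-B) | mAP | 15.5 | | Unverified |
| 3 | CAG-Net | mAP | 10.5 | | Unverified |
| 4 | FCLN | mAP | 5.4 | | Unverified |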