SOTAVerified

Dense Captioning

Papers

Showing 41-50 of 69 papers

Title | Status | Hype
Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning | - | 0
FlexCap: Describe Anything in Images in Controllable Detail | - | 0
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes | - | 0
IIITD-20K: Dense captioning for Text-Image ReID | Code | 0
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining | - | 0
UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding | - | 0
Contextual Modeling for 3D Dense Captioning on Point Clouds | - | 0
SAVCHOI: Detecting Suspicious Activities using Dense Video Captioning with Human Object Interactions | - | 0
CapOnImage: Context-driven Dense-Captioning on Image | - | 0
Semantic-Aware Pretraining for Dense Video Captioning | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ControlCap | mAP | 18.2 | - | Unverified
2 | GRiT (ViT-B) | mAP | 15.5 | - | Unverified
3 | CAG-Net | mAP | 10.5 | - | Unverified
4 | FCLN | mAP | 5.4 | - | Unverified