SOTAVerified

Dense Captioning

Papers

Showing 21-30 of 69 papers

| Title | Status | Hype |
| --- | --- | --- |
| Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training | Code | 1 |
| Dense-Captioning Events in Videos: SYSU Submission to ActivityNet Challenge 2020 | Code | 1 |
| Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner | Code | 1 |
| Integrating Visuospatial, Linguistic and Commonsense Structure into Story Visualization | Code | 1 |
| PerLA: Perceptive 3D Language Assistant | Code | 1 |
| TEMPURA: Temporal Event Masked Prediction and Understanding for Reasoning in Action | Code | 1 |
| Complete 3d relationships extraction modality alignment network for 3d dense captioning | | 0 |
| Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs | | 0 |
| DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection | | 0 |
| A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes | | 0 |

Benchmark Results

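| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ControlCap | mAP | 18.2 | | Unverified |
| 2 | GRiT (ViT-B) | mAP | 15.5 | | Unverified |
| 3 | CAG-Net | mAP | 10.5 | | Unverified |
| 4 | FCLN | mAP | 5.4 | | Unverified |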