SOTAVerified

Dense Captioning

Papers

Showing 2650 of 69 papers

TitleStatusHype
PerLA: Perceptive 3D Language AssistantCode1
3DJCG: A Unified Framework for Joint Dense Captioning and Visual Grounding on 3D Point Clouds0
3D Scene Graph Guided Vision-Language Pre-training0
3D Spatial Understanding in MLLMs: Disambiguation and Evaluation0
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes0
Activitynet 2019 Task 3: Exploring Contexts for Dense Captioning Events in Videos0
Best Vision Technologies Submission to ActivityNet Challenge 2018-Task: Dense-Captioning Events in Videos0
Bi-directional Contextual Attention for 3D Dense Captioning0
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining0
CapOnImage: Context-driven Dense-Captioning on Image0
Complete 3d relationships extraction modality alignment network for 3d dense captioning0
Context and Attribute Grounded Dense Captioning0
Contextual Modeling for 3D Dense Captioning on Point Clouds0
D3Net: A Unified Speaker-Listener Architecture for 3D Dense Captioning and Visual Grounding0
Dense Procedure Captioning in Narrated Instructional Videos0
Describing image focused in cognitive and visual details for visually impaired people: An approach to generating inclusive paragraphs0
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection0
Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs0
Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition0
FlexCap: Describe Anything in Images in Controllable Detail0
Fooling Vision and Language Models Despite Localization and Attention Mechanism0
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions0
Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving0
Improving Diversity and Reducing Redundancy in Paragraph Captions0
See It All: Contextualized Late Aggregation for 3D Dense Captioning0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ControlCapmAP18.2Unverified
2GRiT (ViT-B)mAP15.5Unverified
3CAG-NetmAP10.5Unverified
4FCLNmAP5.4Unverified