| ComiCap: A VLMs pipeline for dense captioning of Comic Panels | Sep 24, 2024 | Attribute, Dense Captioning | Code Available | 1 |
| PerLA: Perceptive 3D Language Assistant | Nov 29, 2024 | Dense Captioning, Graph Neural Network | Code Available | 1 |
| 3D Vision and Language Pretraining with Large-Scale Synthetic Data | Jul 8, 2024 | Dense Captioning, Diversity | Code Available | 1 |
| MORE: Multi-Order RElation Mining for Dense Captioning in 3D Scenes | Mar 10, 2022 | 3D dense captioning, Dense Captioning | Code Available | 1 |
| Integrating Visuospatial, Linguistic and Commonsense Structure into Story Visualization | Oct 21, 2021 | Dense Captioning, Image Generation | Code Available | 1 |
| Dense-Captioning Events in Videos: SYSU Submission to ActivityNet Challenge 2020 | Jun 21, 2020 | Dense Captioning, Dense Video Captioning | Code Available | 1 |
| End-to-End 3D Dense Captioning with Vote2Cap-DETR | Jan 6, 2023 | 3D dense captioning, Decoder | Code Available | 1 |
| Context-Aware Alignment and Mutual Masking for 3D-Language Pre-Training | Jan 1, 2023 | 3D dense captioning, 3D visual grounding | Code Available | 1 |
| Dense-Captioning Events in Videos | May 2, 2017 | Dense Captioning, Retrieval | Code Available | 1 |
| Enhancing Vision-Language Pre-Training with Jointly Learned Questioner and Dense Captioner | May 19, 2023 | Dense Captioning, Image Captioning | Code Available | 1 |