| Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning | Mar 18, 2024 | 3D Question Answering (3D-QA)Dense Captioning | —Unverified | 0 |
| FlexCap: Describe Anything in Images in Controllable Detail | Mar 18, 2024 | AttributeDense Captioning | —Unverified | 0 |
| A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes | Mar 12, 2024 | 3D dense captioningDense Captioning | —Unverified | 0 |
| IIITD-20K: Dense captioning for Text-Image ReID | May 8, 2023 | Dense Captioning | CodeCode Available | 0 |
| CapDet: Unifying Dense Captioning and Open-World Detection Pretraining | Mar 4, 2023 | Dense Captioning | —Unverified | 0 |
| UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding | Dec 1, 2022 | 3D dense captioning3D visual grounding | —Unverified | 0 |
| Contextual Modeling for 3D Dense Captioning on Point Clouds | Oct 8, 2022 | 3D dense captioningDense Captioning | —Unverified | 0 |
| SAVCHOI: Detecting Suspicious Activities using Dense Video Captioning with Human Object Interactions | Jul 24, 2022 | Dense CaptioningDense Video Captioning | —Unverified | 0 |
| CapOnImage: Context-driven Dense-Captioning on Image | Apr 27, 2022 | Dense CaptioningDiversity | —Unverified | 0 |
| Semantic-Aware Pretraining for Dense Video Captioning | Apr 13, 2022 | Dense CaptioningDense Video Captioning | —Unverified | 0 |