| IIITD-20K: Dense captioning for Text-Image ReID | May 8, 2023 | Dense Captioning | CodeCode Available | 0 | 5 |
| Joint Event Detection and Description in Continuous Video Streams | Feb 28, 2018 | Dense CaptioningDense Video Captioning | CodeCode Available | 0 | 5 |
| PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation | Aug 7, 2024 | DecoderDense Captioning | CodeCode Available | 0 | 5 |
| Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization | Apr 17, 2024 | 3D dense captioning3D visual grounding | CodeCode Available | 0 | 5 |
| Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions | Jul 9, 2024 | Dense Captioningobject-detection | —Unverified | 0 | 0 |
| DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection | Apr 14, 2024 | Dense CaptioningLanguage Modelling | —Unverified | 0 | 0 |
| Describing image focused in cognitive and visual details for visually impaired people: An approach to generating inclusive paragraphs | Feb 10, 2022 | Dense CaptioningImage Captioning | —Unverified | 0 | 0 |
| Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving | Sep 10, 2024 | 3D dense captioningAutonomous Driving | —Unverified | 0 | 0 |
| 3D Spatial Understanding in MLLMs: Disambiguation and Evaluation | Dec 9, 2024 | 3D dense captioning3D visual grounding | —Unverified | 0 | 0 |
| Improving Diversity and Reducing Redundancy in Paragraph Captions | Jul 19, 2020 | DecoderDense Captioning | —Unverified | 0 | 0 |