| Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning | Mar 18, 2024 | 3D Question Answering (3D-QA), Dense Captioning | —Unverified | 0 |
| Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization | Apr 17, 2024 | 3D dense captioning, 3D visual grounding | Code Available | 0 |
| Joint Event Detection and Description in Continuous Video Streams | Feb 28, 2018 | Dense Captioning, Dense Video Captioning | Code Available | 0 |
| DenseCap: Fully Convolutional Localization Networks for Dense Captioning | Nov 24, 2015 | Dense Captioning, Image Captioning | Code Available | 0 |
| IIITD-20K: Dense captioning for Text-Image ReID | May 8, 2023 | Dense Captioning | Code Available | 0 |
| Details Make a Difference: Object State-Sensitive Neurorobotic Task Planning | Jun 14, 2024 | Dense Captioning, Object | Code Available | 0 |
| Dense Captioning with Joint Inference and Visual Context | Nov 21, 2016 | Dense Captioning, Descriptive | Code Available | 0 |
| A Hierarchical Approach for Generating Descriptive Image Paragraphs | Nov 20, 2016 | Dense Captioning, Descriptive | Code Available | 0 |
| PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation | Aug 7, 2024 | Decoder, Dense Captioning | Code Available | 0 |