SOTAVerified

Dense Captioning

Papers

Showing 2650 of 69 papers

TitleStatusHype
Dense-Captioning Events in VideosCode1
Does Your 3D Encoder Really Work? When Pretrain-SFT from 2D VLMs Meets 3D VLMs0
3D Spatial Understanding in MLLMs: Disambiguation and Evaluation0
3D Scene Graph Guided Vision-Language Pre-training0
Hint-AD: Holistically Aligned Interpretability in End-to-End Autonomous Driving0
xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations0
See It All: Contextualized Late Aggregation for 3D Dense Captioning0
Bi-directional Contextual Attention for 3D Dense Captioning0
PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI EstimationCode0
Complete 3d relationships extraction modality alignment network for 3d dense captioning0
Graph-Based Captioning: Enhancing Visual Descriptions by Interconnecting Region Captions0
Details Make a Difference: Object State-Sensitive Neurorobotic Task PlanningCode0
Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based LocalizationCode0
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection0
Entity6K: A Large Open-Domain Evaluation Dataset for Real-World Entity Recognition0
Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning0
FlexCap: Describe Anything in Images in Controllable Detail0
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes0
IIITD-20K: Dense captioning for Text-Image ReIDCode0
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining0
UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding0
Contextual Modeling for 3D Dense Captioning on Point Clouds0
SAVCHOI: Detecting Suspicious Activities using Dense Video Captioning with Human Object Interactions0
CapOnImage: Context-driven Dense-Captioning on Image0
Semantic-Aware Pretraining for Dense Video Captioning0
Show:102550
← PrevPage 2 of 3Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1ControlCapmAP18.2Unverified
2GRiT (ViT-B)mAP15.5Unverified
3CAG-NetmAP10.5Unverified
4FCLNmAP5.4Unverified