SOTAVerified

Dense Captioning

Papers

Showing 51–69 of 69 papers

| Title | Status | Hype |
| --- | --- | --- |
| Semantic-Aware Pretraining for Dense Video Captioning | | 0 |
| Team RUC_AIM3 Technical Report at Activitynet 2020 Task 2: Exploring Sequential Events Detection for Dense Video Captioning | | 0 |
| Trimmed Action Recognition, Dense-Captioning Events in Videos, and Spatio-temporal Action Localization with Focus on ActivityNet Challenge 2019 | | 0 |
| UniT3D: A Unified Transformer for 3D Dense Captioning and Visual Grounding | | 0 |
| Visually Grounded Word Embeddings and Richer Visual Features for Improving Multimodal Neural Machine Translation | | 0 |
| xGen-VideoSyn-1: High-fidelity Text-to-Video Synthesis with Compressed Representations | | 0 |
| YH Technologies at ActivityNet Challenge 2018 | | 0 |
| RUC+CMU: System Report for Dense Captioning Events in Videos | | 0 |
| SAVCHOI: Detecting Suspicious Activities using Dense Video Captioning with Human Object Interactions | | 0 |
| Scan2Cap: Context-aware Dense Captioning in RGB-D Scans | | 0 |
| Scene-LLM: Extending Language Model for 3D Visual Understanding and Reasoning | | 0 |
| Rethinking 3D Dense Caption and Visual Grounding in A Unified Framework through Prompt-based Localization | Code | 0 |
| Joint Event Detection and Description in Continuous Video Streams | Code | 0 |
| DenseCap: Fully Convolutional Localization Networks for Dense Captioning | Code | 0 |
| IIITD-20K: Dense captioning for Text-Image ReID | Code | 0 |
| Details Make a Difference: Object State-Sensitive Neurorobotic Task Planning | Code | 0 |
| Dense Captioning with Joint Inference and Visual Context | Code | 0 |
| A Hierarchical Approach for Generating Descriptive Image Paragraphs | Code | 0 |
| PaveCap: The First Multimodal Framework for Comprehensive Pavement Condition Assessment with Dense Captioning and PCI Estimation | Code | 0 |

Benchmark Results

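| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | ControlCap | mAP | 18.2 | | Unverified |
| 2 | GRiT (ViT-B) | mAP | 15.5 | | Unverified |
| 3 | CAG-Net | mAP | 10.5 | | Unverified |
| 4 | FCLN | mAP | 5.4 | | Unverified |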