SOTAVerified

Caption Generation

Papers

Showing 121130 of 310 papers

TitleStatusHype
Stacked Cross-modal Feature Consolidation Attention Networks for Image Captioning0
Transform, Contrast and Tell: Coherent Entity-Aware Multi-Image CaptioningCode0
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance SegmentationCode1
Uncertainty-Aware Image Captioning0
Retrieval-Augmented Multimodal Language Modeling0
Visual Commonsense-aware Representation Network for Video CaptioningCode1
Event and Entity Extraction from Generated Video CaptionsCode0
Image Caption Generation for Low-Resource Assamese Language0
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive PruningCode1
Generating image captions with external encyclopedic knowledge0
Show:102550
← PrevPage 13 of 31Next →

No leaderboard results yet.