SOTAVerified

Caption Generation

Papers

Showing 3140 of 310 papers

TitleStatusHype
Large-scale Pre-training for Grounded Video Caption GenerationCode1
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption GenerationCode1
End-to-End Dense Video Captioning with Parallel DecodingCode1
Connecting What to Say With Where to Look by Modeling Human Attention TracesCode1
Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change CaptioningCode1
EfficientVLM: Fast and Accurate Vision-Language Models via Knowledge Distillation and Modal-adaptive PruningCode1
Croc: Pretraining Large Multimodal Models with Cross-Modal ComprehensionCode1
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual ConceptsCode1
Grad-CAM++: Improved Visual Explanations for Deep Convolutional NetworksCode1
Controllable Video Captioning with an Exemplar SentenceCode1
Show:102550
← PrevPage 4 of 31Next →

No leaderboard results yet.