SOTAVerified

Caption Generation

Papers

Showing 1120 of 310 papers

TitleStatusHype
Fine-grained Image Captioning with CLIP RewardCode2
AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language ModelsCode2
PPLLaVA: Varied Video Sequence Understanding With Prompt GuidanceCode2
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption GenerationCode1
Conceptual 12M: Pushing Web-Scale Image-Text Pre-Training To Recognize Long-Tail Visual ConceptsCode1
End-to-End Dense Video Captioning with Parallel DecodingCode1
Connecting What to Say With Where to Look by Modeling Human Attention TracesCode1
GL-RG: Global-Local Representation Granularity for Video CaptioningCode1
Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change CaptioningCode1
Belief Revision based Caption Re-ranker with Visual Semantic InformationCode1
Show:102550
← PrevPage 2 of 31Next →

No leaderboard results yet.