SOTAVerified

Caption Generation

Papers

Showing 11–20 of 310 papers

Title | Status | Hype
Positive-Augmented Contrastive Learning for Vision-and-Language Evaluation and Training | Code | 2
Fine-tuning Multimodal LLMs to Follow Zero-shot Demonstrative Instructions | Code | 2
AudioSetCaps: An Enriched Audio-Caption Dataset using Automated Generation Pipeline with Large Audio and Language Models | Code | 2
Frame- and Segment-Level Features and Candidate Pool Evaluation for Video Caption Generation | Code | 1
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance Segmentation | Code | 1
End-to-End Dense Video Captioning with Parallel Decoding | Code | 1
GL-RG: Global-Local Representation Granularity for Video Captioning | Code | 1
Belief Revision based Caption Re-ranker with Visual Semantic Information | Code | 1
Deep Reinforcement Learning For Sequence to Sequence Models | Code | 1
Distractors-Immune Representation Learning with Cross-modal Contrastive Regularization for Change Captioning | Code | 1

No leaderboard results yet.