SOTAVerified

Caption Generation

Papers

Showing 5160 of 310 papers

TitleStatusHype
NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph EnrichmentCode1
Croc: Pretraining Large Multimodal Models with Cross-Modal ComprehensionCode1
Betrayed by Captions: Joint Caption Grounding and Generation for Open Vocabulary Instance SegmentationCode1
Synthio: Augmenting Small-Scale Audio Classification Datasets with Synthetic DataCode1
Human-like Controllable Image Captioning with Verb-specific Semantic RolesCode1
HCQA @ Ego4D EgoSchema Challenge 2024Code1
Will Pre-Training Ever End? A First Step Toward Next-Generation Foundation MLLMs via Self-Improving Systematic CognitionCode1
Transferable Decoding with Visual Entities for Zero-Shot Image CaptioningCode1
Improving Image Captioning by Leveraging Intra- and Inter-layer Global Representation in Transformer NetworkCode1
Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense CaptioningCode1
Show:102550
← PrevPage 6 of 31Next →

No leaderboard results yet.