SOTAVerified

Caption Generation

Papers

Showing 251260 of 310 papers

TitleStatusHype
Everything is a Video: Unifying Modalities through Next-Frame Prediction0
Examining the Effects of Language-and-Vision Data Augmentation for Generation of Descriptions of Human Faces0
Explainable Image Captioning using CNN- CNN architecture and Hierarchical Attention0
EzAudio: Enhancing Text-to-Audio Generation with Efficient Diffusion Transformer0
FaceGemma: Enhancing Image Captioning with Facial Attributes for Portrait Images0
Fast, Diverse and Accurate Image Captioning Guided By Part-of-Speech0
Fast Image Caption Generation with Position Alignment0
Feature Fusion Effects of Tensor Product Representation on (De)Compositional Network for Caption Generation for Images0
Less for More: Enhanced Feedback-aligned Mixed LLMs for Molecule Caption Generation and Fine-Grained NLI Evaluation0
FE-LWS: Refined Image-Text Representations via Decoder Stacking and Fused Encodings for Remote Sensing Image Captioning0
Show:102550
← PrevPage 26 of 31Next →

No leaderboard results yet.