SOTAVerified

Image to text

Papers

Showing 181190 of 246 papers

TitleStatusHype
Retrieval-Augmented Multimodal Language Modeling0
Versatile Diffusion: Text, Images and Variations All in One Diffusion ModelCode6
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion ModelsCode1
Learning by Hallucinating: Vision-Language Pre-training with Weak Supervision0
Improving the Factual Correctness of Radiology Report Generation with Semantic Rewards0
Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text GenerationCode1
Image Semantic Relation Generation0
Pix2Struct: Screenshot Parsing as Pretraining for Visual Language UnderstandingCode2
Cross-modal Contrastive Attention Model for Medical Report Generation0
Linearly Mapping from Image to Text SpaceCode1
Show:102550
← PrevPage 19 of 25Next →

No leaderboard results yet.