SOTAVerified

Image to text

Papers

Showing 1-10 of 246 papers

Title | Status | Hype
Versatile Diffusion: Text, Images and Variations All in One Diffusion Model | Code | 6
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across Languages | Code | 6
FlowTok: Flowing Seamlessly Across Text and Image Tokens | Code | 5
Magma: A Foundation Model for Multimodal AI Agents | Code | 5
BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | Code | 4
Emu: Generative Pretraining in Multimodality | Code | 3
Evaluating Text-to-Visual Generation with Image-to-Text Generation | Code | 3
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale | Code | 3
Generative Diffusion Models on Graphs: Methods and Applications | Code | 2
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Code | 2

No leaderboard results yet.