SOTAVerified

Image to text

Papers

Showing 5160 of 246 papers

TitleStatusHype
Multimodal Procedural Planning via Dual Text-Image PromptingCode1
MAGVLT: Masked Generative Vision-and-Language TransformerCode1
ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language GenerationCode1
Towards Unifying Medical Vision-and-Language Pre-training via Soft PromptsCode1
Language Quantized AutoEncoders: Towards Unsupervised Text-Image AlignmentCode1
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion ModelsCode1
Visual Spatial Description: Controlled Spatial-Oriented Image-to-Text GenerationCode1
Linearly Mapping from Image to Text SpaceCode1
FETA: Towards Specializing Foundation Models for Expert Task ApplicationsCode1
What is Where by Looking: Weakly-Supervised Open-World Phrase-Grounding without Text InputsCode1
Show:102550
← PrevPage 6 of 25Next →

No leaderboard results yet.