SOTAVerified

Image to text

Papers

Showing 6170 of 246 papers

TitleStatusHype
Brain Captioning: Decoding human brain activity into images and textCode1
Distilled Dual-Encoder Model for Vision-Language UnderstandingCode1
FETA: Towards Specializing Foundation Models for Expert Task ApplicationsCode1
Linearly Mapping from Image to Text SpaceCode1
Can MLLMs Perform Text-to-Image In-Context Learning?Code1
Safe Latent Diffusion: Mitigating Inappropriate Degeneration in Diffusion ModelsCode1
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal CyclesCode1
See or Guess: Counterfactually Regularized Image CaptioningCode1
UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and GenerationCode1
Pragmatic Radiology Report GenerationCode0
Show:102550
← PrevPage 7 of 25Next →

No leaderboard results yet.