SOTAVerified

Image to text

Papers

Showing 4150 of 246 papers

TitleStatusHype
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text RecognitionCode1
Multimodal Foundation Models For Echocardiogram InterpretationCode1
Beyond One-to-One: Rethinking the Referring Image SegmentationCode1
Vision-Language Dataset DistillationCode1
Unifying Two-Stream Encoders with Transformers for Cross-Modal RetrievalCode1
Transferable Decoding with Visual Entities for Zero-Shot Image CaptioningCode1
PRIOR: Prototype Representation Joint Learning from Medical Images and ReportsCode1
Bootstrapping Vision-Language Learning with Decoupled Language Pre-trainingCode1
Brain Captioning: Decoding human brain activity into images and textCode1
What You See is What You Read? Improving Text-Image Alignment EvaluationCode1
Show:102550
← PrevPage 5 of 25Next →

No leaderboard results yet.