SOTAVerified

Image to text

Papers

Showing 121130 of 246 papers

TitleStatusHype
Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning0
SingleInsert: Inserting New Concepts from a Single Image into Text-to-Image Models for Flexible Editing0
Symmetrical Linguistic Feature Distillation with CLIP for Scene Text RecognitionCode1
Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API0
Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency0
Fine-grained Late-interaction Multi-modal Retrieval for Retrieval Augmented Visual Question AnsweringCode2
Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored SearchCode0
SurrogatePrompt: Bypassing the Safety Filter of Text-to-Image Models via Substitution0
Offline Detection of Misspelled Handwritten Words by Convolving Recognition Model Features with Text Labels0
CLIP-based Synergistic Knowledge Transfer for Text-based Person RetrievalCode0
Show:102550
← PrevPage 13 of 25Next →

No leaderboard results yet.