SOTAVerified

Image to text

Papers

Showing 101–110 of 246 papers

Title | Status | Hype
Captions Are Worth a Thousand Words: Enhancing Product Retrieval with Pretrained Image-to-Text Models | | 0
Can MLLMs Perform Text-to-Image In-Context Learning? | Code | 1
Dynamic Traceback Learning for Medical Report Generation | | 0
Benchmarking Large Multimodal Models against Common Corruptions | Code | 1
CoCoT: Contrastive Chain-of-Thought Prompting for Large Multimodal Models with Multiple Image Inputs | | 0
SyCoCa: Symmetrizing Contrastive Captioners with Attentive Masking for Multimodal Alignment | | 0
Accept the Modality Gap: An Exploration in the Hyperbolic Space | | 0
Improving Image Restoration through Removing Degradations in Textual Representations | Code | 1
RefineNet: Enhancing Text-to-Image Conversion with High-Resolution and Detail Accuracy through Hierarchical Transformers and Progressive Refinement | | 0
DiffuVST: Narrating Fictional Scenes with Global-History-Guided Denoising Models | | 0

No leaderboard results yet.