SOTAVerified

Image to text

Papers

Showing 21-30 of 246 papers

Title | Status | Hype
CoMat: Aligning Text-to-Image Diffusion Model with Image-to-Text Concept Matching | Code | 2
Bootstrapping Vision-Language Learning with Decoupled Language Pre-training | Code | 1
ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation | Code | 1
Improving Image Restoration through Removing Degradations in Textual Representations | Code | 1
DoraCycle: Domain-Oriented Adaptation of Unified Generative Model in Multimodal Cycles | Code | 1
FETA: Towards Specializing Foundation Models for Expert Task Applications | Code | 1
Beyond One-to-One: Rethinking the Referring Image Segmentation | Code | 1
Distilled Dual-Encoder Model for Vision-Language Understanding | Code | 1
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision Models | Code | 1
Improving Factual Completeness and Consistency of Image-to-Text Radiology Report Generation | Code | 1

No leaderboard results yet.