SOTAVerified

Image to text

Papers

Showing 141-150 of 246 papers

Title | Status | Hype
Revisiting DETR Pre-training for Object Detection | | 0
Transferable Decoding with Visual Entities for Zero-Shot Image Captioning | Code | 1
PRIOR: Prototype Representation Joint Learning from Medical Images and Reports | Code | 1
Towards a Visual-Language Foundation Model for Computational Pathology | | 0
Planting a SEED of Vision in Large Language Model | Code | 2
PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting | | 0
Bootstrapping Vision-Language Learning with Decoupled Language Pre-training | Code | 1
Emu: Generative Pretraining in Multimodality | Code | 3
MultiQG-TI: Towards Question Generation from Multi-modal Sources | Code | 0
Zero-shot Nuclei Detection via Visual-Language Pre-trained Models | Code | 0

No leaderboard results yet.