SOTAVerified

Image to text

Papers

Showing 131140 of 246 papers

TitleStatusHype
BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification0
Sequential Semantic Generative Communication for Progressive Text-to-Image Generation0
Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction TuningCode2
Multimodal Foundation Models For Echocardiogram InterpretationCode1
Beyond One-to-One: Rethinking the Referring Image SegmentationCode1
Large Multilingual Models Pivot Zero-Shot Multimodal Learning across LanguagesCode6
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training0
Vision-Language Dataset DistillationCode1
Unifying Two-Stream Encoders with Transformers for Cross-Modal RetrievalCode1
Multimodal Neurons in Pretrained Text-Only Transformers0
Show:102550
← PrevPage 14 of 25Next →

No leaderboard results yet.