SOTAVerified

Image-text Retrieval

Papers

Showing 221230 of 248 papers

TitleStatusHype
CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval0
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training BenchmarkCode0
Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval0
Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation0
UFO: A UniFied TransfOrmer for Vision-Language Representation Learning0
Constructing Phrase-level Semantic Labels to Form Multi-GrainedSupervision for Image-Text Retrieval0
SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval0
Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text RetrievalCode0
Constructing Phrase-level Semantic Labels to Form Multi-Grained Supervision for Image-Text Retrieval0
Multi-stage Pre-training over Simplified Multimodal Pre-training ModelsCode0
Show:102550
← PrevPage 23 of 25Next →

No leaderboard results yet.