SOTAVerified

Image-text Retrieval

Papers

Showing 201210 of 248 papers

TitleStatusHype
CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval0
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training BenchmarkCode0
BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and GenerationCode5
Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text Retrieval0
Unified Multimodal Pre-training and Prompt-based Tuning for Vision-Language Understanding and Generation0
UFO: A UniFied TransfOrmer for Vision-Language Representation Learning0
Constructing Phrase-level Semantic Labels to Form Multi-GrainedSupervision for Image-Text Retrieval0
SwAMP: Swapped Assignment of Multi-Modal Pairs for Cross-Modal Retrieval0
FILIP: Fine-grained Interactive Language-Image Pre-TrainingCode1
Negative Sample is Negative in Its Own Way: Tailoring Negative Sentences for Image-Text RetrievalCode0
Show:102550
← PrevPage 21 of 25Next →

No leaderboard results yet.