SOTAVerified

Image-text Retrieval

Papers

Showing 2130 of 248 papers

TitleStatusHype
DreamLIP: Language-Image Pre-training with Long CaptionsCode2
FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model EvaluationCode2
Med3DVLM: An Efficient Vision-Language Model for 3D Medical Image AnalysisCode2
Scaling Up Visual and Vision-Language Representation Learning With Noisy Text SupervisionCode2
RWKV-CLIP: A Robust Vision-Language Representation LearnerCode2
CVLUE: A New Benchmark Dataset for Chinese Vision-Language Understanding EvaluationCode1
A Deep Local and Global Scene-Graph Matching for Image-Text RetrievalCode1
Cross-View Language Modeling: Towards Unified Cross-Lingual Cross-Modal Pre-trainingCode1
Efficient Vision-Language Pretraining with Visual Concepts and Hierarchical AlignmentCode1
ALIP: Adaptive Language-Image Pre-training with Synthetic CaptionCode1
Show:102550
← PrevPage 3 of 25Next →

No leaderboard results yet.