SOTAVerified

Image to text

Papers

Showing 201–210 of 246 papers

| Title | Status | Hype |
| --- | --- | --- |
| Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations | | 0 |
| COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval | | 0 |
| Characterizing and Understanding the Behavior of Quantized Models for Reliable Deployment | Code | 0 |
| Two-stream Hierarchical Similarity Reasoning for Image-text Matching | | 0 |
| A Thousand Words Are Worth More Than a Picture: Natural Language-Centric Outside-Knowledge Visual Question Answering | | 0 |
| EI-CLIP: Entity-Aware Interventional Contrastive Learning for E-Commerce Cross-Modal Retrieval | | 0 |
| Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge Visual Question Answering | | 0 |
| ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation | Code | 1 |
| Distilled Dual-Encoder Model for Vision-Language Understanding | Code | 1 |
| Self-Supervised Image-to-Text and Text-to-Image Synthesis | Code | 0 |

No leaderboard results yet.