SOTAVerified

Image to text

Papers

Showing 181-190 of 246 papers

Title | Status | Hype
Category-Oriented Representation Learning for Image to Multi-Modal Retrieval | - | 0
Image Captioners Sometimes Tell More Than Images They See | - | 0
Interpreting Vision and Language Generative Models with Semantic Visual Priors | - | 0
RoCOCO: Robustness Benchmark of MS-COCO to Stress-test Image-Text Matching Models | Code | 0
Is Cross-modal Information Retrieval Possible without Training? | - | 0
Task-Oriented Multi-Modal Mutual Leaning for Vision-Language Models | - | 0
CoBIT: A Contrastive Bi-directional Image-Text Generation Model | - | 0
Improving Table Structure Recognition with Visual-Alignment Sequential Coordinate Modeling | - | 0
An End-to-End Neural Network for Image-to-Audio Transformation | - | 0
VITR: Augmenting Vision Transformers with Relation-Focused Learning for Cross-Modal Information Retrieval | - | 0

No leaderboard results yet.