SOTAVerified

Image to text

Papers

Showing 191200 of 246 papers

TitleStatusHype
TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP0
Towards a Visual-Language Foundation Model for Computational Pathology0
Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge Visual Question Answering0
TrojVLM: Backdoor Attack Against Vision Language Models0
Turbo Learning for Captionbot and Drawingbot0
Two-stream Hierarchical Similarity Reasoning for Image-text Matching0
Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations0
Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning0
UNITE-FND: Reframing Multimodal Fake News Detection through Unimodal Scene Translation0
Using Inter-Sentence Diverse Beam Search to Reduce Redundancy in Visual Storytelling0
Show:102550
← PrevPage 20 of 25Next →

No leaderboard results yet.