SOTAVerified

Image to text

Papers

Showing 126–150 of 246 papers

Title | Status | Hype
TMCIR: Token Merge Benefits Composed Image Retrieval | | 0
TNG-CLIP: Training-Time Negation Data Generation for Negation Awareness of CLIP | | 0
Towards a Visual-Language Foundation Model for Computational Pathology | | 0
Transform-Retrieve-Generate: Natural Language-Centric Outside-Knowledge Visual Question Answering | | 0
TrojVLM: Backdoor Attack Against Vision Language Models | | 0
Turbo Learning for Captionbot and Drawingbot | | 0
Two-stream Hierarchical Similarity Reasoning for Image-text Matching | | 0
Uncertainty-based Cross-Modal Retrieval with Probabilistic Representations | | 0
Understanding the Effect of using Semantically Meaningful Tokens for Visual Representation Learning | | 0
UNITE-FND: Reframing Multimodal Fake News Detection through Unimodal Scene Translation | | 0
Using Inter-Sentence Diverse Beam Search to Reduce Redundancy in Visual Storytelling | | 0
Utilizing Resource-Rich Language Datasets for End-to-End Scene Text Recognition in Resource-Poor Languages | | 0
Vision-Braille: An End-to-End Tool for Chinese Braille Image-to-Text Translation | | 0
Visual Fact Checker: Enabling High-Fidelity Detailed Caption Generation | | 0
When are Lemons Purple? The Concept Association Bias of Vision-Language Models | | 0
X-Fusion: Introducing New Modality to Frozen Large Language Models | | 0
15M Multimodal Facial Image-Text Dataset | | 0
Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning | | 0
Towards Cross-modal Retrieval in Chinese Cultural Heritage Documents: Dataset and Solution | | 0
ABC: Achieving Better Control of Multimodal Embeddings using VLMs | | 0
Accept the Modality Gap: An Exploration in the Hyperbolic Space | | 0
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training | | 0
AICoderEval: Improving AI Domain Code Generation of Large Language Models | | 0
AI Recommendation System for Enhanced Customer Experience: A Novel Image-to-Text Method | | 0
An End-to-End Neural Network for Image-to-Audio Transformation | | 0

No leaderboard results yet.