SOTAVerified

Image to text

Papers

Showing 141150 of 246 papers

TitleStatusHype
X-Fusion: Introducing New Modality to Frozen Large Language Models0
15M Multimodal Facial Image-Text Dataset0
Ziya-Visual: Bilingual Large Vision-Language Model via Multi-Task Instruction Tuning0
Towards Cross-modal Retrieval in Chinese Cultural Heritage Documents: Dataset and Solution0
ABC: Achieving Better Control of Multimodal Embeddings using VLMs0
Accept the Modality Gap: An Exploration in the Hyperbolic Space0
Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training0
AICoderEval: Improving AI Domain Code Generation of Large Language Models0
AI Recommendation System for Enhanced Customer Experience: A Novel Image-to-Text Method0
An End-to-End Neural Network for Image-to-Audio Transformation0
Show:102550
← PrevPage 15 of 25Next →

No leaderboard results yet.