SOTAVerified

Image to text

Papers

Showing 226-246 of 246 papers

| Title | Status | Hype |
| --- | --- | --- |
| Learning Deep Structure-Preserving Image-Text Embeddings | | 0 |
| Learning Pseudo-Labeler beyond Noun Concepts for Open-Vocabulary Object Detection | | 0 |
| Leveraging AI to Generate Audio for User-generated Content in Video Games | | 0 |
| Leveraging Unpaired Data for Vision-Language Generative Models via Cycle Consistency | | 0 |
| MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering | | 0 |
| MedM2G: Unifying Medical Multi-Modal Generation via Cross-Guided Diffusion with Visual Invariant | | 0 |
| MFP-CLIP: Exploring the Efficacy of Multi-Form Prompts for Zero-Shot Industrial Anomaly Detection | | 0 |
| Category-Oriented Representation Learning for Image to Multi-Modal Retrieval | | 0 |
| Multilingual Image Corpus – Towards a Multimodal and Multilingual Dataset | | 0 |
| Multimodal Intelligence: Representation Learning, Information Fusion, and Applications | | 0 |
| Multimodal Neurons in Pretrained Text-Only Transformers | | 0 |
| Natural Language Generation | | 0 |
| Natural Language Generation from Visual Sequences: Challenges and Future Directions | | 0 |
| Offline Detection of Misspelled Handwritten Words by Convolving Recognition Model Features with Text Labels | | 0 |
| On the Importance of Text Preprocessing for Multimodal Representation Learning and Pathology Report Generation | | 0 |
| OVFoodSeg: Elevating Open-Vocabulary Food Image Segmentation via Image-Informed Textual Representation | | 0 |
| Paired Cross-Modal Data Augmentation for Fine-Grained Image-to-Text Retrieval | | 0 |
| Patch is Enough: Naturalistic Adversarial Patch against Vision-Language Pre-training Models | | 0 |
| PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting | | 0 |
| RefineNet: Enhancing Text-to-Image Conversion with High-Resolution and Detail Accuracy through Hierarchical Transformers and Progressive Refinement | | 0 |
| Reinforced UI Instruction Grounding: Towards a Generic UI Task Automation API | | 0 |

No leaderboard results yet.