SOTAVerified

Image to text

Papers

Showing 8190 of 246 papers

TitleStatusHype
Dallah: A Dialect-Aware Multimodal Large Language Model for Arabic0
Cross-modal Contrastive Attention Model for Medical Report Generation0
BIMCV-R: A Landmark Dataset for 3D CT Text-Image Retrieval0
Cross-Modal Alignment with Mixture Experts Neural Network for Intral-City Retail Recommendation0
Cross-Modal Adaptive Dual Association for Text-to-Image Person Retrieval0
BiLMa: Bidirectional Local-Matching for Text-based Person Re-identification0
An End-to-End Neural Network for Image-to-Audio Transformation0
COTS: Collaborative Two-Stream Vision-Language Pre-Training Model for Cross-Modal Retrieval0
Contrastive Learning of Visual-Semantic Embeddings0
Beyond Images: An Integrative Multi-modal Approach to Chest X-Ray Report Generation0
Show:102550
← PrevPage 9 of 25Next →

No leaderboard results yet.