SOTAVerified

Image to text

Papers

Showing 110 of 246 papers

TitleStatusHype
Improving Medical Visual Representation Learning with Pathological-level Cross-Modal Alignment and Correlation Exploration0
ChartReasoner: Code-Driven Modality Bridging for Long-Chain Reasoning in Chart Question Answering0
Efficient Medical Vision-Language Alignment Through Adapting Masked Vision ModelsCode1
TNG-CLIP:Training-Time Negation Data Generation for Negation Awareness of CLIP0
BRIT: Bidirectional Retrieval over Unified Image-Text Graph0
Robustifying Vision-Language Models via Dynamic Token Reweighting0
UniMoCo: Unified Modality Completion for Robust Multi-Modal EmbeddingsCode0
Towards Cross-modal Retrieval in Chinese Cultural Heritage Documents: Dataset and Solution0
X-Fusion: Introducing New Modality to Frozen Large Language Models0
SemCORE: A Semantic-Enhanced Generative Cross-Modal Retrieval Framework with MLLMs0
Show:102550
← PrevPage 1 of 25Next →

No leaderboard results yet.