SOTAVerified

Image-text Classification

Papers

Showing 110 of 13 papers

TitleStatusHype
DIFFormer: Scalable (Graph) Transformers Induced by Energy Constrained DiffusionCode2
UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive LearningCode1
GIST: Generating Image-Specific Text for Fine-grained Object ClassificationCode1
GLAMI-1M: A Multilingual Image-Text Fashion DatasetCode1
Towards Unifying Medical Vision-and-Language Pre-training via Soft PromptsCode1
Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!0
Unified Generative and Discriminative Training for Multi-modal Large Language Models0
Leveraging Foundation Models for Multi-modal Federated Learning with Incomplete Modality0
Continuous Geometry-Aware Graph Diffusion via Hyperbolic Neural PDE0
CMA-CLIP: Cross-Modality Attention CLIP for Image-Text Classification0
Show:102550
← PrevPage 1 of 2Next →

No leaderboard results yet.