SOTAVerified

cross-modal alignment

Papers

Showing 301310 of 342 papers

TitleStatusHype
Masked Vision and Language Modeling for Multi-modal Representation Learning0
MCAD: Multi-teacher Cross-modal Alignment Distillation for efficient image-text retrieval0
MCQA: Multimodal Co-attention Based Network for Question Answering0
MDE: Modality Discrimination Enhancement for Multi-modal Recommendation0
Mind the Modality Gap: Towards a Remote Sensing Vision-Language Model via Cross-modal Alignment0
Distributionally Robust Alignment for Medical Federated Vision-Language Pre-training Under Data Heterogeneity0
Mix and match networks: cross-modal alignment for zero-pair image-to-image translation0
MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval0
MLLMs are Deeply Affected by Modality Bias0
Modeling the Human Visual System: Comparative Insights from Response-Optimized and Task-Optimized Vision Models, Language Models, and different Readout Mechanisms0
Show:102550
← PrevPage 31 of 35Next →

No leaderboard results yet.