SOTAVerified

cross-modal alignment

Papers

Showing 271-280 of 342 papers

Title | Status | Hype
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix | | 0
VLMT: Vision-Language Multimodal Transformer for Multimodal Multi-hop Question Answering | | 0
Wearable Accelerometer Foundation Models for Health via Knowledge Distillation | | 0
WhisQ: Cross-Modal Representation Learning for Text-to-Music MOS Prediction | | 0
Multi-level Cross-modal Alignment for Image Clustering | | 0
Multi-modal Attribute Prompting for Vision-Language Models | | 0
Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval | | 0
Multimodal Machine Learning in Mental Health: A Survey of Data, Algorithms, and Challenges | | 0
Multimodal Reasoning with Multimodal Knowledge Graph | | 0
Multi-path Exploration and Feedback Adjustment for Text-to-Image Person Retrieval | | 0

No leaderboard results yet.