SOTAVerified

cross-modal alignment

Papers

Showing 201210 of 342 papers

TitleStatusHype
MMA-DFER: MultiModal Adaptation of unimodal models for Dynamic Facial Expression Recognition in-the-wildCode2
Distributionally Robust Alignment for Medical Federated Vision-Language Pre-training Under Data Heterogeneity0
CIRP: Cross-Item Relational Pre-training for Multimodal Product Bundling0
SeCG: Semantic-Enhanced 3D Visual Grounding via Cross-modal Graph AttentionCode0
Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision0
A Cross-Modal Approach to Silent Speech with LLM-Enhanced RecognitionCode1
Multi-modal Attribute Prompting for Vision-Language Models0
Semantics-enhanced Cross-modal Masked Image Modeling for Vision-Language Pre-training0
MENTOR: Multi-level Self-supervised Learning for Multimodal RecommendationCode1
Mind the Modality Gap: Towards a Remote Sensing Vision-Language Model via Cross-modal Alignment0
Show:102550
← PrevPage 21 of 35Next →

No leaderboard results yet.