SOTAVerified

cross-modal alignment

Papers

Showing 11-20 of 342 papers

Title | Status | Hype
DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment | Code | 2
ReID5o: Achieving Omni Multi-modal Person Re-identification in a Single Model | Code | 2
mmE5: Improving Multimodal Multilingual Embeddings via High-quality Synthetic Data | Code | 2
Deciphering Cross-Modal Alignment in Large Vision-Language Models with Modality Integration Rate | Code | 2
Melody-Guided Music Generation | Code | 2
Law of Vision Representation in MLLMs | Code | 2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Code | 2
Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Code | 2
DeCo: Decoupling Token Compression from Semantic Abstraction in Multimodal Large Language Models | Code | 2
Seeing the Image: Prioritizing Visual Correlation by Contrastive Alignment | Code | 2

No leaderboard results yet.