SOTAVerified

cross-modal alignment

Papers

Showing 131140 of 342 papers

TitleStatusHype
Language Model Mapping in Multimodal Music Learning: A Grand Challenge Proposal0
ChartAdapter: Large Vision-Language Model for Chart Summarization0
DiSa: Directional Saliency-Aware Prompt Learning for Generalizable Vision-Language Models0
CGP-Tuning: Structure-Aware Soft Prompt Tuning for Code Vulnerability Detection0
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment0
Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching0
HiTeA: Hierarchical Temporal-Aware Video-Language Pre-training0
DF-Calib: Targetless LiDAR-Camera Calibration via Depth Flow0
A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models0
Detection-based Intermediate Supervision for Visual Question Answering0
Show:102550
← PrevPage 14 of 35Next →

No leaderboard results yet.