SOTAVerified

cross-modal alignment

Papers

Showing 151-160 of 342 papers

Title | Status | Hype
A Multi-Agent Framework for Automated Qinqiang Opera Script Generation Using Large Language Models | | 0
Cross-attention for State-based model RWKV-7 | | 0
TMCIR: Token Merge Benefits Composed Image Retrieval | | 0
3D CoCa: Contrastive Learners are 3D Captioners | Code | 0
InfoMAE: Pair-Efficient Cross-Modal Alignment for Multimodal Time-Series Sensing Signals | | 0
VLMT: Vision-Language Multimodal Transformer for Multimodal Multi-hop Question Answering | | 0
SE4Lip: Speech-Lip Encoder for Talking Head Synthesis to Solve Phoneme-Viseme Alignment Ambiguity | | 0
Gaze-Guided Learning: Avoiding Shortcut Bias in Visual Classification | Code | 0
DF-Calib: Targetless LiDAR-Camera Calibration via Depth Flow | | 0
Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval | | 0

No leaderboard results yet.