SOTAVerified

cross-modal alignment

Papers

Showing 241250 of 342 papers

TitleStatusHype
MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval0
It is Never Too Late to Mend: Separate Learning for Multimedia RecommendationCode0
Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching0
Multimodal Reasoning with Multimodal Knowledge Graph0
OmniBind: Teach to Build Unequal-Scale Modality Interaction for Omni-Bind of All0
AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability0
Context-Enhanced Video Moment Retrieval with Large Language Models0
Listen Then See: Video Alignment with Speaker AttentionCode0
Distributionally Robust Alignment for Medical Federated Vision-Language Pre-training Under Data Heterogeneity0
CIRP: Cross-Item Relational Pre-training for Multimodal Product Bundling0
Show:102550
← PrevPage 25 of 35Next →

No leaderboard results yet.