SOTAVerified

cross-modal alignment

Papers

Showing 181-190 of 342 papers

| Title | Status | Hype |
| --- | --- | --- |
| Cross-Modal Attention Alignment Network with Auxiliary Text Description for zero-shot sketch-based image retrieval | | 0 |
| MLLM as Video Narrator: Mitigating Modality Imbalance in Video Moment Retrieval | | 0 |
| Mitigate the Gap: Investigating Approaches for Improving Cross-Modal Alignment in CLIP | Code | 2 |
| PRESTO: Progressive Pretraining Enhances Synthetic Chemistry Outcomes | Code | 1 |
| Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams | Code | 3 |
| It is Never Too Late to Mend: Separate Learning for Multimedia Recommendation | Code | 0 |
| MMPolymer: A Multimodal Multitask Pretraining Framework for Polymer Property Prediction | Code | 1 |
| Hire: Hybrid-modal Interaction with Multiple Relational Enhancements for Image-Text Matching | | 0 |
| Multimodal Reasoning with Multimodal Knowledge Graph | | 0 |
| Collaborative Novel Object Discovery and Box-Guided Cross-Modal Alignment for Open-Vocabulary 3D Object Detection | Code | 3 |
Page 19 of 35

No leaderboard results yet.