SOTAVerified

cross-modal alignment

Papers

Showing 321330 of 342 papers

TitleStatusHype
KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge DistillationCode0
Learning Joint Embedding with Modality Alignments for Cross-Modal Retrieval of Recipes and Food Images0
Structured Multi-modal Feature Embedding and Alignment for Image-Sentence Retrieval0
Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic InformationCode0
Continual learning in cross-modal retrieval0
Scene-Intuitive Agent for Remote Embodied Visual Grounding0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human GazeCode0
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and TagsCode0
ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding0
Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos0
Show:102550
← PrevPage 33 of 35Next →

No leaderboard results yet.