SOTAVerified

cross-modal alignment

Papers

Showing 326-342 of 342 papers

Title | Status | Hype
Scene-Intuitive Agent for Remote Embodied Visual Grounding | - | 0
Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze | Code | 0
Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags | Code | 0
ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding | - | 0
Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos | - | 0
Cross-Modal Alignment with Mixture Experts Neural Network for Intral-City Retail Recommendation | - | 0
Unsupervised Cross-Modal Alignment for Multi-Person 3D Pose Estimation | Code | 0
Learning Multi-Modal Nonlinear Embeddings: Performance Bounds and an Algorithm | - | 0
Cross-Modal Cross-Domain Moment Alignment Network for Person Search | - | 0
Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models | - | 0
Continuous Sign Language Recognition Through Cross-Modal Alignment of Video and Text Embeddings in a Joint-Latent Space | - | 0
MCQA: Multimodal Co-attention Based Network for Question Answering | - | 0
Curriculum Audiovisual Learning | - | 0
A coupled autoencoder approach for multi-modal analysis of cell types | Code | 0
ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching | - | 0
Mix and match networks: cross-modal alignment for zero-pair image-to-image translation | - | 0
Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces | - | 0

No leaderboard results yet.