| Scene-Intuitive Agent for Remote Embodied Visual Grounding | Mar 24, 2021 | cross-modal alignment, Navigate | Unverified | 0 |
| Generating Image Descriptions via Sequential Cross-Modal Alignment Guided by Human Gaze | Nov 9, 2020 | cross-modal alignment, Image Captioning | Code Available | 0 |
| Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags | Oct 27, 2020 | cross-modal alignment, Representation Learning | Code Available | 0 |
| ST-BERT: Cross-modal Language Model Pre-training For End-to-end Spoken Language Understanding | Oct 23, 2020 | cross-modal alignment, Language Modeling | Unverified | 0 |
| Reinforcement Learning for Weakly Supervised Temporal Grounding of Natural Language in Untrimmed Videos | Sep 18, 2020 | cross-modal alignment, reinforcement-learning | Unverified | 0 |
| Cross-Modal Alignment with Mixture Experts Neural Network for Intral-City Retail Recommendation | Sep 17, 2020 | cross-modal alignment, Image to text | Unverified | 0 |
| Unsupervised Cross-Modal Alignment for Multi-Person 3D Pose Estimation | Aug 4, 2020 | 2D Pose Estimation, 3D Human Pose Estimation | Code Available | 0 |
| Learning Multi-Modal Nonlinear Embeddings: Performance Bounds and an Algorithm | Jun 3, 2020 | cross-modal alignment, General Classification | Unverified | 0 |
| Cross-Modal Cross-Domain Moment Alignment Network for Person Search | Jun 1, 2020 | cross-modal alignment, Person Search | Unverified | 0 |
| Behind the Scene: Revealing the Secrets of Pre-trained Vision-and-Language Models | May 15, 2020 | coreference-resolution, Coreference Resolution | Unverified | 0 |
| Continuous Sign Language Recognition Through Cross-Modal Alignment of Video and Text Embeddings in a Joint-Latent Space | May 11, 2020 | cross-modal alignment, Decoder | Unverified | 0 |
| MCQA: Multimodal Co-attention Based Network for Question Answering | Apr 25, 2020 | cross-modal alignment, Question Answering | Unverified | 0 |
| Curriculum Audiovisual Learning | Jan 26, 2020 | Clustering, cross-modal alignment | Unverified | 0 |
| A coupled autoencoder approach for multi-modal analysis of cell types | Nov 6, 2019 | Clustering, cross-modal alignment | Code Available | 0 |
| ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching | Oct 1, 2019 | cross-modal alignment, Sentence | Unverified | 0 |
| Mix and match networks: cross-modal alignment for zero-pair image-to-image translation | Mar 8, 2019 | cross-modal alignment, Decoder | Unverified | 0 |
| Unsupervised Cross-Modal Alignment of Speech and Text Embedding Spaces | May 18, 2018 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Unverified | 0 |