| WiCo: Win-win Cooperation of Bottom-up and Top-down Referring Image Segmentation | Jun 19, 2023 | cross-modal alignment, Image Segmentation | —Unverified | 0 | 0 |
| Language Model Mapping in Multimodal Music Learning: A Grand Challenge Proposal | Mar 1, 2025 | cross-modal alignment, Language Modeling | —Unverified | 0 | 0 |
| VISTA: Enhancing Vision-Text Alignment in MLLMs via Cross-Modal Mutual Information Maximization | May 16, 2025 | cross-modal alignment, MME | —Unverified | 0 | 0 |
| FALCON: False-Negative Aware Learning of Contrastive Negatives in Vision-Language Pretraining | May 16, 2025 | cross-modal alignment | —Unverified | 0 | 0 |
| 4D-ACFNet: A 4D Attention Mechanism-Based Prognostic Framework for Colorectal Cancer Liver Metastasis Integrating Multimodal Spatiotemporal Features | Mar 12, 2025 | cross-modal alignment, Disentanglement | —Unverified | 0 | 0 |
| ACMM: Aligned Cross-Modal Memory for Few-Shot Image and Sentence Matching | Oct 1, 2019 | cross-modal alignment, Sentence | —Unverified | 0 | 0 |
| ALAS: Measuring Latent Speech-Text Alignment For Spoken Language Understanding In Multimodal LLMs | May 26, 2025 | cross-modal alignment, Emotion Recognition | —Unverified | 0 | 0 |
| AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability | May 23, 2024 | cross-modal alignment, Language Modelling | —Unverified | 0 | 0 |
| AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment | Dec 1, 2024 | cross-modal alignment, Mamba | —Unverified | 0 | 0 |
| AlignSTS: Speech-to-Singing Conversion via Cross-Modal Alignment | May 8, 2023 | cross-modal alignment, Rhythm | —Unverified | 0 | 0 |