| Context-Enhanced Video Moment Retrieval with Large Language Models | May 21, 2024 | cross-modal alignmentLanguage Modeling | —Unverified | 0 | 0 |
| Continual learning in cross-modal retrieval | Apr 14, 2021 | Continual Learningcross-modal alignment | —Unverified | 0 | 0 |
| Continuous Sign Language Recognition Through Cross-Modal Alignment of Video and Text Embeddings in a Joint-Latent Space | May 11, 2020 | cross-modal alignmentDecoder | —Unverified | 0 | 0 |
| COST: Contrastive One-Stage Transformer for Vision-Language Small Object Tracking | Apr 2, 2025 | cross-modal alignmentObject | —Unverified | 0 | 0 |
| CoVLR: Coordinating Cross-Modal Consistency and Intra-Modal Structure for Vision-Language Retrieval | Apr 15, 2023 | cross-modal alignmentCross-Modal Retrieval | —Unverified | 0 | 0 |
| Cross-attention for State-based model RWKV-7 | Apr 19, 2025 | cross-modal alignmentImage Generation | —Unverified | 0 | 0 |
| Cross-aware Early Fusion with Stage-divided Vision and Language Transformer Encoders for Referring Image Segmentation | Aug 14, 2024 | cross-modal alignmentImage Segmentation | —Unverified | 0 | 0 |
| Cross-Modal Alignment Learning of Vision-Language Conceptual Systems | Jul 31, 2022 | cross-modal alignmentRepresentation Learning | —Unverified | 0 | 0 |
| Cross-Modal Alignment with Mixture Experts Neural Network for Intral-City Retail Recommendation | Sep 17, 2020 | cross-modal alignmentImage to text | —Unverified | 0 | 0 |
| Cross-modal Alignment with Optimal Transport for CTC-based ASR | Sep 24, 2023 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |