| Language-based Image Colorization: A Benchmark and Beyond | Mar 19, 2025 | BenchmarkingColorization | CodeCode Available | 0 | 5 |
| Language-Guided Diffusion Model for Visual Grounding | Aug 18, 2023 | cross-modal alignmentDenoising | CodeCode Available | 0 | 5 |
| KALE: An Artwork Image Captioning System Augmented with Heterogeneous Graph | Sep 17, 2024 | cross-modal alignmentImage Captioning | CodeCode Available | 0 | 5 |
| KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation | Sep 22, 2021 | cross-modal alignmentKnowledge Distillation | CodeCode Available | 0 | 5 |
| Craft: Cross-modal Aligned Features Improve Robustness of Prompt Tuning | Jul 22, 2024 | cross-modal alignment | CodeCode Available | 0 | 5 |
| Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information | Apr 19, 2021 | cross-modal alignmentNavigate | CodeCode Available | 0 | 5 |
| Learning Contextual Tag Embeddings for Cross-Modal Alignment of Audio and Tags | Oct 27, 2020 | cross-modal alignmentRepresentation Learning | CodeCode Available | 0 | 5 |
| Align before Search: Aligning Ads Image to Text for Accurate Cross-Modal Sponsored Search | Sep 28, 2023 | cross-modal alignmentCross-Modal Retrieval | CodeCode Available | 0 | 5 |
| Contrast-augmented Diffusion Model with Fine-grained Sequence Alignment for Markup-to-Image Generation | Aug 2, 2023 | cross-modal alignmentDenoising | CodeCode Available | 0 | 5 |
| Focus on Focus: Focus-oriented Representation Learning and Multi-view Cross-modal Alignment for Glioma Grading | Aug 16, 2024 | Contrastive Learningcross-modal alignment | CodeCode Available | 0 | 5 |