| ICPL-ReID: Identity-Conditional Prompt Learning for Multi-Spectral Object Re-Identification | May 23, 2025 | cross-modal alignment, Prompt Learning | Code Available | 0 | 5 |
| KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation | Sep 22, 2021 | cross-modal alignment, Knowledge Distillation | Code Available | 0 | 5 |
| HyperPath: Knowledge-Guided Hyperbolic Semantic Hierarchy Modeling for WSI Analysis | Jun 19, 2025 | cross-modal alignment, Multiple Instance Learning | Code Available | 0 | 5 |
| Improving Cross-Modal Alignment in Vision Language Navigation via Syntactic Information | Apr 19, 2021 | cross-modal alignment, Navigate | Code Available | 0 | 5 |
| Gaze-Guided Learning: Avoiding Shortcut Bias in Visual Classification | Apr 8, 2025 | cross-modal alignment, Image Classification | Code Available | 0 | 5 |
| Language-Guided Diffusion Model for Visual Grounding | Aug 18, 2023 | cross-modal alignment, Denoising | Code Available | 0 | 5 |
| Asymmetric Cross-Scale Alignment for Text-Based Person Search | Nov 26, 2022 | cross-modal alignment, Person Search | Code Available | 0 | 5 |
| HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation | May 10, 2025 | cross-modal alignment, Image Generation | Code Available | 0 | 5 |
| Evaluating Semantic Variation in Text-to-Image Synthesis: A Causal Perspective | Oct 14, 2024 | cross-modal alignment, Image Generation | Code Available | 0 | 5 |
| Enhancing Visual Representation for Text-based Person Searching | Dec 30, 2024 | cross-modal alignment, Person Search | Code Available | 0 | 5 |