| Probabilistic Embeddings for Frozen Vision-Language Models: Uncertainty Quantification with Gaussian Process Latent Variable Models | May 8, 2025 | Active Learning, cross-modal alignment | Code Available | 0 |
| Asymmetric Cross-Scale Alignment for Text-Based Person Search | Nov 26, 2022 | cross-modal alignment, Person Search | Code Available | 0 |
| Unmasked Teacher: Towards Training-Efficient Video Foundation Models | Mar 28, 2023 | Action Classification, Action Recognition | Code Available | 0 |
| Discrete Cross-Modal Alignment Enables Zero-Shot Speech Translation | Oct 18, 2022 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 0 |
| RCRank: Multimodal Ranking of Root Causes of Slow Queries in Cloud Database Systems | Mar 6, 2025 | cross-modal alignment | Code Available | 0 |
| ICPL-ReID: Identity-Conditional Prompt Learning for Multi-Spectral Object Re-Identification | May 23, 2025 | cross-modal alignment, Prompt Learning | Code Available | 0 |
| HyperPath: Knowledge-Guided Hyperbolic Semantic Hierarchy Modeling for WSI Analysis | Jun 19, 2025 | cross-modal alignment, Multiple Instance Learning | Code Available | 0 |
| Reinforced Cross-modal Alignment for Radiology Report Generation | May 1, 2022 | cross-modal alignment, Decision Making | Code Available | 0 |
| CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement | Feb 19, 2025 | cross-modal alignment, Fairness | Code Available | 0 |
| It is Never Too Late to Mend: Separate Learning for Multimedia Recommendation | Jun 12, 2024 | cross-modal alignment, Multimedia recommendation | Code Available | 0 |