| MLLMs are Deeply Affected by Modality Bias | May 24, 2025 | cross-modal alignment | —Unverified | 0 |
| ICPL-ReID: Identity-Conditional Prompt Learning for Multi-Spectral Object Re-Identification | May 23, 2025 | cross-modal alignmentPrompt Learning | CodeCode Available | 0 |
| Clip4Retrofit: Enabling Real-Time Image Labeling on Edge Devices via Cross-Architecture CLIP Distillation | May 23, 2025 | Autonomous Drivingcross-modal alignment | —Unverified | 0 |
| Representation Discrepancy Bridging Method for Remote Sensing Image-Text Retrieval | May 22, 2025 | cross-modal alignmentImage-text Retrieval | —Unverified | 0 |
| CAD: A General Multimodal Framework for Video Deepfake Detection via Cross-Modal Alignment and Distillation | May 21, 2025 | cross-modal alignmentDeepFake Detection | —Unverified | 0 |
| ALN-P3: Unified Language Alignment for Perception, Prediction, and Planning in Autonomous Driving | May 21, 2025 | Autonomous Drivingcross-modal alignment | —Unverified | 0 |
| U-SAM: An audio language Model for Unified Speech, Audio, and Music Understanding | May 20, 2025 | cross-modal alignmentLanguage Modeling | CodeCode Available | 1 |
| Enhancing LLMs for Time Series Forecasting via Structure-Guided Cross-Modal Alignment | May 19, 2025 | cross-modal alignmentTime Series | —Unverified | 0 |
| Beyond Modality Collapse: Representations Blending for Multimodal Dataset Distillation | May 16, 2025 | cross-modal alignmentDataset Distillation | —Unverified | 0 |
| FALCON: False-Negative Aware Learning of Contrastive Negatives in Vision-Language Pretraining | May 16, 2025 | cross-modal alignment | —Unverified | 0 |