| Adaptive Spatial Transcriptomics Interpolation via Cross-modal Cross-slice Modeling | May 15, 2025 | cross-modal alignment | Code Available | 0 |
| Denoising and Alignment: Rethinking Domain Generalization for Multimodal Face Anti-Spoofing | May 14, 2025 | cross-modal alignment, Denoising | Unverified | 0 |
| Anatomical Attention Alignment representation for Radiology Report Generation | May 12, 2025 | cross-modal alignment, Decoder | Code Available | 0 |
| HCMA: Hierarchical Cross-model Alignment for Grounded Text-to-Image Generation | May 10, 2025 | cross-modal alignment, Image Generation | Code Available | 0 |
| Semantic-Space-Intervened Diffusive Alignment for Visual Classification | May 9, 2025 | Classification, cross-modal alignment | Unverified | 0 |
| Task-Adapter++: Task-specific Adaptation with Order-aware Alignment for Few-shot Action Recognition | May 9, 2025 | Action Recognition, cross-modal alignment | Code Available | 0 |
| Probabilistic Embeddings for Frozen Vision-Language Models: Uncertainty Quantification with Gaussian Process Latent Variable Models | May 8, 2025 | Active Learning, cross-modal alignment | Code Available | 0 |
| DenseGrounding: Improving Dense Language-Vision Semantics for Ego-Centric 3D Visual Grounding | May 8, 2025 | 3D visual grounding, cross-modal alignment | Unverified | 0 |
| PhysLLM: Harnessing Large Language Models for Cross-Modal Remote Physiological Sensing | May 6, 2025 | cross-modal alignment | Unverified | 0 |
| MicarVLMoE: A Modern Gated Cross-Aligned Vision-Language Mixture of Experts Model for Medical Image Captioning and Report Generation | Apr 29, 2025 | cross-modal alignment, Decoder | Code Available | 0 |