| SAM^Med: A medical image annotation framework based on large vision model | Jul 11, 2023 | Image SegmentationLiver Segmentation | —Unverified | 0 |
| Mechanistic Understandings of Representation Vulnerabilities and Engineering Robust Vision Transformers | Feb 7, 2025 | Zero-shot Generalization | —Unverified | 0 |
| MIFNet: Learning Modality-Invariant Features for Generalizable Multimodal Image Matching | Jan 20, 2025 | Keypoint DetectionZero-shot Generalization | —Unverified | 0 |
| Mixture of Cluster-conditional LoRA Experts for Vision-language Instruction Tuning | Dec 19, 2023 | DiversityInstruction Following | —Unverified | 0 |
| MmAP : Multi-modal Alignment Prompt for Cross-domain Multi-task Learning | Dec 14, 2023 | DecoderLanguage Modelling | —Unverified | 0 |
| Mono2Stereo: Monocular Knowledge Transfer for Enhanced Stereo Matching | Nov 14, 2024 | Depth EstimationKnowledge Distillation | —Unverified | 0 |
| Multiple Consistency-guided Test-Time Adaptation for Contrastive Audio-Language Models with Unlabeled Audio | Dec 23, 2024 | Contrastive LearningPrompt Learning | —Unverified | 0 |
| Multi-View Unsupervised Image Generation with Cross Attention Guidance | Dec 7, 2023 | Hard AttentionImage Generation | —Unverified | 0 |
| Neural Attention Memory | Feb 18, 2023 | Few-Shot LearningZero-shot Generalization | —Unverified | 0 |
| Neural Field Dynamics Model for Granular Object Piles Manipulation | Nov 1, 2023 | ObjectZero-shot Generalization | —Unverified | 0 |