| Turning a CLIP Model into a Scene Text Spotter | Aug 21, 2023 | object-detectionObject Detection | CodeCode Available | 2 |
| Self-regulating Prompts: Foundational Model Adaptation without Forgetting | Jul 13, 2023 | Diversitymodel | CodeCode Available | 2 |
| RSPrompter: Learning to Prompt for Remote Sensing Instance Segmentation based on Visual Foundation Model | Jun 28, 2023 | Image SegmentationInstance Segmentation | CodeCode Available | 2 |
| Visual Prompt Multi-Modal Tracking | Mar 20, 2023 | Object TrackingPrompt Learning | CodeCode Available | 2 |
| Decoupling Knowledge from Memorization: Retrieval-augmented Prompt Learning | May 29, 2022 | Few-Shot Text ClassificationMemorization | CodeCode Available | 2 |
| PromptDet: Towards Open-vocabulary Detection using Uncurated Images | Mar 30, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| OpenPrompt: An Open-source Framework for Prompt-learning | Nov 3, 2021 | Prompt Learning | CodeCode Available | 2 |
| Learning to Prompt for Vision-Language Models | Sep 2, 2021 | Domain GeneralizationFew-shot Age Estimation | CodeCode Available | 2 |
| Taming Vision-Language Models for Medical Image Analysis: A Comprehensive Review | Jun 23, 2025 | Medical Image AnalysisPrompt Learning | CodeCode Available | 1 |
| Foundation Molecular Grammar: Multi-Modal Foundation Models Induce Interpretable Molecular Graph Languages | May 29, 2025 | DiversityPrompt Learning | CodeCode Available | 1 |