| DiPEx: Dispersing Prompt Expansion for Class-Agnostic Object Detection | Jun 21, 2024 | Class-agnostic Object DetectionMulti-object discovery | CodeCode Available | 1 | 5 |
| Efficient Multimodal Semantic Segmentation via Dual-Prompt Learning | Dec 1, 2023 | Decoderobject-detection | CodeCode Available | 1 | 5 |
| EarthMarker: A Visual Prompting Multi-modal Large Language Model for Remote Sensing | Jul 18, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 | 5 |
| Efficient 3D-Aware Facial Image Editing via Attribute-Specific Prompt Learning | Jun 6, 2024 | AttributeLanguage Modelling | CodeCode Available | 1 | 5 |
| Diffusion Model is an Effective Planner and Data Synthesizer for Multi-Task Reinforcement Learning | May 29, 2023 | Prompt LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| AAPL: Adding Attributes to Prompt Learning for Vision-Language Models | Apr 25, 2024 | Data AugmentationDomain Generalization | CodeCode Available | 1 | 5 |
| Enhanced OoD Detection through Cross-Modal Alignment of Multi-Modal Representations | Mar 24, 2025 | cross-modal alignmentImage Classification | CodeCode Available | 1 | 5 |
| Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs | Oct 21, 2023 | In-Context LearningPrompt Learning | CodeCode Available | 1 | 5 |
| Active Prompt Learning in Vision Language Models | Nov 18, 2023 | Active LearningPrompt Learning | CodeCode Available | 1 | 5 |
| Discffusion: Discriminative Diffusion Models as Few-shot Vision and Language Learners | May 18, 2023 | Image GenerationImage-text matching | CodeCode Available | 1 | 5 |