| In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model | Mar 10, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising | Mar 7, 2024 | DenoisingInstance Segmentation | CodeCode Available | 0 |
| Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection | Mar 4, 2024 | Incremental Learningobject-detection | CodeCode Available | 1 |
| Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV | Mar 3, 2024 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 2 |
| Segment anything model for head and neck tumor segmentation with CT, PET and MRI multi-modality images | Feb 27, 2024 | SegmentationTumor Segmentation | CodeCode Available | 0 |
| Multimodal Instruction Tuning with Conditional Mixture of LoRA | Feb 24, 2024 | parameter-efficient fine-tuningZero-shot Generalization | CodeCode Available | 1 |
| Multi-Task Learning for Routing Problem with Cross-Problem Zero-Shot Generalization | Feb 23, 2024 | AttributeCombinatorial Optimization | CodeCode Available | 1 |
| IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus | Feb 22, 2024 | Zero-shot Generalization | CodeCode Available | 3 |
| ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling | Feb 21, 2024 | MMLURetrieval | CodeCode Available | 0 |
| Zero-shot generalization across architectures for visual classification | Feb 21, 2024 | ClassificationZero-shot Generalization | CodeCode Available | 0 |