| F^2Depth: Self-supervised Indoor Monocular Depth Estimation via Optical Flow Consistency and Feature Map Synthesis | Mar 27, 2024 | Depth EstimationIndoor Monocular Depth Estimation | —Unverified | 0 |
| Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation | Mar 22, 2024 | Depth EstimationSurface Normal Estimation | CodeCode Available | 7 |
| Federated reinforcement learning for robot motion planning with zero-shot generalization | Mar 20, 2024 | Motion PlanningZero-shot Generalization | —Unverified | 0 |
| Quantifying uncertainty in lung cancer segmentation with foundation models applied to mixed-domain datasets | Mar 19, 2024 | Computed Tomography (CT)Segmentation | —Unverified | 0 |
| Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models | Mar 19, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity | Mar 18, 2024 | Zero-shot Generalization | CodeCode Available | 1 |
| Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model | Mar 17, 2024 | Image RestorationZero-shot Generalization | CodeCode Available | 2 |
| Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization | Mar 16, 2024 | Zero-shot Generalization | CodeCode Available | 1 |
| Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation | Mar 15, 2024 | Myocardium SegmentationSegmentation | —Unverified | 0 |
| FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images | Mar 14, 2024 | 3D Medical Imaging SegmentationGPU | CodeCode Available | 1 |
| Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models | Mar 14, 2024 | Continual LearningKnowledge Distillation | —Unverified | 0 |
| SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration | Mar 14, 2024 | Transfer LearningZero-shot Generalization | —Unverified | 0 |
| Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything | Mar 12, 2024 | GPUPoint Tracking | CodeCode Available | 1 |
| FluoroSAM: A Language-aligned Foundation Model for X-ray Image Segmentation | Mar 12, 2024 | DiagnosticImage Segmentation | CodeCode Available | 1 |
| RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model | Mar 12, 2024 | Change DetectionZero-shot Generalization | CodeCode Available | 2 |
| In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model | Mar 10, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising | Mar 7, 2024 | DenoisingInstance Segmentation | CodeCode Available | 0 |
| Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection | Mar 4, 2024 | Incremental Learningobject-detection | CodeCode Available | 1 |
| Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV | Mar 3, 2024 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 2 |
| Segment anything model for head and neck tumor segmentation with CT, PET and MRI multi-modality images | Feb 27, 2024 | SegmentationTumor Segmentation | CodeCode Available | 0 |
| Multimodal Instruction Tuning with Conditional Mixture of LoRA | Feb 24, 2024 | parameter-efficient fine-tuningZero-shot Generalization | CodeCode Available | 1 |
| Multi-Task Learning for Routing Problem with Cross-Problem Zero-Shot Generalization | Feb 23, 2024 | AttributeCombinatorial Optimization | CodeCode Available | 1 |
| IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus | Feb 22, 2024 | Zero-shot Generalization | CodeCode Available | 3 |
| ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling | Feb 21, 2024 | MMLURetrieval | CodeCode Available | 0 |
| Zero-shot generalization across architectures for visual classification | Feb 21, 2024 | ClassificationZero-shot Generalization | CodeCode Available | 0 |