| Amortized Active Causal Induction with Deep Reinforcement Learning | May 26, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| M^3GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation | May 25, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SMART: Scalable Multi-agent Real-time Motion Generation via Next-token Prediction | May 24, 2024 | Autonomous DrivingMotion Generation | CodeCode Available | 3 |
| Gradient Projection For Continual Parameter-Efficient Tuning | May 22, 2024 | Continual LearningHallucination | —Unverified | 0 |
| Prompt Learning for Generalized Vehicle Routing | May 20, 2024 | Combinatorial OptimizationPrompt Learning | CodeCode Available | 0 |
| Revisiting the Robust Generalization of Adversarial Prompt Tuning | May 18, 2024 | Adversarial RobustnessPrompt Learning | —Unverified | 0 |
| A Minimalist Prompt for Zero-Shot Policy Learning | May 9, 2024 | Zero-shot Generalization | —Unverified | 0 |
| Enhancing Vision-Language Models Generalization via Diversity-Driven Novel Feature Synthesis | May 4, 2024 | DiversityZero-shot Generalization | —Unverified | 0 |
| On the test-time zero-shot generalization of vision-language models: Do we really need prompt learning? | May 3, 2024 | Computational EfficiencyPrompt Learning | CodeCode Available | 2 |
| MVMoE: Multi-Task Vehicle Routing Solver with Mixture-of-Experts | May 2, 2024 | Combinatorial OptimizationMixture-of-Experts | CodeCode Available | 3 |
| Instruction Matters: A Simple yet Effective Task Selection for Optimized Instruction Tuning of Specific Tasks | Apr 25, 2024 | Zero-shot Generalization | CodeCode Available | 0 |
| The Third Monocular Depth Estimation Challenge | Apr 25, 2024 | Depth EstimationMonocular Depth Estimation | —Unverified | 0 |
| CompilerDream: Learning a Compiler World Model for General Code Optimization | Apr 24, 2024 | DiversityModel-based Reinforcement Learning | CodeCode Available | 1 |
| Enabling Natural Zero-Shot Prompting on Encoder Models via Statement-Tuning | Apr 19, 2024 | DiversityZero-shot Generalization | —Unverified | 0 |
| Inferring Behavior-Specific Context Improves Zero-Shot Generalization in Reinforcement Learning | Apr 15, 2024 | reinforcement-learningReinforcement Learning | CodeCode Available | 0 |
| PromptSync: Bridging Domain Gaps in Vision-Language Models through Class-Aware Prototype Alignment and Discrimination | Apr 11, 2024 | Contrastive LearningDomain Generalization | —Unverified | 0 |
| GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis | Apr 9, 2024 | Image GenerationZero-shot Generalization | CodeCode Available | 2 |
| Visually Descriptive Language Model for Vector Graphics Reasoning | Apr 9, 2024 | DescriptiveLanguage Modeling | CodeCode Available | 9 |
| CLIP-Embed-KD: Computationally Efficient Knowledge Distillation Using Embeddings as Teachers | Apr 9, 2024 | Knowledge DistillationZero-shot Generalization | CodeCode Available | 1 |
| DiffCJK: Conditional Diffusion Model for High-Quality and Wide-coverage CJK Character Generation | Apr 8, 2024 | Zero-shot Generalization | —Unverified | 0 |
| No "Zero-Shot" Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance | Apr 4, 2024 | BenchmarkingImage Generation | CodeCode Available | 2 |
| Know Your Neighbors: Improving Single-View Reconstruction via Spatial Vision-Language Reasoning | Apr 4, 2024 | 3D Scene ReconstructionDepth Estimation | CodeCode Available | 2 |
| Decision Transformer as a Foundation Model for Partially Observable Continuous Control | Apr 3, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction | Apr 3, 2024 | Image GenerationImage Reconstruction | CodeCode Available | 9 |
| Where to Move Next: Zero-shot Generalization of LLMs for Next POI Recommendation | Apr 2, 2024 | Zero-shot Generalization | CodeCode Available | 1 |
| F^2Depth: Self-supervised Indoor Monocular Depth Estimation via Optical Flow Consistency and Feature Map Synthesis | Mar 27, 2024 | Depth EstimationIndoor Monocular Depth Estimation | —Unverified | 0 |
| Metric3Dv2: A Versatile Monocular Geometric Foundation Model for Zero-shot Metric Depth and Surface Normal Estimation | Mar 22, 2024 | Depth EstimationSurface Normal Estimation | CodeCode Available | 7 |
| Federated reinforcement learning for robot motion planning with zero-shot generalization | Mar 20, 2024 | Motion PlanningZero-shot Generalization | —Unverified | 0 |
| Quantifying uncertainty in lung cancer segmentation with foundation models applied to mixed-domain datasets | Mar 19, 2024 | Computed Tomography (CT)Segmentation | —Unverified | 0 |
| Just Shift It: Test-Time Prototype Shifting for Zero-Shot Generalization with Vision-Language Models | Mar 19, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity | Mar 18, 2024 | Zero-shot Generalization | CodeCode Available | 1 |
| Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model | Mar 17, 2024 | Image RestorationZero-shot Generalization | CodeCode Available | 2 |
| Dreaming of Many Worlds: Learning Contextual World Models Aids Zero-Shot Generalization | Mar 16, 2024 | Zero-shot Generalization | CodeCode Available | 1 |
| Temporal-spatial Adaptation of Promptable SAM Enhance Accuracy and Generalizability of cine CMR Segmentation | Mar 15, 2024 | Myocardium SegmentationSegmentation | —Unverified | 0 |
| FastSAM3D: An Efficient Segment Anything Model for 3D Volumetric Medical Images | Mar 14, 2024 | 3D Medical Imaging SegmentationGPU | CodeCode Available | 1 |
| Select and Distill: Selective Dual-Teacher Knowledge Transfer for Continual Learning on Vision-Language Models | Mar 14, 2024 | Continual LearningKnowledge Distillation | —Unverified | 0 |
| SAM-Lightening: A Lightweight Segment Anything Model with Dilated Flash Attention to Achieve 30 times Acceleration | Mar 14, 2024 | Transfer LearningZero-shot Generalization | —Unverified | 0 |
| Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment Anything | Mar 12, 2024 | GPUPoint Tracking | CodeCode Available | 1 |
| FluoroSAM: A Language-aligned Foundation Model for X-ray Image Segmentation | Mar 12, 2024 | DiagnosticImage Segmentation | CodeCode Available | 1 |
| RSBuilding: Towards General Remote Sensing Image Building Extraction and Change Detection with Foundation Model | Mar 12, 2024 | Change DetectionZero-shot Generalization | CodeCode Available | 2 |
| In-context Prompt Learning for Test-time Vision Recognition with Frozen Vision-language Model | Mar 10, 2024 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| SAM-PD: How Far Can SAM Take Us in Tracking and Segmenting Anything in Videos by Prompt Denoising | Mar 7, 2024 | DenoisingInstance Segmentation | CodeCode Available | 0 |
| Zero-shot Generalizable Incremental Learning for Vision-Language Object Detection | Mar 4, 2024 | Incremental Learningobject-detection | CodeCode Available | 1 |
| Kick Back & Relax++: Scaling Beyond Ground-Truth Depth with SlowTV & CribsTV | Mar 3, 2024 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 2 |
| Segment anything model for head and neck tumor segmentation with CT, PET and MRI multi-modality images | Feb 27, 2024 | SegmentationTumor Segmentation | CodeCode Available | 0 |
| Multimodal Instruction Tuning with Conditional Mixture of LoRA | Feb 24, 2024 | parameter-efficient fine-tuningZero-shot Generalization | CodeCode Available | 1 |
| Multi-Task Learning for Routing Problem with Cross-Problem Zero-Shot Generalization | Feb 23, 2024 | AttributeCombinatorial Optimization | CodeCode Available | 1 |
| IEPile: Unearthing Large-Scale Schema-Based Information Extraction Corpus | Feb 22, 2024 | Zero-shot Generalization | CodeCode Available | 3 |
| ARL2: Aligning Retrievers for Black-box Large Language Models via Self-guided Adaptive Relevance Labeling | Feb 21, 2024 | MMLURetrieval | CodeCode Available | 0 |
| Zero-shot generalization across architectures for visual classification | Feb 21, 2024 | ClassificationZero-shot Generalization | CodeCode Available | 0 |