| Bayesian Optimization for auto-tuning GPU kernels | Nov 26, 2021 | Bayesian Optimization, GPU | Code Available | 1 | 5 |
| Apt-Serve: Adaptive Request Scheduling on Hybrid Cache for Scalable LLM Inference Serving | Apr 10, 2025 | GPU, Large Language Model | Code Available | 1 | 5 |
| Dr. Top-k: Delegate-Centric Top-k on GPUs | Sep 16, 2021 | GPU | Code Available | 1 | 5 |
| Is This Loss Informative? Faster Text-to-Image Customization by Tracking Objective Dynamics | Feb 9, 2023 | GPU, Image Generation | Code Available | 1 | 5 |
| DSXplore: Optimizing Convolutional Neural Networks via Sliding-Channel Convolutions | Jan 4, 2021 | GPU | Code Available | 1 | 5 |
| Microscopy Image Restoration using Deep Learning on W2S | Apr 22, 2020 | CPU, Deep Learning | Code Available | 1 | 5 |
| DropIT: Dropping Intermediate Tensors for Memory-Efficient DNN Training | Feb 28, 2022 | GPU, Instance Segmentation | Code Available | 1 | 5 |
| DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation | Feb 27, 2024 | GPU, parameter-efficient fine-tuning | Code Available | 1 | 5 |
| A C Code Generator for Fast Inference and Simple Deployment of Convolutional Neural Networks on Resource Constrained Systems | Jan 14, 2020 | C++ code, Code Generation | Code Available | 1 | 5 |
| DR-SPAAM: A Spatial-Attention and Auto-regressive Model for Person Detection in 2D Range Data | Apr 29, 2020 | GPU, Human Detection | Code Available | 1 | 5 |