| ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification | May 23, 2024 | GPU, GSM8K | Code Available | 1 |
| Sparse-Tuning: Adapting Vision Transformers with Efficient Fine-tuning and Inference | May 23, 2024 | GPU, parameter-efficient fine-tuning | Code Available | 1 |
| ReCycle: Resilient Training of Large DNNs using Pipeline Adaptation | May 22, 2024 | GPU | Unverified | 0 |
| Attention as an RNN | May 22, 2024 | GPU, Time Series | Code Available | 1 |
| HoverFast: an accurate, high-throughput, clinically deployable nuclear segmentation tool for brightfield digital pathology images | May 22, 2024 | GPU, Knowledge Distillation | Unverified | 0 |
| Adversarial Training of Two-Layer Polynomial and ReLU Activation Networks via Convex Optimization | May 22, 2024 | GPU | Code Available | 0 |
| What is Your Data Worth to GPT? LLM-Scale Data Valuation with Influence Functions | May 22, 2024 | Data Valuation, GPU | Code Available | 2 |
| PyramidInfer: Pyramid KV Cache Compression for High-throughput LLM Inference | May 21, 2024 | GPU | Code Available | 1 |
| Personalized Residuals for Concept-Driven Text-to-Image Generation | May 21, 2024 | GPU, Image Generation | Unverified | 0 |
| Parallelization of the K-Means Algorithm with Applications to Big Data Clustering | May 20, 2024 | Clustering, GPU | Unverified | 0 |