| READ: Recurrent Adaptation of Large Transformers | May 24, 2023 | GPU, Transfer Learning | Unverified | 0 |
| Graph Analysis Using a GPU-based Parallel Algorithm: Quantum Clustering | May 24, 2023 | Clustering, GPU | Unverified | 0 |
| QLoRA: Efficient Finetuning of Quantized LLMs | May 23, 2023 | Chatbot, GPU | Code Available | 6 |
| An Accelerated Pipeline for Multi-label Renal Pathology Image Segmentation at the Whole Slide Image Level | May 23, 2023 | GPU, Image Segmentation | Code Available | 1 |
| Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach | May 23, 2023 | GPU, Image Generation | Code Available | 2 |
| Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks | May 23, 2023 | Attribute, Dataset Generation | Code Available | 1 |
| Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding | May 22, 2023 | GPU, In-Context Learning | Unverified | 0 |
| Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference | May 22, 2023 | Computational Efficiency, GPU | Code Available | 0 |
| Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models | May 21, 2023 | GPU, Quantization | Unverified | 0 |
| Taming Resource Heterogeneity In Distributed ML Training With Dynamic Batching | May 20, 2023 | CPU, GPU | Unverified | 0 |