| Search-Based Regular Expression Inference on a GPU | May 29, 2023 | CPUGPU | CodeCode Available | 1 |
| Fine-Tuning Language Models with Just Forward Passes | May 27, 2023 | GPUIn-Context Learning | CodeCode Available | 3 |
| Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference | May 27, 2023 | GPUImage Generation | CodeCode Available | 2 |
| RT-kNNS Unbound: Using RT Cores to Accelerate Unrestricted Neighbor Search | May 26, 2023 | GPU | —Unverified | 0 |
| Pulse shape discrimination based on the Tempotron: a powerful classifier on GPU | May 26, 2023 | CPUGPU | CodeCode Available | 0 |
| AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec | May 26, 2023 | CPUGPU | CodeCode Available | 2 |
| MixFormerV2: Efficient Fully Transformer Tracking | May 25, 2023 | CPUGPU | CodeCode Available | 2 |
| Sliding Window Sum Algorithms for Deep Neural Networks | May 25, 2023 | CPUGPU | —Unverified | 0 |
| Ghost in the Minecraft: Generally Capable Agents for Open-World Environments via Large Language Models with Text-based Knowledge and Memory | May 25, 2023 | Common Sense ReasoningCPU | CodeCode Available | 2 |
| Dynamic Data Augmentation via MCTS for Prostate MRI Segmentation | May 25, 2023 | Data AugmentationGPU | CodeCode Available | 0 |
| ComSL: A Composite Speech-Language Model for End-to-End Speech-to-Text Translation | May 24, 2023 | GPULanguage Modeling | CodeCode Available | 1 |
| Harnessing the Power of Large Language Models for Natural Language to First-Order Logic Translation | May 24, 2023 | Formal LogicGPU | CodeCode Available | 1 |
| Optimal Linear Subspace Search: Learning to Construct Fast and High-Quality Schedulers for Diffusion Models | May 24, 2023 | GPUImage Generation | CodeCode Available | 0 |
| AWESOME: GPU Memory-constrained Long Document Summarization using Memory Mechanism and Global Salient Content | May 24, 2023 | Document Summarizationdocument understanding | —Unverified | 0 |
| AutoDepthNet: High Frame Rate Depth Map Reconstruction using Commodity Depth and RGB Cameras | May 24, 2023 | Depth EstimationGPU | —Unverified | 0 |
| READ: Recurrent Adaptation of Large Transformers | May 24, 2023 | GPUTransfer Learning | —Unverified | 0 |
| Graph Analysis Using a GPU-based Parallel Algorithm: Quantum Clustering | May 24, 2023 | ClusteringGPU | —Unverified | 0 |
| QLoRA: Efficient Finetuning of Quantized LLMs | May 23, 2023 | ChatbotGPU | CodeCode Available | 6 |
| An Accelerated Pipeline for Multi-label Renal Pathology Image Segmentation at the Whole Slide Image Level | May 23, 2023 | GPUImage Segmentation | CodeCode Available | 1 |
| Enhancing Detail Preservation for Customized Text-to-Image Generation: A Regularization-Free Approach | May 23, 2023 | GPUImage Generation | CodeCode Available | 2 |
| Goat: Fine-tuned LLaMA Outperforms GPT-4 on Arithmetic Tasks | May 23, 2023 | AttributeDataset Generation | CodeCode Available | 1 |
| Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding | May 22, 2023 | GPUIn-Context Learning | —Unverified | 0 |
| Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference | May 22, 2023 | Computational EfficiencyGPU | CodeCode Available | 0 |
| Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models | May 21, 2023 | GPUQuantization | —Unverified | 0 |
| Taming Resource Heterogeneity In Distributed ML Training With Dynamic Batching | May 20, 2023 | CPUGPU | —Unverified | 0 |