| Neuron to Graph: Interpreting Language Model Neurons at Scale | May 31, 2023 | GPULanguage Modeling | CodeCode Available | 0 |
| Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts | May 30, 2023 | CPUGPU | CodeCode Available | 1 |
| CTSN: Predicting Cloth Deformation for Skeleton-based Characters with a Two-stream Skinning Network | May 30, 2023 | GPU | —Unverified | 0 |
| Bringing regularized optimal transport to lightspeed: a splitting method adapted for GPUs | May 29, 2023 | Domain AdaptationGPU | —Unverified | 0 |
| SlimFit: Memory-Efficient Fine-Tuning of Transformer-based Models Using Training Dynamics | May 29, 2023 | GPUQuantization | —Unverified | 0 |
| Search-Based Regular Expression Inference on a GPU | May 29, 2023 | CPUGPU | CodeCode Available | 1 |
| Fine-Tuning Language Models with Just Forward Passes | May 27, 2023 | GPUIn-Context Learning | CodeCode Available | 3 |
| Accelerating Text-to-Image Editing via Cache-Enabled Sparse Diffusion Inference | May 27, 2023 | GPUImage Generation | CodeCode Available | 2 |
| RT-kNNS Unbound: Using RT Cores to Accelerate Unrestricted Neighbor Search | May 26, 2023 | GPU | —Unverified | 0 |
| Pulse shape discrimination based on the Tempotron: a powerful classifier on GPU | May 26, 2023 | CPUGPU | CodeCode Available | 0 |