| Graph Analysis Using a GPU-based Parallel Algorithm: Quantum Clustering | May 24, 2023 | ClusteringGPU | —Unverified | 0 |
| Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding | May 22, 2023 | GPUIn-Context Learning | —Unverified | 0 |
| Flover: A Temporal Fusion Framework for Efficient Autoregressive Model Parallel Inference | May 22, 2023 | Computational EfficiencyGPU | CodeCode Available | 0 |
| Integer or Floating Point? New Outlooks for Low-Bit Quantization on Large Language Models | May 21, 2023 | GPUQuantization | —Unverified | 0 |
| DexPBT: Scaling up Dexterous Manipulation for Hand-Arm Systems with Population Based Training | May 20, 2023 | Deep Reinforcement LearningGPU | —Unverified | 0 |
| Taming Resource Heterogeneity In Distributed ML Training With Dynamic Batching | May 20, 2023 | CPUGPU | —Unverified | 0 |
| PANNA 2.0: Efficient neural network interatomic potentials and new architectures | May 19, 2023 | Efficient Neural NetworkGPU | CodeCode Available | 0 |
| Boost Vision Transformer with GPU-Friendly Sparsity and Quantization | May 18, 2023 | BenchmarkingGPU | —Unverified | 0 |
| ACRoBat: Optimizing Auto-batching of Dynamic Deep Learning at Compile Time | May 17, 2023 | Code GenerationDeep Learning | —Unverified | 0 |
| Facial Expression Recognition at the Edge: CPU vs GPU vs VPU vs TPU | May 17, 2023 | CPUFacial Expression Recognition | —Unverified | 0 |