| Exploiting Student Parallelism for Low-latency GPU Inference of BERT-like Models in Online Services | Aug 22, 2024 | GPU | —Unverified | 0 |
| PCGRL+: Scaling, Control and Generalization in Reinforcement Learning Level Generators | Aug 22, 2024 | CPUGPU | —Unverified | 0 |
| Visual Localization in 3D Maps: Comparing Point Cloud, Mesh, and NeRF Representations | Aug 21, 2024 | GPUImage Retrieval | —Unverified | 0 |
| MARLIN: Mixed-Precision Auto-Regressive Parallel Inference on Large Language Models | Aug 21, 2024 | GPUQuantization | CodeCode Available | 5 |
| Practical Aspects on Solving Differential Equations Using Deep Learning: A Primer | Aug 21, 2024 | Deep LearningGPU | CodeCode Available | 0 |
| Slicing Input Features to Accelerate Deep Learning: A Case Study with Graph Neural Networks | Aug 21, 2024 | GPUGraph Learning | —Unverified | 0 |
| Vision HgNN: An Electron-Micrograph is Worth Hypergraph of Hypernodes | Aug 21, 2024 | GPU | —Unverified | 0 |
| EmbodiedSAM: Online Segment Any 3D Thing in Real Time | Aug 21, 2024 | 3D Instance SegmentationGPU | CodeCode Available | 4 |
| Mixed Sparsity Training: Achieving 4 FLOP Reduction for Transformer Pretraining | Aug 21, 2024 | GPU | —Unverified | 0 |
| UKAN: Unbound Kolmogorov-Arnold Network Accompanied with Accelerated Library | Aug 20, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |