| WaferLLM: Large Language Model Inference at Wafer Scale | Feb 6, 2025 | GPULanguage Modeling | CodeCode Available | 2 |
| An Efficient Sparse Kernel Generator for O(3)-Equivariant Deep Networks | Jan 23, 2025 | GPU | CodeCode Available | 2 |
| Recurrent Diffusion for Large-Scale Parameter Generation | Jan 20, 2025 | GPU | CodeCode Available | 2 |
| A User's Guide to KSig: GPU-Accelerated Computation of the Signature Kernel | Jan 13, 2025 | GPU | CodeCode Available | 2 |
| Generalized and Efficient 2D Gaussian Splatting for Arbitrary-scale Super-Resolution | Jan 12, 2025 | Computational EfficiencyGPU | CodeCode Available | 2 |
| TakuNet: an Energy-Efficient CNN for Real-Time Inference on Embedded UAV systems in Emergency Response Scenarios | Jan 10, 2025 | Aerial Scene ClassificationCPU | CodeCode Available | 2 |
| MBQ: Modality-Balanced Quantization for Large Vision-Language Models | Dec 27, 2024 | GPUQuantization | CodeCode Available | 2 |
| ArchesWeather & ArchesWeatherGen: a deterministic and generative model for efficient ML weather forecasting | Dec 17, 2024 | GPUWeather Forecasting | CodeCode Available | 2 |
| FlashRNN: Optimizing Traditional RNNs on Modern Hardware | Dec 10, 2024 | GPULogical Reasoning | CodeCode Available | 2 |
| ManiSkill-HAB: A Benchmark for Low-Level Manipulation in Home Rearrangement Tasks | Dec 9, 2024 | GPUImitation Learning | CodeCode Available | 2 |