| Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers | May 20, 2025 | GPUVideo Generation | CodeCode Available | 2 |
| Accelerating Sparse Deep Neural Networks | Apr 16, 2021 | GPUMath | CodeCode Available | 2 |
| GPTAQ: Efficient Finetuning-Free Quantization for Asymmetric Calibration | Apr 3, 2025 | GPUQuantization | CodeCode Available | 2 |
| An Efficient Sparse Kernel Generator for O(3)-Equivariant Deep Networks | Jan 23, 2025 | GPU | CodeCode Available | 2 |
| Differentiable Voxelization and Mesh Morphing | Jul 15, 2024 | GPU | CodeCode Available | 2 |
| nvTorchCam: An Open-source Library for Camera-Agnostic Differentiable Geometric Vision | Oct 15, 2024 | Deep LearningGPU | CodeCode Available | 2 |
| ArchesWeather & ArchesWeatherGen: a deterministic and generative model for efficient ML weather forecasting | Dec 17, 2024 | GPUWeather Forecasting | CodeCode Available | 2 |
| CrypTen: Secure Multi-Party Computation Meets Machine Learning | Sep 2, 2021 | BIG-bench Machine LearningGPU | CodeCode Available | 2 |
| DiG: Scalable and Efficient Diffusion Models with Gated Linear Attention | May 28, 2024 | GPUMamba | CodeCode Available | 2 |
| GPU Performance Portability needs Autotuning | Apr 30, 2025 | GPU | CodeCode Available | 2 |