| High-Resolution Image Synthesis with Latent Diffusion Models | Dec 20, 2021 | Denoising, GPU | Code Available | 4 |
| Colossal-AI: A Unified Deep Learning System For Large-Scale Parallel Training | Oct 28, 2021 | Deep Learning, GPU | Code Available | 4 |
| GPUTreeShap: Massively Parallel Exact Calculation of SHAP Scores for Tree Ensembles | Oct 27, 2020 | BIG-bench Machine Learning, CPU | Code Available | 4 |
| fastai: A Layered API for Deep Learning | Feb 11, 2020 | Deep Learning, GPU | Code Available | 4 |
| XGBoost: Scalable GPU Accelerated Learning | Jun 29, 2018 | Cloud Computing, Data Compression | Code Available | 4 |
| Billion-scale similarity search with GPUs | Feb 28, 2017 | GPU, Image Similarity Search | Code Available | 4 |
| FourCastNet 3: A geometric approach to probabilistic machine-learning weather forecasting at scale | Jul 16, 2025 | Computational Efficiency, GPU | Code Available | 3 |
| Arctic Inference with Shift Parallelism: Fast and Efficient Open Source Inference System for Enterprise AI | Jul 16, 2025 | GPU | Code Available | 3 |
| Efficient and Generalizable Speaker Diarization via Structured Pruning of Self-Supervised Models | Jun 23, 2025 | Domain Adaptation, GPU | Code Available | 3 |
| ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation | Jun 22, 2025 | GPU, Image Generation | Code Available | 3 |