| VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion | Feb 23, 2023 | 3D geometry3D Semantic Scene Completion | CodeCode Available | 3 |
| Cramming: Training a Language Model on a Single GPU in One Day | Dec 28, 2022 | GPULanguage Modeling | CodeCode Available | 3 |
| MegaBlocks: Efficient Sparse Training with Mixture-of-Experts | Nov 29, 2022 | GPUMixture-of-Experts | CodeCode Available | 3 |
| What Language Model to Train if You Have One Million GPU Hours? | Oct 27, 2022 | GPULanguage Modeling | CodeCode Available | 3 |
| A GPU-specialized Inference Parameter Server for Large-Scale Deep Recommendation Models | Oct 17, 2022 | CPUGPU | CodeCode Available | 3 |
| PyTorch Image Quality: Metrics for Image Quality Assessment | Aug 31, 2022 | GPUImage Quality Assessment | CodeCode Available | 3 |
| USB: A Unified Semi-supervised Learning Benchmark for Classification | Aug 12, 2022 | General ClassificationGPU | CodeCode Available | 3 |
| ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech | Jul 13, 2022 | DenoisingGPU | CodeCode Available | 3 |
| ASE: Large-Scale Reusable Adversarial Skill Embeddings for Physically Simulated Characters | May 4, 2022 | GPUImitation Learning | CodeCode Available | 3 |
| Fast Sampling of Diffusion Models with Exponential Integrator | Apr 29, 2022 | GPU | CodeCode Available | 3 |
| Fast-MD: Fast Multi-Decoder End-to-End Speech Translation with Non-Autoregressive Hidden Intermediates | Sep 27, 2021 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 3 |
| Robust High-Resolution Video Matting with Temporal Guidance | Aug 25, 2021 | 4kGPU | CodeCode Available | 3 |
| Real-Time High-Resolution Background Matting | Dec 14, 2020 | 4kGPU | CodeCode Available | 3 |
| Biomedical and Clinical English Model Packages in the Stanza Python NLP Library | Jul 29, 2020 | GPUNamed Entity Recognition | CodeCode Available | 3 |
| Generalized Focal Loss: Learning Qualified and Distributed Bounding Boxes for Dense Object Detection | Jun 8, 2020 | Dense Object DetectionGeneral Classification | CodeCode Available | 3 |
| U^2-Net: Going Deeper with Nested U-Structure for Salient Object Detection | May 18, 2020 | Dichotomous Image SegmentationGPU | CodeCode Available | 3 |
| Machine Learning in Python: Main developments and technology trends in data science, machine learning, and artificial intelligence | Feb 12, 2020 | BIG-bench Machine LearningGPU | CodeCode Available | 3 |
| mlpack 3: a fast, flexible machine learning library | Jun 18, 2018 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 3 |
| Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded Modes | Aug 29, 2017 | BIG-bench Machine LearningCPU | CodeCode Available | 3 |
| U-Net: Convolutional Networks for Biomedical Image Segmentation | May 18, 2015 | Cell SegmentationCell Tracking | CodeCode Available | 3 |
| AutoTriton: Automatic Triton Programming with Reinforcement Learning in LLMs | Jul 8, 2025 | GPUreinforcement-learning | CodeCode Available | 2 |
| any4: Learned 4-bit Numeric Representation for LLMs | Jul 7, 2025 | GPUGSM8K | CodeCode Available | 2 |
| MathOptAI.jl: Embed trained machine learning predictors into JuMP models | Jul 3, 2025 | CPUGaussian Processes | CodeCode Available | 2 |
| VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and Collisions | Jun 29, 2025 | Computational EfficiencyGPU | CodeCode Available | 2 |
| MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation | Jun 29, 2025 | GPUOptical Flow Estimation | CodeCode Available | 2 |