| Once-for-All: Train One Network and Specialize it for Efficient Deployment | Aug 26, 2019 | All, AutoML | Code Available | 1 |
| Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression | Mar 23, 2024 | Dimensionality Reduction, GPU | Code Available | 1 |
| Neural Architecture Search using Deep Neural Networks and Monte Carlo Tree Search | May 18, 2018 | GPU, Image Captioning | Code Available | 1 |
| Fast Nonlinear Vector Quantile Regression | May 30, 2022 | GPU, quantile regression | Code Available | 1 |
| One Model to Reconstruct Them All: A Novel Way to Use the Stochastic Noise in StyleGAN | Oct 21, 2020 | All, Decoder | Code Available | 1 |
| Queue management for SLO-oriented large language model serving | Jun 5, 2024 | Blocking, GPU | Code Available | 1 |
| BenchPress: A Deep Active Benchmark Generator | Aug 13, 2022 | Active Learning, CPU | Code Available | 1 |
| Fast Model Editing at Scale | Oct 21, 2021 | GPU, Language Modelling | Code Available | 1 |
| ConsistNet: Enforcing 3D Consistency for Multi-view Images Diffusion | Oct 16, 2023 | Depth Estimation, Depth Prediction | Code Available | 1 |
| FastONN -- Python based open-source GPU implementation for Operational Neural Networks | Jun 3, 2020 | GPU | Code Available | 1 |