| Hard Mixtures of Experts for Large Scale Weakly Supervised Vision | Apr 20, 2017 | GPUMixture-of-Experts | —Unverified | 0 |
| Impact of GPU uncertainty on the training of predictive deep neural networks | Sep 3, 2021 | CPUGPU | —Unverified | 0 |
| An Analysis of Collocation on GPUs for Deep Learning Training | Sep 13, 2022 | Deep LearningGPU | —Unverified | 0 |
| Impact of ML Optimization Tactics on Greener Pre-Trained ML Models | Sep 19, 2024 | GPUimage-classification | —Unverified | 0 |
| Automated Root Causing of Cloud Incidents using In-Context Learning with GPT-4 | Jan 24, 2024 | GPUIn-Context Learning | —Unverified | 0 |
| BASS: Batched Attention-optimized Speculative Sampling | Apr 24, 2024 | GPUHumanEval | —Unverified | 0 |
| Implementation of Parallel Simplified Swarm Optimization in CUDA | Oct 1, 2021 | GPU | —Unverified | 0 |
| Implementation of Real-Time Automotive SAR Imaging | Jun 16, 2023 | GPU | —Unverified | 0 |
| LANA: Latency Aware Network Acceleration | Jul 12, 2021 | CPUGPU | —Unverified | 0 |
| HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks | Jan 20, 2023 | GPULow-rank compression | —Unverified | 0 |