| Practice with Graph-based ANN Algorithms on Sparse Data: Chi-square Two-tower model, HNSW, Sign Cauchy Projections | Jun 13, 2023 | GPU | —Unverified | 0 |
| PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers | Nov 28, 2024 | GPU | —Unverified | 0 |
| Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining | Mar 6, 2025 | GPUHyperparameter Optimization | —Unverified | 0 |
| Predicting Efficiency/Effectiveness Trade-offs for Dense vs. Sparse Retrieval Strategy Selection | Sep 22, 2021 | GPUInformation Retrieval | —Unverified | 0 |
| Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters | Jan 9, 2025 | GPUScheduling | —Unverified | 0 |
| PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation | Sep 23, 2021 | Autonomous DrivingGPU | —Unverified | 0 |
| Prediction of GPU Failures Under Deep Learning Workloads | Jan 27, 2022 | Deep LearningGPU | —Unverified | 0 |
| PRE-NAS: Predictor-assisted Evolutionary Neural Architecture Search | Apr 27, 2022 | GPUNeural Architecture Search | —Unverified | 0 |
| PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models | Jun 11, 2024 | CPUGPU | —Unverified | 0 |
| Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference | Mar 12, 2025 | BlockingGPU | —Unverified | 0 |