| Practice with Graph-based ANN Algorithms on Sparse Data: Chi-square Two-tower model, HNSW, Sign Cauchy Projections | Jun 13, 2023 | GPU | —Unverified | 0 |
| PREBA: A Hardware/Software Co-Design for Multi-Instance GPU based AI Inference Servers | Nov 28, 2024 | GPU | —Unverified | 0 |
| Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining | Mar 6, 2025 | GPUHyperparameter Optimization | —Unverified | 0 |
| Predicting Efficiency/Effectiveness Trade-offs for Dense vs. Sparse Retrieval Strategy Selection | Sep 22, 2021 | GPUInformation Retrieval | —Unverified | 0 |
| Prediction-Assisted Online Distributed Deep Learning Workload Scheduling in GPU Clusters | Jan 9, 2025 | GPUScheduling | —Unverified | 0 |
| PredictionNet: Real-Time Joint Probabilistic Traffic Prediction for Planning, Control, and Simulation | Sep 23, 2021 | Autonomous DrivingGPU | —Unverified | 0 |
| Prediction of GPU Failures Under Deep Learning Workloads | Jan 27, 2022 | Deep LearningGPU | —Unverified | 0 |
| PRE-NAS: Predictor-assisted Evolutionary Neural Architecture Search | Apr 27, 2022 | GPUNeural Architecture Search | —Unverified | 0 |
| PreSto: An In-Storage Data Preprocessing System for Training Recommendation Models | Jun 11, 2024 | CPUGPU | —Unverified | 0 |
| Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference | Mar 12, 2025 | BlockingGPU | —Unverified | 0 |
| Prism: Unleashing GPU Sharing for Cost-Efficient Multi-LLM Serving | May 6, 2025 | GPUScheduling | —Unverified | 0 |
| Privacy preserving Neural Network Inference on Encrypted Data with GPUs | Nov 26, 2019 | BIG-bench Machine LearningCloud Computing | —Unverified | 0 |
| Privacy-Preserving Text Classification on BERT Embeddings with Homomorphic Encryption | Oct 5, 2022 | ClassificationGPU | —Unverified | 0 |
| Private LoRA Fine-tuning of Open-Source LLMs with Homomorphic Encryption | May 12, 2025 | GPUKnowledge Base Question Answering | —Unverified | 0 |
| PrivateLoRA For Efficient Privacy Preserving LLM | Nov 23, 2023 | GPULanguage Modelling | —Unverified | 0 |
| PrivFT: Private and Fast Text Classification with Homomorphic Encryption | Aug 19, 2019 | ClassificationCPU | —Unverified | 0 |
| ProAI: An Efficient Embedded AI Hardware for Automotive Applications -- a Benchmark Study | Aug 11, 2021 | Autonomous DrivingCPU | —Unverified | 0 |
| Probabilistic Deep Learning using Random Sum-Product Networks | Jun 5, 2018 | Deep LearningGPU | —Unverified | 0 |
| Probabilistic hypergraph grammars for efficient molecular optimization | Jun 5, 2019 | GPUreinforcement-learning | —Unverified | 0 |
| Probabilistic Inference of Simulation Parameters via Parallel Differentiable Simulation | Sep 18, 2021 | Bayesian InferenceCode Generation | —Unverified | 0 |
| Probe-based Rapid Hybrid Hyperspectral and Tissue Surface Imaging Aided by Fully Convolutional Networks | Jun 15, 2016 | GPU | —Unverified | 0 |
| Processing Energy Modeling for Neural Network Based Image Compression | Jun 29, 2023 | GPUImage Compression | —Unverified | 0 |
| Productive Reproducible Workflows for DNNs: A Case Study for Industrial Defect Detection | Jun 19, 2022 | CPUDefect Detection | —Unverified | 0 |
| Profiling based Out-of-core Hybrid Method for Large Neural Networks | Jul 11, 2019 | GPU | —Unverified | 0 |
| Progressively refined deep joint registration segmentation (ProRSeg) of gastrointestinal organs at risk: Application to MRI and cone-beam CT | Oct 25, 2022 | GPU | —Unverified | 0 |
| ProMoE: Fast MoE-based LLM Serving using Proactive Caching | Oct 29, 2024 | GPUMixture-of-Experts | —Unverified | 0 |
| PromptGCN: Bridging Subgraph Gaps in Lightweight GCNs | Oct 14, 2024 | GPURecommendation Systems | —Unverified | 0 |
| Prompts to Summaries: Zero-Shot Language-Guided Video Summarization | Jun 12, 2025 | GPUQuery focused video summarization | —Unverified | 0 |
| Properties on n-dimensional convolution for image deconvolution | Nov 30, 2017 | DenoisingGPU | —Unverified | 0 |
| PropNEAT -- Efficient GPU-Compatible Backpropagation over NeuroEvolutionary Augmenting Topology Networks | Nov 6, 2024 | Binary ClassificationGPU | —Unverified | 0 |
| Protea: Client Profiling within Federated Systems using Flower | Jul 3, 2022 | Federated LearningGPU | —Unverified | 0 |
| ProTEA: Programmable Transformer Encoder Acceleration on FPGA | Sep 21, 2024 | GPUMachine Translation | —Unverified | 0 |
| Protecting Confidentiality, Privacy and Integrity in Collaborative Learning | Dec 11, 2024 | CPUGPU | —Unverified | 0 |
| ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning | Jun 9, 2025 | DiversityGPU | —Unverified | 0 |
| ProTrain: Efficient LLM Training via Memory-Aware Techniques | Jun 12, 2024 | CPUGPU | —Unverified | 0 |
| Providing Meaningful Data Summarizations Using Exemplar-based Clustering in Industry 4.0 | May 25, 2021 | ClusteringCPU | —Unverified | 0 |
| Proximal Gradient Descent Unfolding Dense-spatial Spectral-attention Transformer for Compressive Spectral Imaging | Dec 25, 2023 | GPU | —Unverified | 0 |
| Pruned RNN-T for fast, memory-efficient ASR training | Jun 23, 2022 | DecoderGPU | —Unverified | 0 |
| Prune or quantize? Strategy for Pareto-optimally low-cost and accurate CNN | Sep 25, 2019 | CPUGPU | —Unverified | 0 |
| Pruning Compact ConvNets for Efficient Inference | Jan 11, 2023 | GPUNetwork Pruning | —Unverified | 0 |
| Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation | Dec 28, 2024 | CPUGPU | —Unverified | 0 |
| Pushing the Limits of Beam Search Decoding for Transducer-based ASR models | May 30, 2025 | GPU | —Unverified | 0 |
| Pushing the Limits of BFP on Narrow Precision LLM Inference | Jan 21, 2025 | GPU | —Unverified | 0 |
| Puzzle: Distillation-Based NAS for Inference-Optimized LLMs | Nov 28, 2024 | GPUKnowledge Distillation | —Unverified | 0 |
| PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch | Mar 25, 2025 | CPUGPU | —Unverified | 0 |
| Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection | Sep 1, 2018 | GPUObject | —Unverified | 0 |
| Python Workflows on HPC Systems | Dec 1, 2020 | GPU | —Unverified | 0 |
| PZnet: Efficient 3D ConvNet Inference on Manycore CPUs | Mar 18, 2019 | CPUGPU | —Unverified | 0 |
| QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning | Feb 16, 2024 | GPULanguage Modeling | —Unverified | 0 |
| QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources | Oct 11, 2023 | GPUparameter-efficient fine-tuning | —Unverified | 0 |