| ProMoE: Fast MoE-based LLM Serving using Proactive Caching | Oct 29, 2024 | GPU, Mixture-of-Experts | —Unverified | 0 |
| PromptGCN: Bridging Subgraph Gaps in Lightweight GCNs | Oct 14, 2024 | GPU, Recommendation Systems | —Unverified | 0 |
| Prompts to Summaries: Zero-Shot Language-Guided Video Summarization | Jun 12, 2025 | GPU, Query focused video summarization | —Unverified | 0 |
| Properties on n-dimensional convolution for image deconvolution | Nov 30, 2017 | Denoising, GPU | —Unverified | 0 |
| PropNEAT -- Efficient GPU-Compatible Backpropagation over NeuroEvolutionary Augmenting Topology Networks | Nov 6, 2024 | Binary Classification, GPU | —Unverified | 0 |
| Protea: Client Profiling within Federated Systems using Flower | Jul 3, 2022 | Federated Learning, GPU | —Unverified | 0 |
| ProTEA: Programmable Transformer Encoder Acceleration on FPGA | Sep 21, 2024 | GPU, Machine Translation | —Unverified | 0 |
| Protecting Confidentiality, Privacy and Integrity in Collaborative Learning | Dec 11, 2024 | CPU, GPU | —Unverified | 0 |
| ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning | Jun 9, 2025 | Diversity, GPU | —Unverified | 0 |
| ProTrain: Efficient LLM Training via Memory-Aware Techniques | Jun 12, 2024 | CPU, GPU | —Unverified | 0 |
| Providing Meaningful Data Summarizations Using Exemplar-based Clustering in Industry 4.0 | May 25, 2021 | Clustering, CPU | —Unverified | 0 |
| Proximal Gradient Descent Unfolding Dense-spatial Spectral-attention Transformer for Compressive Spectral Imaging | Dec 25, 2023 | GPU | —Unverified | 0 |
| Pruned RNN-T for fast, memory-efficient ASR training | Jun 23, 2022 | Decoder, GPU | —Unverified | 0 |
| Prune or quantize? Strategy for Pareto-optimally low-cost and accurate CNN | Sep 25, 2019 | CPU, GPU | —Unverified | 0 |
| Pruning Compact ConvNets for Efficient Inference | Jan 11, 2023 | GPU, Network Pruning | —Unverified | 0 |
| Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation | Dec 28, 2024 | CPU, GPU | —Unverified | 0 |
| Pushing the Limits of Beam Search Decoding for Transducer-based ASR models | May 30, 2025 | GPU | —Unverified | 0 |
| Pushing the Limits of BFP on Narrow Precision LLM Inference | Jan 21, 2025 | GPU | —Unverified | 0 |
| Puzzle: Distillation-Based NAS for Inference-Optimized LLMs | Nov 28, 2024 | GPU, Knowledge Distillation | —Unverified | 0 |
| PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch | Mar 25, 2025 | CPU, GPU | —Unverified | 0 |
| Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection | Sep 1, 2018 | GPU, Object | —Unverified | 0 |
| Python Workflows on HPC Systems | Dec 1, 2020 | GPU | —Unverified | 0 |
| PZnet: Efficient 3D ConvNet Inference on Manycore CPUs | Mar 18, 2019 | CPU, GPU | —Unverified | 0 |
| QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning | Feb 16, 2024 | GPU, Language Modeling | —Unverified | 0 |
| QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources | Oct 11, 2023 | GPU, parameter-efficient fine-tuning | —Unverified | 0 |