| Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation | Dec 28, 2024 | CPUGPU | —Unverified | 0 |
| Pushing the Limits of Beam Search Decoding for Transducer-based ASR models | May 30, 2025 | GPU | —Unverified | 0 |
| Pushing the Limits of BFP on Narrow Precision LLM Inference | Jan 21, 2025 | GPU | —Unverified | 0 |
| Puzzle: Distillation-Based NAS for Inference-Optimized LLMs | Nov 28, 2024 | GPUKnowledge Distillation | —Unverified | 0 |
| PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch | Mar 25, 2025 | CPUGPU | —Unverified | 0 |
| Pyramid Dilated Deeper ConvLSTM for Video Salient Object Detection | Sep 1, 2018 | GPUObject | —Unverified | 0 |
| Python Workflows on HPC Systems | Dec 1, 2020 | GPU | —Unverified | 0 |
| PZnet: Efficient 3D ConvNet Inference on Manycore CPUs | Mar 18, 2019 | CPUGPU | —Unverified | 0 |
| QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning | Feb 16, 2024 | GPULanguage Modeling | —Unverified | 0 |
| QFT: Quantized Full-parameter Tuning of LLMs with Affordable Resources | Oct 11, 2023 | GPUparameter-efficient fine-tuning | —Unverified | 0 |