| PARAMANU-GANITA: Language Model with Mathematical Capabilities | Apr 22, 2024 | Domain AdaptationGSM8K | —Unverified | 0 | 0 |
| Parameter-Efficient Checkpoint Merging via Metrics-Weighted Averaging | Apr 23, 2025 | Mathematical Reasoningparameter-efficient fine-tuning | —Unverified | 0 | 0 |
| Path-Consistency: Prefix Enhancement for Efficient Inference in LLM | Aug 25, 2024 | Code GenerationCommon Sense Reasoning | —Unverified | 0 | 0 |
| Path Planning for Masked Diffusion Model Sampling | Feb 5, 2025 | Code GenerationIn-Context Learning | —Unverified | 0 | 0 |
| Pensez: Less Data, Better Reasoning -- Rethinking French LLM | Mar 17, 2025 | Large Language ModelMath | —Unverified | 0 | 0 |
| PhysUniBench: An Undergraduate-Level Physics Reasoning Benchmark for Multimodal Models | Jun 21, 2025 | Mathematical ReasoningMultiple-choice | —Unverified | 0 | 0 |
| Pi-GPS: Enhancing Geometry Problem Solving by Unleashing the Power of Diagrammatic Information | Mar 7, 2025 | Geometry Problem SolvingMathematical Reasoning | —Unverified | 0 | 0 |
| Plug-and-Play Training Framework for Preference Optimization | Dec 30, 2024 | Mathematical ReasoningQuestion Answering | —Unverified | 0 | 0 |
| Policy Guided Tree Search for Enhanced LLM Reasoning | Feb 4, 2025 | Mathematical ReasoningNavigate | —Unverified | 0 | 0 |
| PolyMath: Evaluating Mathematical Reasoning in Multilingual Contexts | Apr 25, 2025 | DiversityMathematical Reasoning | —Unverified | 0 | 0 |