| Pathogen Infection Recovery Probability (PIRP) Versus Proinflammatory Anti-Pathogen Species (PIAPS) Levels: Modelling and Therapeutic Strategies | Mar 11, 2020 | Math | —Unverified | 0 |
| PBEBench: A Multi-Step Programming by Examples Reasoning Benchmark inspired by Historical Linguistics | May 29, 2025 | Math | —Unverified | 0 |
| Why are NLP Models Fumbling at Elementary Math? A Survey of Automatic Word Problem Solvers | Jan 16, 2022 | MathMathematical Reasoning | —Unverified | 0 |
| Pensez: Less Data, Better Reasoning -- Rethinking French LLM | Mar 17, 2025 | Large Language ModelMath | —Unverified | 0 |
| Perceptual Decoupling for Scalable Multi-modal Reasoning via Reward-Optimized Captioning | Jun 5, 2025 | MathVisual Grounding | —Unverified | 0 |
| Performance Analysis and Improvement of Parallel Differential Evolution | Jan 17, 2021 | global-optimizationMath | —Unverified | 0 |
| Performance Comparison of Large Language Models on Advanced Calculus Problems | Mar 5, 2025 | MathMathematical Problem-Solving | —Unverified | 0 |
| Performance Evaluation and Optimization of Math-Similarity Search | May 29, 2015 | Math | —Unverified | 0 |
| Performance, Opaqueness, Consequences, and Assumptions: Simple questions for responsible planning of machine learning solutions | Aug 21, 2022 | Math | —Unverified | 0 |
| Permutation Complexity Bound on Out-Sample Error | Dec 1, 2010 | Math | —Unverified | 0 |
| Permuted and Unlinked Monotone Regression in R^d: an approach based on mixture modeling and optimal transport | Jan 10, 2022 | DenoisingMath | —Unverified | 0 |
| Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems | Apr 10, 2024 | Math | —Unverified | 0 |
| PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation | Oct 2, 2024 | Data AugmentationDiversity | —Unverified | 0 |
| Pheromone-based Learning of Optimal Reasoning Paths | Jan 31, 2025 | ARCGSM8K | —Unverified | 0 |
| Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone | Apr 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math | Apr 30, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 |
| Phi-4-reasoning Technical Report | Apr 30, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 |
| APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries | Apr 27, 2025 | Automated Theorem ProvingBug fixing | —Unverified | 0 |
| Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems | Aug 29, 2024 | Math | —Unverified | 0 |
| PixelWorld: Towards Perceiving Everything as Pixels | Jan 31, 2025 | Math | —Unverified | 0 |
| 構建一個中文國小數學文字問題語料庫(Building a Corpus for Developing the Chinese Elementary School Math Word Problem Solver)[In Chinese] | Oct 1, 2016 | Math | —Unverified | 0 |
| Plan for Speed -- Dilated Scheduling for Masked Diffusion Language Models | Jun 23, 2025 | Code CompletionGSM8K | —Unverified | 0 |
| Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning | Mar 13, 2025 | In-Context LearningMath | —Unverified | 0 |
| An upper bound of the mutation probability in the genetic algorithm for general 0-1 knapsack problem | Mar 17, 2024 | DiversityEvolutionary Algorithms | —Unverified | 0 |
| PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning | Sep 25, 2024 | GSM8KMath | —Unverified | 0 |
| Polyak's Heavy Ball Method Achieves Accelerated Local Rate of Convergence under Polyak-Lojasiewicz Inequality | Oct 22, 2024 | Math | —Unverified | 0 |
| Prediction with Expert Advice: a PDE Perspective | Apr 25, 2019 | MathPrediction | —Unverified | 0 |
| A novel variational model for image registration using Gaussian curvature | Apr 28, 2015 | Image RegistrationMath | —Unverified | 0 |
| Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs | Feb 4, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| Understanding the Progression of Educational Topics via Semantic Matching | Feb 10, 2024 | Math | —Unverified | 0 |
| Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning | May 26, 2025 | DiversityMath | —Unverified | 0 |
| Why are NLP Models Fumbling at Elementary Math? A Survey of Deep Learning based Word Problem Solvers | May 31, 2022 | MathMathematical Reasoning | —Unverified | 0 |
| ProblemSolver at SemEval-2019 Task 10: Sequence-to-Sequence Learning and Expression Trees | Jun 1, 2019 | MathQuestion Answering | —Unverified | 0 |
| A note on the option price and 'Mass at zero in the uncorrelated SABR model and implied volatility asymptotics' | Nov 1, 2020 | MathNumerical Integration | —Unverified | 0 |
| An Optimal Transport approach to arbitrage correction: Application to volatility Stress-Tests | Jan 21, 2025 | Math | —Unverified | 0 |
| An Optimal Likelihood Free Method for Biological Model Selection | Aug 3, 2022 | Drug DiscoveryMath | —Unverified | 0 |
| A Nonlocal Graph-PDE and Higher-Order Geometric Integration for Image Labeling | May 9, 2022 | Math | —Unverified | 0 |
| Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment | Jun 17, 2024 | Logical ReasoningMath | —Unverified | 0 |
| Prompt Baking | Sep 4, 2024 | ARCGSM8K | —Unverified | 0 |
| PromptRobust: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts | Jun 7, 2023 | Cross-Lingual Paraphrase IdentificationMachine Translation | —Unverified | 0 |
| Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap | Jan 5, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| PromptHive: Bringing Subject Matter Experts Back to the Forefront with Collaborative Prompt Engineering for Educational Content Creation | Oct 21, 2024 | MathPrompt Engineering | —Unverified | 0 |
| Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad | Mar 27, 2025 | MathMathematical Reasoning | —Unverified | 0 |
| PRSA: Prompt Stealing Attacks against Real-World Prompt Services | Feb 29, 2024 | Math | —Unverified | 0 |
| Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers | May 7, 2025 | MathReinforcement Learning (RL) | —Unverified | 0 |
| Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning | Jun 20, 2024 | GSM8KHeuristic Search | —Unverified | 0 |
| QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning | Aug 20, 2024 | BenchmarkingLanguage Modelling | —Unverified | 0 |
| Quantitative Methods for Optimizing Patient Outcomes in Liver Transplantation | May 31, 2023 | ManagementMath | —Unverified | 0 |
| An Improved Coarse-to-Fine Method for Solving Generation Tasks | Apr 1, 2019 | MathMath Word Problem Solving | —Unverified | 0 |
| Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning | Jan 6, 2025 | MathMathematical Reasoning | —Unverified | 0 |