SOTAVerified

Math

Papers

Showing 951-1000 of 1596 papers

Title | Status | Hype
Parameterized Approximation for Robust Clustering in Discrete Geometric Spaces | | 0
Pathogen Infection Recovery Probability (PIRP) Versus Proinflammatory Anti-Pathogen Species (PIAPS) Levels: Modelling and Therapeutic Strategies | | 0
PBEBench: A Multi-Step Programming by Examples Reasoning Benchmark inspired by Historical Linguistics | | 0
Why are NLP Models Fumbling at Elementary Math? A Survey of Automatic Word Problem Solvers | | 0
Pensez: Less Data, Better Reasoning -- Rethinking French LLM | | 0
Perceptual Decoupling for Scalable Multi-modal Reasoning via Reward-Optimized Captioning | | 0
Performance Analysis and Improvement of Parallel Differential Evolution | | 0
Performance Comparison of Large Language Models on Advanced Calculus Problems | | 0
Performance Evaluation and Optimization of Math-Similarity Search | | 0
Performance, Opaqueness, Consequences, and Assumptions: Simple questions for responsible planning of machine learning solutions | | 0
Permutation Complexity Bound on Out-Sample Error | | 0
Permuted and Unlinked Monotone Regression in R^d: an approach based on mixture modeling and optimal transport | | 0
Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems | | 0
PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation | | 0
Pheromone-based Learning of Optimal Reasoning Paths | | 0
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone | | 0
Phi-4-Mini-Reasoning: Exploring the Limits of Small Reasoning Language Models in Math | | 0
Phi-4-reasoning Technical Report | | 0
APE-Bench I: Towards File-level Automated Proof Engineering of Formal Math Libraries | | 0
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems | | 0
PixelWorld: Towards Perceiving Everything as Pixels | | 0
構建一個中文國小數學文字問題語料庫 (Building a Corpus for Developing the Chinese Elementary School Math Word Problem Solver) [In Chinese] | | 0
Plan for Speed -- Dilated Scheduling for Masked Diffusion Language Models | | 0
Understanding the Logical Capabilities of Large Language Models via Out-of-Context Representation Learning | | 0
An upper bound of the mutation probability in the genetic algorithm for general 0-1 knapsack problem | | 0
PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning | | 0
Polyak's Heavy Ball Method Achieves Accelerated Local Rate of Convergence under Polyak-Lojasiewicz Inequality | | 0
Prediction with Expert Advice: a PDE Perspective | | 0
A novel variational model for image registration using Gaussian curvature | | 0
Premise-Augmented Reasoning Chains Improve Error Identification in Math reasoning with LLMs | | 0
Understanding the Progression of Educational Topics via Semantic Matching | | 0
Prismatic Synthesis: Gradient-based Data Diversification Boosts Generalization in LLM Reasoning | | 0
Why are NLP Models Fumbling at Elementary Math? A Survey of Deep Learning based Word Problem Solvers | | 0
ProblemSolver at SemEval-2019 Task 10: Sequence-to-Sequence Learning and Expression Trees | | 0
A note on the option price and 'Mass at zero in the uncorrelated SABR model and implied volatility asymptotics' | | 0
An Optimal Transport approach to arbitrage correction: Application to volatility Stress-Tests | | 0
An Optimal Likelihood Free Method for Biological Model Selection | | 0
A Nonlocal Graph-PDE and Higher-Order Geometric Integration for Image Labeling | | 0
Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment | | 0
Prompt Baking | | 0
PromptRobust: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts | | 0
Understand, Solve and Translate: Bridging the Multilingual Mathematical Reasoning Gap | | 0
PromptHive: Bringing Subject Matter Experts Back to the Forefront with Collaborative Prompt Engineering for Educational Content Creation | | 0
Proof or Bluff? Evaluating LLMs on 2025 USA Math Olympiad | | 0
PRSA: Prompt Stealing Attacks against Real-World Prompt Services | | 0
Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers | | 0
Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning | | 0
QPO: Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning | | 0
Quantitative Methods for Optimizing Patient Outcomes in Liver Transplantation | | 0
An Improved Coarse-to-Fine Method for Solving Generation Tasks | | 0
Page 20 of 32

No leaderboard results yet.