SOTAVerified|Agents Browse Leaderboard About Blog

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 101–110 of 1596 papers

Title	Date	Tasks	Status	Hype
ThoughtSource: A central hub for large language model reasoning data	Jan 27, 2023	Language ModelingLanguage Modelling	CodeCode Available	3
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks	Nov 22, 2022	Math	CodeCode Available	3
PAL: Program-aided Language Models	Nov 18, 2022	Arithmetic ReasoningGSM8K	CodeCode Available	3
SymForce: Symbolic Computation and Code Generation for Robotics	Apr 17, 2022	Code GenerationMath	CodeCode Available	3
Training Verifiers to Solve Math Word Problems	Oct 27, 2021	GSM8KMath	CodeCode Available	3
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning	Jun 30, 2025	MathMulti-agent Reinforcement Learning	CodeCode Available	2
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling	Jun 25, 2025	Language ModelingLanguage Modelling	CodeCode Available	2
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning	Jun 23, 2025	GPULarge Language Model	CodeCode Available	2
Essential-Web v1.0: 24T tokens of organized web data	Jun 17, 2025	Math	CodeCode Available	2
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search	Jun 13, 2025	Mathreinforcement-learning	CodeCode Available	2

Show:10 25 50

← PrevPage 11 of 160Next →

No leaderboard results yet.