SOTAVerified

Math

Papers

Showing 101-110 of 1596 papers

Title | Status | Hype
ThoughtSource: A central hub for large language model reasoning data | Code | 3
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks | Code | 3
PAL: Program-aided Language Models | Code | 3
SymForce: Symbolic Computation and Code Generation for Robotics | Code | 3
Training Verifiers to Solve Math Word Problems | Code | 3
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning | Code | 2
OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling | Code | 2
Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning | Code | 2
Essential-Web v1.0: 24T tokens of organized web data | Code | 2
TreeRL: LLM Reinforcement Learning with On-Policy Tree Search | Code | 2

No leaderboard results yet.