SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–75 of 1596 papers

Title	Date	Tasks	Status	Hype
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data	Oct 2, 2024	Arithmetic ReasoningLarge Language Model	CodeCode Available	4
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers	Aug 12, 2024	GSM8KMath	CodeCode Available	4
LEAN-GitHub: Compiling GitHub LEAN repositories for a versatile LEAN prover	Jul 24, 2024	Automated Theorem ProvingMath	CodeCode Available	4
MAVIS: Mathematical Visual Instruction Tuning with an Automatic Data Engine	Jul 11, 2024	Contrastive LearningLanguage Modelling	CodeCode Available	4
Lean Workbook: A large-scale Lean problem set formalized from natural language math problems	Jun 6, 2024	Automated Theorem ProvingMath	CodeCode Available	4
OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset	Feb 15, 2024	Arithmetic ReasoningGSM8K	CodeCode Available	4
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning	Feb 9, 2024	Data AugmentationGSM8K	CodeCode Available	4
ReFT: Reasoning with Reinforced Fine-Tuning	Jan 17, 2024	GSM8KMath	CodeCode Available	4
LLaMA Pro: Progressive LLaMA with Block Expansion	Jan 4, 2024	Instruction FollowingMath	CodeCode Available	4
How is ChatGPT's behavior changing over time?	Jul 18, 2023	Code GenerationLanguage Modelling	CodeCode Available	4
Let's Verify Step by Step	May 31, 2023	Active LearningMath	CodeCode Available	4
Reasoning with Language Model is Planning with World Model	May 24, 2023	Language ModelingLanguage Modelling	CodeCode Available	4
Galactica: A Large Language Model for Science	Nov 16, 2022	AnachronismsBias Detection	CodeCode Available	4
Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models	Jun 9, 2022	Common Sense ReasoningMath	CodeCode Available	4
Dive into Deep Learning	Jun 21, 2021	Deep LearningMath	CodeCode Available	4
Spurious Rewards: Rethinking Training Signals in RLVR	Jun 12, 2025	MathMathematical Reasoning	CodeCode Available	3
MathArena: Evaluating LLMs on Uncontaminated Math Competitions	May 29, 2025	MathMathematical Reasoning	CodeCode Available	3
General-Reasoner: Advancing LLM Reasoning Across All Domains	May 20, 2025	AllMath	CodeCode Available	3
Thinkless: LLM Learns When to Think	May 19, 2025	GSM8KMath	CodeCode Available	3
MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning	May 15, 2025	cross-modal alignmentGeometry Problem Solving	CodeCode Available	3
Reinforcement Learning for Reasoning in Large Language Models with One Training Example	Apr 29, 2025	Domain GeneralizationMath	CodeCode Available	3
An Empirical Study on Prompt Compression for Large Language Models	Apr 24, 2025	ArticlesMath	CodeCode Available	3
Learning to Reason under Off-Policy Guidance	Apr 21, 2025	MathReinforcement Learning (RL)	CodeCode Available	3
Syzygy of Thoughts: Improving LLM CoT with the Minimal Free Resolution	Apr 13, 2025	GSM8KMath	CodeCode Available	3
Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory	Apr 10, 2025	MathMMLU	CodeCode Available	3

Show:10 25 50

← PrevPage 3 of 64Next →

No leaderboard results yet.