SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 371–380 of 1596 papers

Title	Date	Tasks	Status	Hype	Score
Entropy-Based Adaptive Weighting for Self-Training	Mar 31, 2025	GSM8KMath	CodeCode Available	1	5
How to Get Your LLM to Generate Challenging Problems for Evaluation	Feb 20, 2025	Code CompletionMath	CodeCode Available	1	5
Entropy-Regularized Process Reward Model	Dec 15, 2024	GSM8KMath	CodeCode Available	1	5
CoMAT: Chain of Mathematically Annotated Thought Improves Mathematical Reasoning	Oct 14, 2024	MathMathematical Reasoning	CodeCode Available	1	5
Collective Constitutional AI: Aligning a Language Model with Public Input	Jun 12, 2024	Language ModelingLanguage Modelling	CodeCode Available	1	5
A Categorical Archive of ChatGPT Failures	Feb 6, 2023	Math	CodeCode Available	1	5
Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning	Jan 19, 2024	GSM8KMath	CodeCode Available	1	5
MedCaseReasoning: Evaluating and learning diagnostic reasoning from clinical case reports	May 16, 2025	DiagnosticMath	CodeCode Available	1	5
Mining Mathematical Documents for Question Answering via Unsupervised Formula Labeling	Nov 12, 2022	Entity LinkingKnowledge Graphs	CodeCode Available	1	5
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations	Dec 14, 2023	Arithmetic ReasoningGSM8K	CodeCode Available	1	5

Show:10 25 50

← PrevPage 38 of 160Next →

No leaderboard results yet.