SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 831–840 of 1596 papers

Title	Date	Tasks	Status	Hype
Learning Beyond Pattern Matching? Assaying Mathematical Understanding in LLMs	May 24, 2024	In-Context LearningLanguage Modeling	—Unverified	0
Large Language Models Can Self-Correct with Key Condition Verification	May 23, 2024	Arithmetic ReasoningMath	—Unverified	0
Can LLMs Solve longer Math Word Problems Better?	May 23, 2024	Data AugmentationMath	CodeCode Available	0
JiuZhang3.0: Efficiently Improving Mathematical Reasoning by Training Small Data Synthesis Models	May 23, 2024	Knowledge DistillationMath	CodeCode Available	1
"Turing Tests" For An AI Scientist	May 22, 2024	AI AgentData Compression	—Unverified	0
Investigating Symbolic Capabilities of Large Language Models	May 21, 2024	MathNavigate	—Unverified	0
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark	May 20, 2024	College MathematicsGSM8K	CodeCode Available	2
Multiple-Choice Questions are Efficient and Robust LLM Evaluators	May 20, 2024	GSM8KHumanEval	CodeCode Available	1
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving	May 20, 2024	GSM8KMath	—Unverified	0
DOP: Diagnostic-Oriented Prompting for Large Language Models in Mathematical Correction	May 20, 2024	DiagnosticMath	CodeCode Available	0

Show:10 25 50

← PrevPage 84 of 160Next →

No leaderboard results yet.