SOTAVerified

College Mathematics

Papers

Showing 13 of 3 papers

TitleStatusHype
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics BenchmarkCode2
Scaling Language Models: Methods, Analysis & Insights from Training GopherCode2
Effectiveness of Zero-shot-CoT in Japanese Prompts0
Show:102550

No leaderboard results yet.