SOTAVerified

Math

Papers

Showing 26–50 of 1596 papers

| Title | Status | Hype |
|---|---|---|
| Qwen Technical Report | Code | 6 |
| Mistral 7B | Code | 6 |
| GPT-4 Technical Report | Code | 6 |
| Chain-of-Thought Prompting Elicits Reasoning in Large Language Models | Code | 6 |
| Process Reinforcement through Implicit Rewards | Code | 5 |
| LiveBench: A Challenging, Contamination-Limited LLM Benchmark | Code | 5 |
| MARIO Eval: Evaluate Your Math LLM with your Math LLM--A mathematical dataset evaluation toolkit | Code | 5 |
| OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models | Code | 5 |
| Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B | Code | 5 |
| Common 7B Language Models Already Possess Strong Math Capabilities | Code | 5 |
| Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models | Code | 5 |
| WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-Instruct | Code | 5 |
| LIMO: Less is More for Reasoning | Code | 5 |
| Evolutionary Optimization of Model Merging Recipes | Code | 5 |
| Free Process Rewards without Process Labels | Code | 5 |
| Reinforcement Learning from Human Feedback | Code | 5 |
| Dive into Deep Learning | Code | 4 |
| LLaMA Pro: Progressive LLaMA with Block Expansion | Code | 4 |
| Lean Workbook: A large-scale Lean problem set formalized from natural language math problems | Code | 4 |
| LEAN-GitHub: Compiling GitHub LEAN repositories for a versatile LEAN prover | Code | 4 |
| Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers | Code | 4 |
| Let's Verify Step by Step | Code | 4 |
| Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond | Code | 4 |
| InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems | Code | 4 |
| How is ChatGPT's behavior changing over time? | Code | 4 |
Page 2 of 64

No leaderboard results yet.