SOTAVerified

Math

Papers

Showing 6170 of 1596 papers

TitleStatusHype
How is ChatGPT's behavior changing over time?Code4
InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN ProblemsCode4
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and BeyondCode4
CodeI/O: Condensing Reasoning Patterns via Code Input-Output PredictionCode4
ReFT: Reasoning with Reinforced Fine-TuningCode4
PAL: Program-aided Language ModelsCode3
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time ScalingCode3
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated CapabilitiesCode3
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference LearningCode3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical ReasoningCode3
Show:102550
← PrevPage 7 of 160Next →

No leaderboard results yet.