SOTAVerified

Math

Papers

Showing 51–75 of 1596 papers

Title | Status | Hype
Energy-Based Transformers are Scalable Learners and Thinkers | Code | 4
SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights | Code | 4
Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond | Code | 4
Lean Workbook: A large-scale Lean problem set formalized from natural language math problems | Code | 4
Let's Verify Step by Step | Code | 4
Dive into Deep Learning | Code | 4
ReFT: Reasoning with Reinforced Fine-Tuning | Code | 4
InternLM2.5-StepProver: Advancing Automated Theorem Proving via Expert Iteration on Large-Scale LEAN Problems | Code | 4
InternLM-Math: Open Math Large Language Models Toward Verifiable Reasoning | Code | 4
LLaMA Pro: Progressive LLaMA with Block Expansion | Code | 4
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction | Code | 4
How is ChatGPT's behavior changing over time? | Code | 4
MM-PRM: Enhancing Multimodal Mathematical Reasoning with Scalable Step-Level Supervision | Code | 4
Skywork Open Reasoner 1 Technical Report | Code | 4
LEAN-GitHub: Compiling GitHub LEAN repositories for a versatile LEAN prover | Code | 4
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving | Code | 3
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling | Code | 3
General-Reasoner: Advancing LLM Reasoning Across All Domains | Code | 3
MM-Vet v2: A Challenging Benchmark to Evaluate Large Multimodal Models for Integrated Capabilities | Code | 3
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks | Code | 3
PAL: Program-aided Language Models | Code | 3
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning | Code | 3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning | Code | 3
Noise Contrastive Alignment of Language Models with Explicit Rewards | Code | 3
BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models | Code | 3

No leaderboard results yet.