SOTAVerified

Math

Papers

Showing 76-100 of 1596 papers

Title | Status | Hype
LayerSkip: Enabling Early Exit Inference and Self-Speculative Decoding | Code | 3
Spurious Rewards: Rethinking Training Signals in RLVR | Code | 3
Self-Discover: Large Language Models Self-Compose Reasoning Structures | Code | 3
Large Language Monkeys: Scaling Inference Compute with Repeated Sampling | Code | 3
Llemma: An Open Language Model For Mathematics | Code | 3
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs | Code | 3
Training Verifiers to Solve Math Word Problems | Code | 3
Reinforcement Learning for Reasoning in Large Language Models with One Training Example | Code | 3
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling | Code | 3
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving | Code | 3
General-Reasoner: Advancing LLM Reasoning Across All Domains | Code | 3
Rho-1: Not All Tokens Are What You Need | Code | 3
Program of Thoughts Prompting: Disentangling Computation from Reasoning for Numerical Reasoning Tasks | Code | 3
PAL: Program-aided Language Models | Code | 3
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation | Code | 3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning | Code | 3
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning | Code | 3
Noise Contrastive Alignment of Language Models with Explicit Rewards | Code | 3
How Abilities in Large Language Models are Affected by Supervised Fine-tuning Data Composition | Code | 3
MathCoder-VL: Bridging Vision and Code for Enhanced Multimodal Mathematical Reasoning | Code | 3
MathArena: Evaluating LLMs on Uncontaminated Math Competitions | Code | 3
BAdam: A Memory Efficient Full Parameter Optimization Method for Large Language Models | Code | 3
Scaling up Masked Diffusion Models on Text | Code | 3
Dynamic Cheatsheet: Test-Time Learning with Adaptive Memory | Code | 3
MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning | Code | 3
Page 4 of 64

No leaderboard results yet.