SOTAVerified

GSM8K

Papers

Showing 351360 of 439 papers

TitleStatusHype
S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models0
Uncovering Latent Chain of Thought Vectors in Language Models0
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models0
Cool-Fusion: Fuse Large Language Models without Training0
ControlMath: Controllable Data Generation Promotes Math Generalist Models0
Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On0
Slimming Down LLMs Without Losing Their Minds0
Contrastive Decoding Improves Reasoning in Large Language Models0
Concise Thoughts: Impact of Output Length on LLM Reasoning and Cost0
Adaptive Dense Reward: Understanding the Gap Between Action and Reward Space in Alignment0
Show:102550
← PrevPage 36 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XolverAccuracy98.1Unverified
2Orange-mini0-shot MRR98Unverified