SOTAVerified

GSM8K

Papers

Showing 426439 of 439 papers

TitleStatusHype
Interpretable Math Word Problem Solution Generation Via Step-by-step Planning0
Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic SystemsCode0
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuningCode0
RCOT: Detecting and Rectifying Factual Inconsistency in Reasoning by Reversing Chain-of-Thought0
Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs0
Self-Evaluation Guided Beam Search for Reasoning0
Teaching Small Language Models to Reason0
Distilling Reasoning Capabilities into Smaller Language ModelsCode0
Explicit Knowledge Transfer for Weakly-Supervised Code Generation0
Solving math word problems with process- and outcome-based feedback0
Large Language Models Can Self-Improve0
Transcending Scaling Laws with 0.1% Extra Compute0
Complexity-Based Prompting for Multi-Step Reasoning0
Making Large Language Models Better Reasoners with Step-Aware Verifier0
Show:102550
← PrevPage 18 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XolverAccuracy98.1Unverified
2Orange-mini0-shot MRR98Unverified