SOTAVerified

GSM8K

Papers

Showing 361370 of 439 papers

TitleStatusHype
Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning0
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human AnnotationsCode1
Training Chain-of-Thought via Latent-Variable Inference0
AlignedCoT: Prompting Large Language Models via Native-Speaking DemonstrationsCode0
Meta Prompting for AI SystemsCode2
Token-Level Adaptation of LoRA Adapters for Downstream Task GeneralizationCode1
OVM, Outcome-supervised Value Models for Planning in Mathematical ReasoningCode1
Neuro-Symbolic Integration Brings Causal and Reliable Reasoning ProofsCode1
First-Step Advantage: Importance of Starting Right in Multi-Step Math Reasoning0
The ART of LLM Refinement: Ask, Refine, and Trust0
Show:102550
← PrevPage 37 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XolverAccuracy98.1Unverified
2Orange-mini0-shot MRR98Unverified