SOTAVerified

GSM8K

Papers

Showing 331–340 of 439 papers

| Title | Status | Hype |
| --- | --- | --- |
| Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning in Large Language Models | | 0 |
| Fine-Grained Self-Endorsement Improves Factuality and Reasoning | | 0 |
| Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation | Code | 1 |
| SymBa: Symbolic Backward Chaining for Structured Natural Language Reasoning | | 0 |
| Reformatted Alignment | Code | 2 |
| Orca-Math: Unlocking the potential of SLMs in Grade School Math | | 0 |
| Language Models as Science Tutors | Code | 1 |
| Can Separators Improve Chain-of-Thought Prompting? | | 0 |
| OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset | Code | 4 |
| Premise Order Matters in Reasoning with Large Language Models | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Xolver | Accuracy | 98.1 | | Unverified |
| 2 | Orange-mini | 0-shot MRR | 98 | | Unverified |