SOTAVerified

GSM8K

Papers

Showing 411420 of 439 papers

TitleStatusHype
Matrix Information Theory for Self-Supervised LearningCode1
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language ModelsCode3
GRACE: Discriminator-Guided Chain-of-Thought ReasoningCode1
Calc-X and Calcformers: Empowering Arithmetical Chain-of-Thought through Interaction with Symbolic SystemsCode0
Self-Polish: Enhance Reasoning in Large Language Models via Problem RefinementCode1
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuningCode0
Automatic Model Selection with Large Language Models for ReasoningCode1
RCOT: Detecting and Rectifying Factual Inconsistency in Reasoning by Reversing Chain-of-Thought0
Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs0
Self-Evaluation Guided Beam Search for Reasoning0
Show:102550
← PrevPage 42 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XolverAccuracy98.1Unverified
2Orange-mini0-shot MRR98Unverified