SOTAVerified

GSM8K

Papers

Showing 2130 of 439 papers

TitleStatusHype
Automatic Instruction Evolving for Large Language ModelsCode3
Scaling up Masked Diffusion Models on TextCode3
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language ModelsCode3
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical ReasoningCode3
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by StepCode3
MARIO: MAth Reasoning with code Interpreter Output -- A Reproducible PipelineCode3
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference LearningCode3
LayerSkip: Enabling Early Exit Inference and Self-Speculative DecodingCode3
Beyond Chain-of-Thought, Effective Graph-of-Thought Reasoning in Language ModelsCode3
LoRA-GA: Low-Rank Adaptation with Gradient ApproximationCode3
Show:102550
← PrevPage 3 of 44Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XolverAccuracy98.1Unverified
2Orange-mini0-shot MRR98Unverified