SOTAVerified

GSM8K

Papers

Showing 426439 of 439 papers

TitleStatusHype
Training Large Language Models to Reason via EM Policy Gradient0
Large Language Models as Analogical Reasoners0
KwaiYiiMath: Technical Report0
Kwai-STaR: Transform LLMs into State-Transition Reasoners0
Meaning-Typed Programming: Language Abstraction and Runtime for Model-Integrated Applications0
AgentInstruct: Toward Generative Teaching with Agentic Flows0
DavIR: Data Selection via Implicit Reward for Large Language Models0
Local Prompt Optimization0
Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems0
Transcending Scaling Laws with 0.1% Extra Compute0
Look Before You Leap: Problem Elaboration Prompting Improves Mathematical Reasoning in Large Language Models0
KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning?0
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning0
LoRA-Mixer: Coordinate Modular LoRA Experts Through Serial Attention Routing0
Show:102550
← PrevPage 18 of 18Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1XolverAccuracy98.1Unverified
2Orange-mini0-shot MRR98Unverified