SOTAVerified

GSM8K

Papers

Showing 251-260 of 439 papers

| Title | Status | Hype |
| --- | --- | --- |
| Iterative Reasoning Preference Optimization | | 0 |
| Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning | | 0 |
| KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning? | | 0 |
| Kwai-STaR: Transform LLMs into State-Transition Reasoners | | 0 |
| KwaiYiiMath: Technical Report | | 0 |
| Large Language Models as Analogical Reasoners | | 0 |
| Large Language Models Can Self-Improve | | 0 |
| Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge | | 0 |
| LearnAlign: Reasoning Data Selection for Reinforcement Learning in Large Language Models Based on Improved Gradient Alignment | | 0 |
| Learning to Rank Chain-of-Thought: An Energy-Based Approach with Outcome Supervision | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Xolver | Accuracy | 98.1 | | Unverified |
| 2 | Orange-mini | 0-shot MRR | 98 | | Unverified |