SOTAVerified

Math

Papers

Showing 10011025 of 1596 papers

TitleStatusHype
Cramer-Rao bound and absolute sensitivity in chemical reaction networks0
CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities0
The Benefits of a Concise Chain of Thought on Problem-Solving in Large Language ModelsCode1
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust AdaptationCode3
Language Models Encode the Value of Numbers LinearlyCode1
Using Large Language Models to Assess Tutors' Performance in Reacting to Students Making Math Errors0
Graph2Tac: Online Representation Learning of Formal Math Concepts0
Mastery Guided Non-parametric Clustering to Scale-up Strategy Prediction0
LLaMA Pro: Progressive LLaMA with Block ExpansionCode4
MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model EvaluationCode1
MathPile: A Billion-Token-Scale Pretraining Corpus for MathCode2
Assessing the Impact of Prompting Methods on ChatGPT's Mathematical Capabilities0
From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting0
An In-depth Look at Gemini's Language AbilitiesCode1
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgentCode1
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human AnnotationsCode1
TinyGSM: achieving >80% on GSM8k with small language models0
Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning0
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models0
Get an A in Math: Progressive Rectification PromptingCode1
LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning0
Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and LayersCode1
ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math QuestionsCode0
Eliciting Latent Knowledge from Quirky Language ModelsCode1
YUAN 2.0: A Large Language Model with Localized Filtering-based AttentionCode2
Show:102550
← PrevPage 41 of 64Next →

No leaderboard results yet.