SOTAVerified

Math

Papers

Showing 10011050 of 1596 papers

TitleStatusHype
Cramer-Rao bound and absolute sensitivity in chemical reaction networks0
CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities0
The Benefits of a Concise Chain of Thought on Problem-Solving in Large Language ModelsCode1
RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust AdaptationCode3
Language Models Encode the Value of Numbers LinearlyCode1
Using Large Language Models to Assess Tutors' Performance in Reacting to Students Making Math Errors0
Graph2Tac: Online Representation Learning of Formal Math Concepts0
Mastery Guided Non-parametric Clustering to Scale-up Strategy Prediction0
LLaMA Pro: Progressive LLaMA with Block ExpansionCode4
MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model EvaluationCode1
MathPile: A Billion-Token-Scale Pretraining Corpus for MathCode2
Assessing the Impact of Prompting Methods on ChatGPT's Mathematical Capabilities0
From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting0
An In-depth Look at Gemini's Language AbilitiesCode1
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgentCode1
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human AnnotationsCode1
TinyGSM: achieving >80% on GSM8k with small language models0
Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning0
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models0
Get an A in Math: Progressive Rectification PromptingCode1
LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning0
Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and LayersCode1
ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math QuestionsCode0
Eliciting Latent Knowledge from Quirky Language ModelsCode1
YUAN 2.0: A Large Language Model with Localized Filtering-based AttentionCode2
REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource Constraints0
MathGloss: Building mathematical glossaries from textCode1
Meta Prompting for AI SystemsCode2
System 2 Attention (is something you might need too)Code2
DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized DocumentsCode1
FinanceMath: Knowledge-Intensive Math Reasoning in Finance DomainsCode1
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem SolvingCode1
Towards Reasoning in Large Language Models via Multi-Agent Peer Review CollaborationCode1
First-Step Advantage: Importance of Starting Right in Multi-Step Math Reasoning0
SAIE Framework: Support Alone Isn't Enough -- Advancing LLM Training with Adversarial Remarks0
VerityMath: Advancing Mathematical Reasoning by Self-Verification Through Unit ConsistencyCode0
Large Language Models' Understanding of Math: Source Criticism and Extrapolation0
Let's Reinforce Step by Step0
Conic10K: A Challenging Math Problem Understanding and Reasoning DatasetCode1
Agent Lumos: Unified and Modular Training for Open-Source Language AgentsCode2
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMsCode1
Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language ModelsCode0
Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation0
ATHENA: Mathematical Reasoning with Thought ExpansionCode0
Implicit Chain of Thought Reasoning via Knowledge DistillationCode1
Unleashing the Creative Mind: Language Model As Hierarchical Policy For Improved Exploration on Challenging Problem SolvingCode0
Learning From Mistakes Makes LLM Better ReasonerCode1
Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and ObservationsCode1
Exploring the Reliability of Large Language Models as Customized Evaluators for Diverse NLP TasksCode0
math-PVS: A Large Language Model Framework to Map Scientific Publications to PVS Theories0
Show:102550
← PrevPage 21 of 32Next →

No leaderboard results yet.