SOTAVerified

Math

Papers

Showing 11011125 of 1596 papers

TitleStatusHype
Using Large Language Model to Solve and Explain Physics Word Problems Approaching Human Level0
MAmmoTH: Building Math Generalist Models through Hybrid Instruction TuningCode2
GPT Can Solve Mathematical Problems Without a CalculatorCode2
MathAttack: Attacking Large Language Models Towards Math Solving Ability0
Solving Math Word Problem with Problem Type ClassificationCode0
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-InstructCode5
GraphReason: Enhancing Reasoning Capabilities of Large Language Models through A Graph-Based Verification Approach0
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-VerificationCode2
Testing GPT-4 with Wolfram Alpha and Code Interpreter plug-ins on math and science problems0
Towards an AI to Win Ghana's National Science and Maths QuizCode1
NEOLAF, an LLM-powered neural-symbolic cognitive architecture0
Cumulative Reasoning with Large Language ModelsCode2
Scalable and Equitable Math Problem Solving Strategy Prediction in Big Educational DataCode0
Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context LearningCode0
Studying Large Language Model Generalization with Influence FunctionsCode1
A Symbolic Character-Aware Model for Solving Geometry ProblemsCode1
MM-Vet: Evaluating Large Multimodal Models for Integrated CapabilitiesCode2
Reasoning in Large Language Models Through Symbolic Math Word ProblemsCode0
Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models0
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step ReasoningCode1
Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math TextbooksCode0
A large language model-assisted education tool to provide feedback on open-ended responsesCode0
ARB: Advanced Reasoning Benchmark for Large Language Models0
Explaining Math Word Problem Solvers0
Controlling Equational Reasoning in Large Language Models with Prompt Interventions0
Show:102550
← PrevPage 45 of 64Next →

No leaderboard results yet.