Math Word Problem Solving

A math word problem is a mathematical exercise (such as in a textbook, worksheet, or exam) where significant background information on the problem is presented in ordinary language rather than in mathematical notation. As most word problems involve a narrative of some sort, they are sometimes referred to as story problems and may vary in the amount of technical language used.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 51–100 of 107 papers

Title	Date	Tasks	Status	Hype
Learning Multi-Step Reasoning by Solving Arithmetic Tasks	Jun 2, 2023	MathMathematical Reasoning	CodeCode Available	1
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems	Nov 23, 2022	MathMath Word Problem Solving	CodeCode Available	1
Do Multilingual Language Models Think Better in English?	Aug 2, 2023	Common Sense ReasoningCross-Lingual Natural Language Inference	CodeCode Available	1
ELASTIC: Numerical Reasoning with Adaptive Symbolic Compiler	Oct 18, 2022	Math Word Problem SolvingQuestion Answering	CodeCode Available	1
Semantically-Aligned Universal Tree-Structured Solver for Math Word Problems	Oct 14, 2020	DecoderMath	CodeCode Available	1
RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning	May 23, 2023	In-Context LearningLanguage Modelling	CodeCode Available	1
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems	Apr 23, 2024	Arithmetic ReasoningGSM8K	CodeCode Available	1
Learn to Solve Algebra Word Problems Using Quadratic Programming	Sep 1, 2015	Math Word Problem Solving	—Unverified	0
A Chinese Math Word Problem Solving System Based on Linguistic Theory and Non-statistical Approach	Sep 1, 2020	MathMath Word Problem Solving	—Unverified	0
A Knowledge-Aware Sequence-to-Tree Network for Math Word Problem Solving	Nov 1, 2020	Common Sense ReasoningDecoder	—Unverified	0
An Improved Coarse-to-Fine Method for Solving Generation Tasks	Apr 1, 2019	MathMath Word Problem Solving	—Unverified	0
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM	Mar 12, 2024	Arithmetic ReasoningCode Generation	—Unverified	0
CMATH: Can Your Language Model Pass Chinese Elementary School Math Test?	Jun 29, 2023	Language ModelingLanguage Modelling	—Unverified	0
Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving	Apr 5, 2024	Data AugmentationIn-Context Learning	—Unverified	0
Deep Neural Solver for Math Word Problems	Sep 1, 2017	Feature EngineeringMachine Translation	—Unverified	0
Generate & Rank: A Multi-task Framework for Math Word Problems	Sep 7, 2021	Language ModelingLanguage Modelling	—Unverified	0
Generating Equation by Utilizing Operators : GEO model	Dec 1, 2020	DecoderMachine Translation	—Unverified	0
How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation	Aug 1, 2016	Community Question AnsweringMath	—Unverified	0
Illinois Math Solver: Math Reasoning on the Web	Jun 1, 2016	MathMath Word Problem Solving	—Unverified	0
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning	Mar 4, 2024	GSM8KMath	—Unverified	0
Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval	Nov 25, 2024	MathMath Word Problem Solving	—Unverified	0
Learning Fine-Grained Expressions to Solve Math Word Problems	Sep 1, 2017	MathMath Word Problem Solving	—Unverified	0
Learning from Explicit and Implicit Supervision Jointly For Algebra Word Problems	Nov 1, 2016	Math Word Problem Solving	—Unverified	0
Learning to Automatically Solve Algebra Word Problems	Jun 1, 2014	Math Word Problem Solving	—Unverified	0
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation	May 22, 2023	Knowledge TracingMath	—Unverified	0
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms	May 30, 2019	MathMath Word Problem Solving	—Unverified	0
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving	Jan 16, 2022	Language ModelingLanguage Modelling	—Unverified	0
Neural Math Word Problem Solver with Reinforcement Learning	Aug 1, 2018	Feature EngineeringMath	—Unverified	0
OpenChat: Advancing Open-source Language Models with Mixed-Quality Data	Sep 20, 2023	Arithmetic ReasoningCode Generation	—Unverified	0
Program Synthesis Benchmark for Visual Programming in XLogoOnline Environment	Jun 17, 2024	Logical ReasoningMath	—Unverified	0
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement	Sep 18, 2024	GSM8KMath	—Unverified	0
Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models	Aug 1, 2023	In-Context LearningMath	—Unverified	0
Towards Interpretable Math Word Problem Solving with Grounded Linguistic Logic Reasoning	Nov 16, 2021	MathMath Word Problem Solving	—Unverified	0
Translating a Math Word Problem to a Expression Tree	Oct 1, 2018	Machine TranslationMath	—Unverified	0
Using Intermediate Representations to Solve Math Word Problems	Jul 1, 2018	MathMath Word Problem Solving	—Unverified	0
When Not to Answer: Evaluating Prompts on GPT Models for Effective Abstention in Unanswerable Math Word Problems	Oct 16, 2024	HallucinationMath	—Unverified	0
Improving Compositional Generalization in Math Word Problem Solving	Sep 3, 2022	Data AugmentationMath	CodeCode Available	0
Frugal LMs Trained to Invoke Symbolic Solvers Achieve Parameter-Efficient Arithmetic Reasoning	Dec 9, 2023	Arithmetic ReasoningMathematical Reasoning	CodeCode Available	0
Reverse Operation based Data Augmentation for Solving Math Word Problems	Oct 4, 2020	Data AugmentationMath	CodeCode Available	0
SBI-RAG: Enhancing Math Word Problem Solving for Students through Schema-Based Instruction and Retrieval-Augmented Generation	Oct 17, 2024	GSM8KLanguage Modeling	CodeCode Available	0
Semantically-Aligned Equation Generation for Solving and Reasoning Math Word Problems	Nov 2, 2018	DecoderMath	CodeCode Available	0
EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbers	May 1, 2022	MathMath Word Problem Solving	CodeCode Available	0
Translating a Math Word Problem to an Expression Tree	Nov 14, 2018	MathMath Word Problem Solving	CodeCode Available	0
Does ChatGPT Comprehend the Place Value in Numbers When Solving Math Word Problems?	Jun 3, 2023	MathMath Word Problem Solving	CodeCode Available	0
ATHENA: Mathematical Reasoning with Thought Expansion	Nov 2, 2023	MathMathematical Reasoning	CodeCode Available	0
An Edge-Enhanced Hierarchical Graph-to-Tree Network for Math Word Problem Solving	Nov 1, 2021	DecoderMath	CodeCode Available	0
Analysing Mathematical Reasoning Abilities of Neural Models	Apr 2, 2019	Mathematical Question AnsweringMathematical Reasoning	CodeCode Available	0
Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models	Oct 10, 2024	Arithmetic ReasoningMath	CodeCode Available	0
A Goal-Driven Tree-Structured Neural Model for Math Word Problems	Aug 10, 2019	MathMath Word Problem Solving	CodeCode Available	0
Adversarial Examples for Evaluating Math Word Problem Solvers	Sep 13, 2021	Adversarial RobustnessMath	CodeCode Available	0

Show:10 25 50

← PrevPage 2 of 3Next →

All datasets MATH SVAMP MAWPS Math23K ALG514 ASDiv-A ParaMAWPS DRAW-1K MathQA SVAMP (1:N)GSM-Plus MATH minival

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Gemini 2.0 Flash Experimental	Accuracy	89.7	—	Unverified
2	Qwen2.5-Math-72B-Instruct(TIR,Greedy)	Accuracy	88.1	—	Unverified
3	GPT-4 Turbo (MACM, w/code, voting)	Accuracy	87.92	—	Unverified
4	Qwen2.5-Math-72B-Instruct(COT,Greedy)	Accuracy	85.9	—	Unverified
5	Qwen2.5-Math-7B-Instruct(TIR,Greedy)	Accuracy	85.2	—	Unverified
6	GPT-4-code model (CSV, w/ code, SC, k=16)	Accuracy	84.3	—	Unverified
7	Qwen2-Math-72B-Instruct(greedy)	Accuracy	84	—	Unverified
8	Qwen2.5-Math-7B-Instruct(COT,Greedy)	Accuracy	83.6	—	Unverified
9	Qwen2.5-Math-1.5B-Instruct(TIR,Greedy)	Accuracy	79.9	—	Unverified
10	OpenMath2-Llama3.1-70B (majority@256)	Accuracy	79.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 DUP	Accuracy	94.2	—	Unverified
2	GPT-4 (Teaching-Inspired)	Execution Accuracy	93.9	—	Unverified
3	GPT-4 (Model Selection)	Execution Accuracy	93.7	—	Unverified
4	Qwen2(CoT + Code Interpreter)	Execution Accuracy	92.3	—	Unverified
5	GPT-4 (PHP)	Execution Accuracy	91.9	—	Unverified
6	OpenMath-CodeLlama-70B (w/ code)	Execution Accuracy	87.8	—	Unverified
7	MathCoder-L-70B	Execution Accuracy	84.9	—	Unverified
8	PoT_Eng (self-consistency @ 5)	Execution Accuracy	83.7	—	Unverified
9	CoT_Eng (self-consistency @ 5)	Execution Accuracy	82.5	—	Unverified
10	MMOS-CODE-34B(0-shot)	Execution Accuracy	80.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OpenMath-CodeLlama-70B (w/ code)	Accuracy (%)	95.7	—	Unverified
2	MsAT-DeductReasoner	Accuracy (%)	94.3	—	Unverified
3	ATHENA (roberta-large)	Accuracy (%)	93	—	Unverified
4	Exp-Tree	Accuracy (%)	92.3	—	Unverified
5	Multi-view	Accuracy (%)	92.3	—	Unverified
6	ATHENA (roberta-base)	Accuracy (%)	92.2	—	Unverified
7	Roberta-DeductReasoner	Accuracy (%)	92	—	Unverified
8	DeBERTa (PM + VM)	Accuracy (%)	91	—	Unverified
9	EPT	Accuracy (%)	88.7	—	Unverified
10	Graph2Tree with RoBERTa	Accuracy (%)	88.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 (Teaching-Inspired)	Accuracy (5-fold)	94.3	—	Unverified
2	ATHENA (roberta-large)	Accuracy (training-test)	86.5	—	Unverified
3	Multi-view* (ours)	Accuracy (5-fold)	85.2	—	Unverified
4	ATHENA (roberta-base)	Accuracy (training-test)	84.4	—	Unverified
5	Generate and Rank	Accuracy (5-fold)	84.3	—	Unverified
6	Exp-Tree	Accuracy (5-fold)	84.1	—	Unverified
7	REAL2: Memory-augmented Solver	Accuracy (5-fold)	83.18	—	Unverified
8	Roberta-DeductReasoner	Accuracy (5-fold)	83	—	Unverified
9	MWP-BERT	Accuracy (5-fold)	82.4	—	Unverified
10	Recall and Learn	Accuracy (5-fold)	80.8	—	Unverified