Math

Papers

Showing 476–500 of 1596 papers

Title	Date	Tasks	Status	Hype	Score
From Zero to Hero: Convincing with Extremely Complicated Math	Apr 1, 2023	Math	Code Available	1	5
Building Dataset for Grounding of Formulae — Annotating Coreference Relations Among Math Identifiers	Jun 1, 2022	Math	Code Available	1	5
A Relation Spectrum Inheriting Taylor Series: Muscle Synergy and Coupling for Hand	Apr 25, 2020	Math, Relation	Code Available	1	5
FINEREASON: Evaluating and Improving LLMs' Deliberate Reasoning through Reflective Puzzle Solving	Feb 27, 2025	GSM8K, Math	Code Available	1	5
NeMo-Inspector: A Visualization Tool for LLM Generation Analysis	May 1, 2025	GSM8K, Math	Code Available	1	5
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models	May 23, 2023	Math	Code Available	1	5
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks	Jul 3, 2021	Decoder, Math	Code Available	1	5
NLPBench: Evaluating Large Language Models on Solving NLP Problems	Sep 27, 2023	Benchmarking, Math	Code Available	1	5
Entropy-Regularized Process Reward Model	Dec 15, 2024	GSM8K, Math	Code Available	1	5
ArMATH: a Dataset for Solving Arabic Math Word Problems	Jun 1, 2022	Deep Learning, Math	Code Available	1	5
Arithmetic Without Algorithms: Language Models Solve Math With a Bag of Heuristics	Oct 28, 2024	Arithmetic Reasoning, Math	Code Available	1	5
FELM: Benchmarking Factuality Evaluation of Large Language Models	Oct 1, 2023	Benchmarking, Math	Code Available	1	5
Fine-Tuning Large Language Models on Quantum Optimization Problems for Circuit Generation	Apr 15, 2025	Math, Quantum Machine Learning	Code Available	1	5
PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models	Mar 4, 2025	GSM8K, Math	Code Available	1	5
Pretrained Language Models are Symbolic Mathematics Solvers too!	Oct 7, 2021	Ingenuity, Language Modelling	Code Available	1	5
MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation	Dec 28, 2023	GSM8K, Language Model Evaluation	Code Available	1	5
Problem-Oriented Segmentation and Retrieval: Case Study on Tutoring Conversations	Nov 12, 2024	Math, Retrieval	Code Available	1	5
Expression Syntax Information Bottleneck for Math Word Problems	Oct 24, 2023	Math	Code Available	1	5
Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning	Jun 4, 2023	Math	Code Available	1	5
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts	Oct 23, 2023	Logical Reasoning, Math	Code Available	1	5
EXAONE Deep: Reasoning Enhanced Language Models	Mar 16, 2025	Math	Code Available	1	5
Achieving >97% on GSM8K: Deeply Understanding the Problems Makes LLMs Better Solvers for Math Word Problems	Apr 23, 2024	Arithmetic Reasoning, GSM8K	Code Available	1	5
Explaining Datasets in Words: Statistical Models with Natural Language Parameters	Sep 13, 2024	Clustering, Language Modeling	Code Available	1	5
Are NLP Models really able to Solve Simple Math Word Problems?	Mar 12, 2021	Math, Math Word Problem Solving	Code Available	1	5
Case-Based or Rule-Based: How Do Transformers Do the Math?	Feb 27, 2024	Math, Systematic Generalization	Code Available	1	5

No leaderboard results yet.