SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 601–625 of 1596 papers

Title	Date	Tasks	Status	Hype	Score
NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models	Jun 5, 2024	MathMathematical Reasoning	CodeCode Available	0	5
Decomposing Elements of Problem Solving: What "Math" Does RL Teach?	May 28, 2025	MathMathematical Problem-Solving	CodeCode Available	0	5
Analysis of Optimization Algorithms via Sum-of-Squares	Jun 11, 2019	Math	CodeCode Available	0	5
Automatic Generation of Headlines for Online Math Questions	Nov 27, 2019	Math	CodeCode Available	0	5
Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition	Jan 5, 2018	DecoderHandwritten Mathmatical Expression Recognition	CodeCode Available	0	5
More is More: Addition Bias in Large Language Models	Sep 4, 2024	MathText Summarization	CodeCode Available	0	5
Analogical Math Word Problems Solving with Enhanced Problem-Solution Association	Dec 1, 2022	MathQuestion Answering	CodeCode Available	0	5
Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions	Jul 1, 2019	Deep LearningMath	CodeCode Available	0	5
HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization	May 16, 2025	Math	CodeCode Available	0	5
Misplaced Trust: Measuring the Interference of Machine Learning in Human Decision-Making	May 22, 2020	BIG-bench Machine LearningDecision Making	CodeCode Available	0	5
A mixed policy to improve performance of language models on math problems	Jul 17, 2023	GSM8KMath	CodeCode Available	0	5
Critical-Questions-of-Thought: Steering LLM reasoning with Argumentative Querying	Dec 19, 2024	MathMathematical Reasoning	CodeCode Available	0	5
Automated Knowledge Concept Annotation and Question Representation Learning for Knowledge Tracing	Oct 2, 2024	Contrastive LearningKnowledge Tracing	CodeCode Available	0	5
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models	May 30, 2025	MathMultiple-choice	CodeCode Available	0	5
Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia	Oct 2, 2024	Language ModelingLanguage Modelling	CodeCode Available	0	5
MIRB: Mathematical Information Retrieval Benchmark	May 21, 2025	Automated Theorem ProvingInformation Retrieval	CodeCode Available	0	5
MMATH: A Multilingual Benchmark for Mathematical Reasoning	May 25, 2025	MathMathematical Reasoning	CodeCode Available	0	5
Meta-Reasoning Improves Tool Use in Large Language Models	Nov 7, 2024	Math	CodeCode Available	0	5
A Meaning-based Statistical English Math Word Problem Solver	Mar 16, 2018	Math	CodeCode Available	0	5
metboost: Exploratory regression analysis with hierarchically clustered data	Feb 13, 2017	MathMissing Values	CodeCode Available	0	5
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs	Nov 14, 2024	General KnowledgeMath	CodeCode Available	0	5
Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math Textbooks	Jul 30, 2023	MathOptical Character Recognition	CodeCode Available	0	5
Continual Pre-training of Language Models for Math Problem Understanding with Syntax-Aware Memory Network	May 1, 2022	Math	CodeCode Available	0	5
MAWPS: A Math Word Problem Repository	Jun 1, 2016	MathMath Word Problem Solving	CodeCode Available	0	5
Math Word Problem Solving by Generating Linguistic Variants of Problem Statements	Jun 24, 2023	DecoderIngenuity	CodeCode Available	0	5

Show:10 25 50

← PrevPage 25 of 64Next →

No leaderboard results yet.