SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 576–600 of 1596 papers

Title	Date	Tasks	Status	Hype	Score
An algorithm to represent inbreeding trees	Sep 21, 2020	Math	CodeCode Available	0	5
DIVE: Diversified Iterative Self-Improvement	Jan 1, 2025	DiversityGSM8K	CodeCode Available	0	5
Benchmarking Large Language Models for Math Reasoning Tasks	Aug 20, 2024	BenchmarkingIn-Context Learning	CodeCode Available	0	5
OntoMath^PRO Ontology: A Linked Data Hub for Mathematics	Jul 17, 2014	Math	CodeCode Available	0	5
Distinguishing affixoid formations from compounds	Aug 1, 2018	ManagementMath	CodeCode Available	0	5
Discriminative Policy Optimization for Token-Level Reward Models	May 29, 2025	GSM8KLanguage Modeling	CodeCode Available	0	5
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem	Mar 6, 2024	BenchmarkingHallucination	CodeCode Available	0	5
An Edge-Enhanced Hierarchical Graph-to-Tree Network for Math Word Problem Solving	Nov 1, 2021	DecoderMath	CodeCode Available	0	5
Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning	Oct 16, 2024	AllGSM8K	CodeCode Available	0	5
NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models	Jun 5, 2024	MathMathematical Reasoning	CodeCode Available	0	5
One Language, Many Gaps: Evaluating Dialect Fairness and Robustness of Large Language Models in Reasoning Tasks	Oct 14, 2024	FairnessGSM8K	CodeCode Available	0	5
Neural Machine Translation and Sequence-to-sequence Models: A Tutorial	Mar 5, 2017	Machine TranslationMath	CodeCode Available	0	5
Deterministic and Nondeterministic Particle Motion with Interaction Mechanisms	Dec 31, 2022	Math	CodeCode Available	0	5
Multi-Scale Attention with Dense Encoder for Handwritten Mathematical Expression Recognition	Jan 5, 2018	DecoderHandwritten Mathmatical Expression Recognition	CodeCode Available	0	5
More is More: Addition Bias in Large Language Models	Sep 4, 2024	MathText Summarization	CodeCode Available	0	5
AutoMSC: Automatic Assignment of Mathematics Subject Classification Labels	May 25, 2020	ArticlesClassification	CodeCode Available	0	5
Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions	Jul 1, 2019	Deep LearningMath	CodeCode Available	0	5
MM-MATH: Advancing Multimodal Math Evaluation with Process Evaluation and Fine-grained Classification	Apr 7, 2024	Image ComprehensionMath	CodeCode Available	0	5
MMATH: A Multilingual Benchmark for Mathematical Reasoning	May 25, 2025	MathMathematical Reasoning	CodeCode Available	0	5
Automatic Short Math Answer Grading via In-context Meta-learning	May 30, 2022	automatic short answer gradingIn-Context Learning	CodeCode Available	0	5
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs	Nov 14, 2024	General KnowledgeMath	CodeCode Available	0	5
A Diversity-Enhanced Knowledge Distillation Model for Practical Math Word Problem Solving	Jan 7, 2025	DiversityKnowledge Distillation	CodeCode Available	0	5
Misplaced Trust: Measuring the Interference of Machine Learning in Human Decision-Making	May 22, 2020	BIG-bench Machine LearningDecision Making	CodeCode Available	0	5
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language Models	May 30, 2025	MathMultiple-choice	CodeCode Available	0	5
Decomposing Elements of Problem Solving: What "Math" Does RL Teach?	May 28, 2025	MathMathematical Problem-Solving	CodeCode Available	0	5

Show:10 25 50

← PrevPage 24 of 64Next →

No leaderboard results yet.