SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1426–1450 of 1596 papers

Title	Date	Tasks	Status	Hype
Bounds on Multi-asset Derivatives via Neural Networks	Nov 13, 2019	Math	CodeCode Available	0
HAPO: Training Language Models to Reason Concisely via History-Aware Policy Optimization	May 16, 2025	Math	CodeCode Available	0
Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models	Jul 9, 2024	Math	CodeCode Available	0
CER: Confidence Enhanced Reasoning in LLMs	Feb 20, 2025	MathMathematical Reasoning	CodeCode Available	0
A Robustly Optimized Long Text to Math Models for Numerical Reasoning On FinQA	Jun 29, 2022	Math	CodeCode Available	0
TutorGym: A Testbed for Evaluating AI Agents as Tutors and Students	May 2, 2025	GSM8KIn-Context Learning	CodeCode Available	0
Continual Pre-training of Language Models for Math Problem Understanding with Syntax-Aware Memory Network	May 1, 2022	Math	CodeCode Available	0
Lost in Cultural Translation: Do LLMs Struggle with Math Across Cultural Contexts?	Mar 23, 2025	GSM8KMath	CodeCode Available	0
Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning	Sep 17, 2024	Few-Shot LearningIn-Context Learning	CodeCode Available	0
Reasoning in Large Language Models Through Symbolic Math Word Problems	Aug 3, 2023	Math	CodeCode Available	0
The Relationship Between Reasoning and Performance in Large Language Models -- o3 (mini) Thinks Harder, Not Longer	Feb 21, 2025	MathMathematical Reasoning	CodeCode Available	0
The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices	Oct 4, 2023	ArticlesMath	CodeCode Available	0
Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation	Dec 11, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	0
SemEval-2019 Task 10: Math Question Answering	Jun 1, 2019	MathQuestion Answering	CodeCode Available	0
Does ChatGPT Comprehend the Place Value in Numbers When Solving Math Word Problems?	Jun 3, 2023	MathMath Word Problem Solving	CodeCode Available	0
Sequence to General Tree: Knowledge-Guided Geometry Word Problem Solving	Jun 2, 2021	Math	CodeCode Available	0
Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models	Oct 10, 2024	Arithmetic ReasoningMath	CodeCode Available	0
GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking	Jun 1, 2025	4kMath	CodeCode Available	0
Guided Speculative Inference for Efficient Test-Time Alignment of LLMs	Jun 4, 2025	Math	CodeCode Available	0
Can Vision-Language Models Evaluate Handwritten Math?	Jan 13, 2025	Math	CodeCode Available	0
Adversarial Examples for Evaluating Math Word Problem Solvers	Sep 13, 2021	Adversarial RobustnessMath	CodeCode Available	0
Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks?	Oct 27, 2024	Data AugmentationMath	CodeCode Available	0
Effective Skill Unlearning through Intervention and Abstention	Mar 27, 2025	General KnowledgeMath	CodeCode Available	0
Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning	Oct 16, 2024	AllGSM8K	CodeCode Available	0
HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate Class	May 17, 2025	MathMathematical Problem-Solving	CodeCode Available	0

Show:10 25 50

← PrevPage 58 of 64Next →

No leaderboard results yet.