SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1151–1175 of 1596 papers

Title	Date	Tasks	Status	Hype	Score
Stem-ming the Tide: Predicting STEM attrition using student transcript data	Aug 28, 2017	BIG-bench Machine LearningMath	—Unverified	0	0
STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing	Nov 1, 2024	2kIn-Context Learning	—Unverified	0	0
Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo	Oct 2, 2024	Math	—Unverified	0	0
xGen-small Technical Report	May 10, 2025	DecoderMath	—Unverified	0	0
VideoGameBench: Can Vision-Language Models complete popular video games?	May 23, 2025	Math	—Unverified	0	0
Step Guided Reasoning: Improving Mathematical Reasoning using Guidance Generation and Step Reasoning	Oct 18, 2024	MathMathematical Reasoning	—Unverified	0	0
Step-KTO: Optimizing Mathematical Reasoning through Stepwise Binary Feedback	Jan 18, 2025	MathMathematical Reasoning	—Unverified	0	0
A case study : Influence of Dimension Reduction on regression trees-based Algorithms -Predicting Aeronautics Loads of a Derivative Aircraft	Nov 16, 2018	Dimensionality ReductionMath	—Unverified	0	0
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation	May 22, 2023	Knowledge TracingMath	—Unverified	0	0
Agent-RLVR: Training Software Engineering Agents via Guidance and Environment Rewards	Jun 13, 2025	MathNavigate	—Unverified	0	0
A Careful Examination of Large Language Model Performance on Grade School Arithmetic	May 1, 2024	GSM8KLanguage Modeling	—Unverified	0	0
Strictly monotone mean-variance preferences with applications to portfolio selection	Dec 18, 2024	ManagementMath	—Unverified	0	0
StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs	Dec 23, 2024	BenchmarkingLogical Reasoning	—Unverified	0	0
A Bayesian model for recognizing handwritten mathematical expressions	Sep 18, 2014	Mathmodel	—Unverified	0	0
Students' Perceived Roles, Opportunities, and Challenges of a Generative AI-powered Teachable Agent: A Case of Middle School Math Class	Aug 26, 2024	Math	—Unverified	0	0
VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM	Nov 8, 2024	Math	—Unverified	0	0
Subtle Errors Matter: Preference Learning via Error-injected Self-editing	Oct 9, 2024	GSM8KMath	—Unverified	0	0
A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications	Jan 9, 2025	MathRAG	—Unverified	0	0
Supervised Optimism Correction: Be Confident When LLMs Are Sure	Apr 10, 2025	GSM8KMath	—Unverified	0	0
Sustainable Border Control Policy in the COVID-19 Pandemic: A Math Modeling Study	Aug 28, 2020	Math	—Unverified	0	0
SVM-based Deep Stacking Networks	Feb 15, 2019	Math	—Unverified	0	0
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution	Feb 25, 2025	MathReinforcement Learning (RL)	—Unverified	0	0
Visual Analytics of Student Learning Behaviors on K-12 Mathematics E-learning Platforms	Sep 7, 2019	Math	—Unverified	0	0
Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning	Mar 7, 2025	GPUMath	—Unverified	0	0
Advancing Process Verification for Large Language Models via Tree-Based Preference Learning	Jun 29, 2024	Binary ClassificationGSM8K	—Unverified	0	0

Show:10 25 50

← PrevPage 47 of 64Next →

No leaderboard results yet.