SOTAVerified

Math

Papers

Showing 101125 of 1596 papers

TitleStatusHype
MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical ReasoningCode3
MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM FinetuningCode3
Step-level Value Preference Optimization for Mathematical ReasoningCode3
ThoughtSource: A central hub for large language model reasoning dataCode3
Llemma: An Open Language Model For MathematicsCode3
Dynamic Early Exit in Reasoning ModelsCode2
Memorizing TransformersCode2
Measuring Multimodal Mathematical Reasoning with MATH-Vision DatasetCode2
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual ContextsCode2
MegaMath: Pushing the Limits of Open Math CorporaCode2
Meta-Design Matters: A Self-Design Multi-Agent SystemCode2
MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics BenchmarkCode2
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical ReasoningCode2
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language ModelsCode2
MAS-Zero: Designing Multi-Agent Systems with Zero SupervisionCode2
Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language ModelsCode2
MetaMath: Bootstrap Your Own Mathematical Questions for Large Language ModelsCode2
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical CodeCode2
AGIEval: A Human-Centric Benchmark for Evaluating Foundation ModelsCode2
MathOdyssey: Benchmarking Mathematical Problem-Solving Skills in Large Language Models Using Odyssey Math DataCode2
DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-SolvingCode2
Measuring Mathematical Problem Solving With the MATH DatasetCode2
Agent RL Scaling Law: Agent RL with Spontaneous Code Execution for Mathematical Problem SolvingCode2
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of ParametersCode2
Cumulative Reasoning with Large Language ModelsCode2
Show:102550
← PrevPage 5 of 64Next →

No leaderboard results yet.