SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 301–325 of 1596 papers

Title	Date	Tasks	Status	Hype
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation	Mar 25, 2025	Code CompletionLanguage Modeling	CodeCode Available	1
Automatic Generation of Socratic Subquestions for Teaching Math Word Problems	Nov 23, 2022	MathMath Word Problem Solving	CodeCode Available	1
M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models	Apr 14, 2025	MambaMath	CodeCode Available	1
Let's Verify Math Questions Step by Step	May 20, 2025	MathMathematical Reasoning	CodeCode Available	1
Leveraging Online Olympiad-Level Math Problems for LLMs Training and Contamination-Resistant Evaluation	Jan 24, 2025	Math	CodeCode Available	1
A Diverse Corpus for Evaluating and Developing English Math Word Problem Solvers	Jun 30, 2021	DiversityMath	CodeCode Available	1
LEVER: Learning to Verify Language-to-Code Generation with Execution	Feb 16, 2023	Arithmetic ReasoningCode Generation	CodeCode Available	1
Learning Goal-Conditioned Representations for Language Reward Models	Jul 18, 2024	GSM8KMath	CodeCode Available	1
Learning Multi-Step Reasoning by Solving Arithmetic Tasks	Jun 2, 2023	MathMathematical Reasoning	CodeCode Available	1
Decoupled Visual Interpretation and Linguistic Reasoning for Math Problem Solving	May 23, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
Learning Math Reasoning from Self-Sampled Correct and Partially-Correct Solutions	May 28, 2022	Arithmetic ReasoningEfficient Exploration	CodeCode Available	1
Learning to Reason Deductively: Math Word Problem Solving as Complex Relation Extraction	Mar 19, 2022	MathMath Word Problem Solving	CodeCode Available	1
DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback	Oct 8, 2024	MathSequential Decision Making	CodeCode Available	1
LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits	Oct 2, 2024	Instruction FollowingMath	CodeCode Available	1
Autoformalize Mathematical Statements by Symbolic Equivalence and Semantic Consistency	Oct 28, 2024	Math	CodeCode Available	1
Large Language Models Can Be Easily Distracted by Irrelevant Context	Jan 31, 2023	Arithmetic ReasoningLanguage Modeling	CodeCode Available	1
AutoBencher: Creating Salient, Novel, Difficult Datasets for Language Models	Jul 11, 2024	Language ModellingMath	CodeCode Available	1
Augmenting Math Word Problems via Iterative Question Composing	Jan 17, 2024	MathMathematical Reasoning	CodeCode Available	1
Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency	Apr 24, 2025	BenchmarkingMath	CodeCode Available	1
Large (Vision) Language Models are Unsupervised In-Context Learners	Apr 3, 2025	GSM8KIn-Context Learning	CodeCode Available	1
Learning by Fixing: Solving Math Word Problems with Weak Supervision	Dec 19, 2020	MathWeakly-supervised Learning	CodeCode Available	1
Language Models as Science Tutors	Feb 16, 2024	GSM8KMath	CodeCode Available	1
Language Models Encode the Value of Numbers Linearly	Jan 8, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
A Tree-Structured Decoder for Image-to-Markup Generation	Jan 1, 2020	DecoderHandwritten Mathmatical Expression Recognition	CodeCode Available	1
Non-myopic Generation of Language Models for Reasoning and Planning	Oct 22, 2024	Computational EfficiencyLanguage Modelling	CodeCode Available	1

Show:10 25 50

← PrevPage 13 of 64Next →

No leaderboard results yet.