SOTAVerified

Math

Papers


Showing 1126–1150 of 1596 papers

Title	Date	Tasks	Status	Hype	Score
Smiles in delta	Sep 1, 2022	Math	Unverified	0	0
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model	Feb 4, 2025	Instruction Following, Language Modeling	Unverified	0	0
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training	Nov 21, 2024	Math	Unverified	0	0
SOLAR: Scalable Optimization of Large-scale Architecture for Reasoning	Mar 6, 2025	GSM8K, Math	Unverified	0	0
Solving Arithmetic Word Problems Using Transformer and Pre-processing of Problem Texts	Dec 1, 2020	Math	Unverified	0	0
Solving Arithmetic Word Problems with Transformers and Preprocessing of Problem Text	Jun 2, 2021	Math	Unverified	0	0
Solving Linear Algebra by Program Synthesis	Nov 16, 2021	Math, Program Synthesis	Unverified	0	0
Heterogeneous Line Graph Transformer for Math Word Problems	Aug 11, 2022	Math, Representation Learning	Unverified	0	0
Veracity Bias and Beyond: Uncovering LLMs' Hidden Beliefs in Problem-Solving Reasoning	May 22, 2025	Attribute, Math	Unverified	0	0
Solving Math Word Problems with Double-Decoder Transformer	Aug 28, 2019	Decoder, Math	Unverified	0	0
Solving math word problems with process- and outcome-based feedback	Nov 25, 2022	Arithmetic Reasoning, GSM8K	Unverified	0	0
VGR: Visual Grounded Reasoning	Jun 13, 2025	Large Language Model, Math	Unverified	0	0
SPARQ: Synthetic Problem Generation for Reasoning via Quality-Diversity Algorithms	Jun 6, 2025	Diversity, Large Language Model	Unverified	0	0
Speculative Knowledge Distillation: Bridging the Teacher-Student Gap Through Interleaved Sampling	Oct 15, 2024	Instruction Following, Knowledge Distillation	Unverified	0	0
SPIN-Bench: How Well Do LLMs Plan Strategically and Reason Socially?	Mar 16, 2025	Board Games, Card Games	Unverified	0	0
SplitReason: Learning To Offload Reasoning	Apr 23, 2025	Language Modeling, Language Modelling	Unverified	0	0
Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model	Jul 9, 2025	Language Modeling, Language Modelling	Unverified	0	0
SSR: Speculative Parallel Scaling Reasoning in Test-time	May 21, 2025	Diversity, Math	Unverified	0	0
Stable Code Technical Report	Apr 1, 2024	Code Completion, Language Modelling	Unverified	0	0
AI4Math: A Native Spanish Benchmark for University-Level Mathematical Reasoning in Large Language Models	May 25, 2025	Math, Mathematical Reasoning	Unverified	0	0
START: Self-taught Reasoner with Tools	Mar 6, 2025	Math, Self-Learning	Unverified	0	0
A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions	Dec 12, 2024	GSM8K, Knowledge Graphs	Unverified	0	0
Steering LLM Reasoning Through Bias-Only Adaptation	May 24, 2025	GSM8K, Math	Unverified	0	0
Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking	May 30, 2025	Language Modeling, Language Modelling	Unverified	0	0

