SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–525 of 1596 papers

Title	Date	Tasks	Status	Hype
Ape210K: A Large-Scale and Template-Rich Dataset of Math Word Problems	Sep 24, 2020	DiversityMath	CodeCode Available	1
Graph-to-Tree Learning for Solving Math Word Problems	Jul 1, 2020	DecoderMath	CodeCode Available	1
A Relation Spectrum Inheriting Taylor Series: Muscle Synergy and Coupling for Hand	Apr 25, 2020	MathRelation	CodeCode Available	1
SIPA: A Simple Framework for Efficient Networks	Apr 24, 2020	Math	CodeCode Available	1
StereoSet: Measuring stereotypical bias in pretrained language models	Apr 20, 2020	Bias DetectionMath	CodeCode Available	1
Injecting Numerical Reasoning Skills into Language Models	Apr 9, 2020	Data AugmentationDecoder	CodeCode Available	1
Graph-to-Tree Neural Networks for Learning Structured Input-Output Translation with Applications to Semantic Parsing and Math Word Problem	Apr 7, 2020	DecoderMachine Translation	CodeCode Available	1
ScanSSD: Scanning Single Shot Detector for Mathematical Formulas in PDF Document Images	Mar 18, 2020	Math	CodeCode Available	1
Discovering Mathematical Objects of Interest -- A Study of Mathematical Notations	Feb 7, 2020	Information RetrievalMath	CodeCode Available	1
A Tree-Structured Decoder for Image-to-Markup Generation	Jan 1, 2020	DecoderHandwritten Mathmatical Expression Recognition	CodeCode Available	1
Template-based math word problem solvers with recursive neural networks	Jul 17, 2019	Math	CodeCode Available	1
From GAN to WGAN	Apr 18, 2019	Generative Adversarial NetworkMath	CodeCode Available	1
VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks	Jul 17, 2025	MathMathematical Reasoning	—Unverified	0
QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation	Jul 17, 2025	MathReinforcement Learning (RL)	—Unverified	0
Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training	Jul 16, 2025	Code GenerationMath	—Unverified	0
Temperature and Persona Shape LLM Agent Consensus With Minimal Accuracy Gains in Qualitative Coding	Jul 15, 2025	Math	—Unverified	0
Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing	Jul 15, 2025	Knowledge TracingMath	CodeCode Available	0
Skip a Layer or Loop it? Test-Time Depth Adaptation of Pretrained LLMs	Jul 10, 2025	CoLALarge Language Model	—Unverified	0
Squeeze the Soaked Sponge: Efficient Off-policy Reinforcement Finetuning for Large Language Model	Jul 9, 2025	Language ModelingLanguage Modelling	—Unverified	0
CoRE: Enhancing Metacognition with Label-free Self-evaluation in LRMs	Jul 8, 2025	GSM8KMath	—Unverified	0
Activation Steering for Chain-of-Thought Compression	Jul 7, 2025	GSM8KMath	CodeCode Available	0
Effects of structure on reasoning in instance-level Self-Discover	Jul 4, 2025	Math	CodeCode Available	0
Do Thinking Tokens Help or Trap? Towards More Efficient Large Reasoning Model	Jun 30, 2025	Math	—Unverified	0
Bridging Offline and Online Reinforcement Learning for LLMs	Jun 26, 2025	Instruction FollowingMath	—Unverified	0
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test	Jun 26, 2025	Code GenerationLarge Language Model	—Unverified	0

Show:10 25 50

← PrevPage 21 of 64Next →

No leaderboard results yet.