
Math

Papers


Showing 1576–1596 of 1596 papers

Title	Date	Tasks	Status	Hype	Score
Limits of an AI program for solving college math problems	Aug 14, 2022	Few-Shot Learning, Math	Unverified	0	0
Automatized Evaluation of Formalization Exercises in Mathematics	Jun 2, 2020	Math, Sentence	Unverified	0	0
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks	Dec 17, 2024	Math	Unverified	0	0
Automatic tagging of knowledge points for K12 math problems	Aug 21, 2022	Classification, Math	Unverified	0	0
Automatic Robustness Stress Testing of LLMs as Mathematical Problem Solvers	Jun 5, 2025	GSM8K, Math	Unverified	0	0
LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ	Sep 25, 2024	Chatbot, GSM8K	Unverified	0	0
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology	Nov 5, 2024	Math, Misconceptions	Unverified	0	0
Meaning-Typed Programming: Language Abstraction and Runtime for Model-Integrated Applications	May 14, 2024	GSM8K, Math	Unverified	0	0
LLMs as Potential Brainstorming Partners for Math and Science Problems	Oct 10, 2023	Math	Unverified	0	0
Automatic Generation of High Quality CCGbanks for Parser Domain Adaptation	Jun 5, 2019	Domain Adaptation, Math	Unverified	0	0
LLMs can Find Mathematical Reasoning Mistakes by Pedagogical Chain-of-Thought	May 9, 2024	Hallucination, Math	Unverified	0	0
LLMs Do Not Have Human-Like Working Memory	Apr 30, 2025	Math	Unverified	0	0
LLM The Genius Paradox: A Linguistic and Math Expert's Struggle with Simple Word-based Counting Problems	Oct 18, 2024	In-Context Learning, Math	Unverified	0	0
Local and global asymptotic inference in smoothing spline models	Dec 30, 2012	Math, valid	Unverified	0	0
Local Prompt Optimization	Apr 29, 2025	GSM8K, Math	Unverified	0	0
Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems	Aug 29, 2024	GSM8K, Language Modeling	Unverified	0	0
What's Behind PPO's Collapse in Long-CoT? Value Optimization Holds the Secret	Mar 3, 2025	Math, Reinforcement Learning (RL)	Unverified	0	0
Long Is More Important Than Difficult for Training Reasoning Models	Mar 23, 2025	Math	Unverified	0	0
LongPerceptualThoughts: Distilling System-2 Reasoning for System-1 Perception	Apr 21, 2025	Math, MMLU	Unverified	0	0
Long-range Sequence Modeling with Predictable Sparse Attention	May 1, 2022	Math	Unverified	0	0
LookAlike: Consistent Distractor Generation in Math MCQs	May 3, 2025	Distractor Generation, Math	Unverified	0	0

