SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1151–1175 of 1596 papers

Title	Date	Tasks	Status	Hype
RevOrder: A Novel Method for Enhanced Arithmetic in Language Models	Feb 6, 2024	GSM8KMath	—Unverified	0
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision	Feb 5, 2024	GSM8KMath	—Unverified	0
Improving Assessment of Tutoring Practices using Retrieval-Augmented Generation	Feb 4, 2024	HallucinationMath	—Unverified	0
Salsa Fresca: Angular Embeddings and Pre-Training for ML Attacks on Learning With Errors	Feb 2, 2024	Math	—Unverified	0
Large Language Models for Mathematical Reasoning: Progresses and Challenges	Jan 31, 2024	DiversityMath	—Unverified	0
Efficient Tool Use with Chain-of-Abstraction Reasoning	Jan 30, 2024	MathMathematical Reasoning	—Unverified	0
Taxonomy of Mathematical Plagiarism	Jan 30, 2024	MathQuestion Answering	CodeCode Available	0
GAPS: Geometry-Aware Problem Solver	Jan 29, 2024	Geometry Problem SolvingMath	—Unverified	0
YODA: Teacher-Student Progressive Learning for Language Models	Jan 28, 2024	GSM8KMath	—Unverified	0
Exploring Educational Equity: A Machine Learning Approach to Unravel Achievement Disparities in Georgia	Jan 25, 2024	Math	—Unverified	0
Using Java Geometry Expert as Guide in the Preparations for Math Contests	Jan 22, 2024	Math	—Unverified	0
Self-Imagine: Effective Unimodal Reasoning with Multimodal Models using Self-Imagination	Jan 16, 2024	GSM8KLanguage Modeling	—Unverified	0
CHAMP: A Competition-level Dataset for Fine-Grained Analyses of LLMs' Mathematical Reasoning Capabilities	Jan 13, 2024	MathMathematical Reasoning	—Unverified	0
Cramer-Rao bound and absolute sensitivity in chemical reaction networks	Jan 13, 2024	MathSensitivity	—Unverified	0
Using Large Language Models to Assess Tutors' Performance in Reacting to Students Making Math Errors	Jan 6, 2024	Math	—Unverified	0
Graph2Tac: Online Representation Learning of Formal Math Concepts	Jan 5, 2024	AI AgentAutomated Theorem Proving	—Unverified	0
Mastery Guided Non-parametric Clustering to Scale-up Strategy Prediction	Jan 4, 2024	ClusteringFairness	—Unverified	0
Assessing the Impact of Prompting Methods on ChatGPT's Mathematical Capabilities	Dec 22, 2023	ChatbotGSM8K	—Unverified	0
From Good to Great: Improving Math Reasoning with Tool-Augmented Interleaf Prompting	Dec 18, 2023	DiversityGSM8K	—Unverified	0
TinyGSM: achieving >80% on GSM8k with small language models	Dec 14, 2023	Arithmetic ReasoningGSM8K	—Unverified	0
Fewer is More: Boosting LLM Reasoning with Reinforced Context Pruning	Dec 14, 2023	Arithmetic ReasoningFew-Shot Learning	—Unverified	0
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models	Dec 11, 2023	DiversityMath	—Unverified	0
LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning	Dec 7, 2023	In-Context LearningMath	—Unverified	0
ChatGPT as a Math Questioner? Evaluating ChatGPT on Generating Pre-university Math Questions	Dec 4, 2023	Arithmetic ReasoningMath	CodeCode Available	0
REDS: Resource-Efficient Deep Subnetworks for Dynamic Resource Constraints	Nov 22, 2023	Computational EfficiencyMath	—Unverified	0

Show:10 25 50

← PrevPage 47 of 64Next →

No leaderboard results yet.