SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 876–900 of 1596 papers

Title	Date	Tasks	Status	Hype
Reverse Thinking Makes LLMs Stronger Reasoners	Nov 29, 2024	Data AugmentationKnowledge Distillation	—Unverified	0
Mars-PO: Multi-Agent Reasoning System Preference Optimization	Nov 28, 2024	MathMathematical Reasoning	—Unverified	0
A Lean Dataset for International Math Olympiad: Small Steps towards Writing Math Proofs for Hard Problems	Nov 28, 2024	LEMMAMath	—Unverified	0
Embracing AI in Education: Understanding the Surge in Large Language Model Use by Secondary Students	Nov 27, 2024	Language ModelingLanguage Modelling	—Unverified	0
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS	Nov 27, 2024	In-Context LearningMath	CodeCode Available	0
Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval	Nov 25, 2024	MathMath Word Problem Solving	—Unverified	0
Unraveling Arithmetic in Large Language Models: The Role of Algebraic Structures	Nov 25, 2024	GSM8KMath	—Unverified	0
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training	Nov 21, 2024	Math	—Unverified	0
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs	Nov 14, 2024	General KnowledgeMath	CodeCode Available	0
RESOLVE: Relational Reasoning with Symbolic and Object-Level Features Using Vector Symbolic Processing	Nov 13, 2024	DecoderMath	CodeCode Available	0
OpenAI-o1 AB Testing: Does the o1 model really do good reasoning in math problem solving?	Nov 9, 2024	Logical ReasoningMath	—Unverified	0
VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM	Nov 8, 2024	Math	—Unverified	0
Meta-Reasoning Improves Tool Use in Large Language Models	Nov 7, 2024	Math	CodeCode Available	0
Evaluating GPT-4 at Grading Handwritten Solutions in Math Exams	Nov 7, 2024	Math	—Unverified	0
Self-Consistency Preference Optimization	Nov 6, 2024	GSM8KMath	—Unverified	0
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology	Nov 5, 2024	MathMisconceptions	—Unverified	0
Leveraging Label Semantics and Meta-Label Refinement for Multi-Label Question Classification	Nov 4, 2024	MathReranking	CodeCode Available	0
Dictionary Insertion Prompting for Multilingual Reasoning on Multilingual Large Language Models	Nov 2, 2024	GSM8KMath	—Unverified	0
STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing	Nov 1, 2024	2kIn-Context Learning	—Unverified	0
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models	Oct 29, 2024	MathMathematical Reasoning	—Unverified	0
Automated Feedback in Math Education: A Comparative Analysis of LLMs for Open-Ended Responses	Oct 29, 2024	MathZero-Shot Learning	—Unverified	0
Improving Math Problem Solving in Large Language Models Through Categorization and Strategy Tailoring	Oct 29, 2024	Math	—Unverified	0
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation	Oct 28, 2024	ARCMath	—Unverified	0
Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks?	Oct 27, 2024	Data AugmentationMath	CodeCode Available	0
Library Learning Doesn't: The Curious Case of the Single-Use "Library"	Oct 26, 2024	MathMathematical Reasoning	CodeCode Available	0

Show:10 25 50

← PrevPage 36 of 64Next →

No leaderboard results yet.