SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 851–860 of 1596 papers

Title	Date	Tasks	Status	Hype
VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual Context	May 8, 2024	MathMathematical Reasoning	CodeCode Available	1
MAmmoTH2: Scaling Instructions from the Web	May 6, 2024	ChatbotGSM8K	—Unverified	0
Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning	May 5, 2024	GSM8KMath	CodeCode Available	2
Assessing and Verifying Task Utility in LLM-Powered Applications	May 3, 2024	Math	—Unverified	0
Math Multiple Choice Question Generation via Human-Large Language Model Collaboration	May 1, 2024	Language ModelingLanguage Modelling	—Unverified	0
GOLD: Geometry Problem Solver with Natural Language Description	May 1, 2024	Math	CodeCode Available	1
A Careful Examination of Large Language Model Performance on Grade School Arithmetic	May 1, 2024	GSM8KLanguage Modeling	—Unverified	0
Monte Carlo Tree Search Boosts Reasoning via Iterative Preference Learning	May 1, 2024	ARCGSM8K	CodeCode Available	3
Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models	May 1, 2024	Math	—Unverified	0
Iterative Reasoning Preference Optimization	Apr 30, 2024	ARCGSM8K	—Unverified	0

Show:10 25 50

← PrevPage 86 of 160Next →

No leaderboard results yet.