SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 431–440 of 1596 papers

Title	Date	Tasks	Status	Hype	Score
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts	Oct 23, 2023	Logical ReasoningMath	CodeCode Available	1	5
A Relation Spectrum Inheriting Taylor Series: Muscle Synergy and Coupling for Hand	Apr 25, 2020	MathRelation	CodeCode Available	1	5
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data	Feb 14, 2024	Automated Theorem ProvingLanguage Modelling	CodeCode Available	1	5
Multiple-Choice Questions are Efficient and Robust LLM Evaluators	May 20, 2024	GSM8KHumanEval	CodeCode Available	1	5
Aioli: A Unified Optimization Framework for Language Model Data Mixing	Nov 8, 2024	Language ModelingLanguage Modelling	CodeCode Available	1	5
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical Reasoning	Sep 29, 2022	Logical ReasoningMath	CodeCode Available	1	5
BlenderGym: Benchmarking Foundational Model Systems for Graphics Editing	Apr 2, 2025	3D ReconstructionBenchmarking	CodeCode Available	1	5
Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes	Oct 16, 2023	Decision MakingMath	CodeCode Available	1	5
CityGPT: Empowering Urban Spatial Cognition of Large Language Models	Jun 20, 2024	Code GenerationMath	CodeCode Available	1	5
On the Resilience of LLM-Based Multi-Agent Collaboration with Faulty Agents	Aug 2, 2024	Code GenerationLarge Language Model	CodeCode Available	1	5

Show:10 25 50

← PrevPage 44 of 160Next →

No leaderboard results yet.