SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1071–1080 of 1596 papers

Title	Date	Tasks	Status	Hype
Critique Ability of Large Language Models	Oct 7, 2023	Code CompletionDecision Making	—Unverified	0
Analysis of the Reasoning with Redundant Information Provided Ability of Large Language Models	Oct 6, 2023	8kMath	—Unverified	0
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models	Oct 6, 2023	Code GenerationDecision Making	CodeCode Available	2
DSPy: Compiling Declarative Language Model Calls into Self-Improving Pipelines	Oct 5, 2023	Language ModelingLanguage Modelling	CodeCode Available	7
MathCoder: Seamless Code Integration in LLMs for Enhanced Mathematical Reasoning	Oct 5, 2023	Arithmetic ReasoningGSM8K	CodeCode Available	2
Concise and Organized Perception Facilitates Reasoning in Large Language Models	Oct 5, 2023	LAMBADAMath	—Unverified	0
Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference	Oct 4, 2023	MathQuestion Answering	CodeCode Available	1
The Rise of Open Science: Tracking the Evolution and Perceived Value of Data and Methods Link-Sharing Practices	Oct 4, 2023	ArticlesMath	CodeCode Available	0
Novice Learner and Expert Tutor: Evaluating Math Reasoning Abilities of Large Language Models with Misconceptions	Oct 3, 2023	MathMathematical Reasoning	—Unverified	0
Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance	Oct 3, 2023	Code GenerationLogical Reasoning	CodeCode Available	0

Show:10 25 50

← PrevPage 108 of 160Next →

No leaderboard results yet.