Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1201–1225 of 1596 papers

Title	Date	Tasks	Status
Instances Need More Care: Rewriting Prompts for Instances with LLMs in the Loop Yields Better Zero-Shot Performance	Oct 3, 2023	Code GenerationLogical Reasoning	CodeCode Available
Benchmarking and Improving Generator-Validator Consistency of Language Models	Oct 3, 2023	BenchmarkingInstruction Following	—Unverified
Novice Learner and Expert Tutor: Evaluating Math Reasoning Abilities of Large Language Models with Misconceptions	Oct 3, 2023	MathMathematical Reasoning	—Unverified
Fill in the Blank: Exploring and Enhancing LLM Capabilities for Backward Reasoning in Math Word Problems	Oct 3, 2023	GSM8KMath	CodeCode Available
Investigating the Efficacy of Large Language Models in Reflective Assessment Methods through Chain of Thoughts Prompting	Sep 30, 2023	Math	—Unverified
L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models	Sep 29, 2023	Code GenerationMath	—Unverified
Fairness Hub Technical Briefs: AUC Gap	Sep 20, 2023	FairnessMath	—Unverified
Contrastive Decoding Improves Reasoning in Large Language Models	Sep 17, 2023	GSM8KHellaSwag	—Unverified
Odd period cycles and ergodic properties in price dynamics for an exchange economy	Sep 17, 2023	Math	—Unverified
ChatGPT-4 with Code Interpreter can be used to solve introductory college-level vector calculus and electromagnetism problems	Sep 16, 2023	Electrical EngineeringMath	—Unverified
Using Large Language Model to Solve and Explain Physics Word Problems Approaching Human Level	Sep 15, 2023	Few-Shot LearningHigh School Physics	—Unverified
MathAttack: Attacking Large Language Models Towards Math Solving Ability	Sep 4, 2023	Adversarial AttackGSM8K	—Unverified
Solving Math Word Problem with Problem Type Classification	Aug 26, 2023	Answer SelectionClassification	CodeCode Available
GraphReason: Enhancing Reasoning Capabilities of Large Language Models through A Graph-Based Verification Approach	Aug 18, 2023	Math	—Unverified
Testing GPT-4 with Wolfram Alpha and Code Interpreter plug-ins on math and science problems	Aug 10, 2023	Language ModelingLanguage Modelling	—Unverified
NEOLAF, an LLM-powered neural-symbolic cognitive architecture	Aug 8, 2023	Incremental LearningMath	—Unverified
Scalable and Equitable Math Problem Solving Strategy Prediction in Big Educational Data	Aug 7, 2023	MathMisconceptions	CodeCode Available
Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context Learning	Aug 7, 2023	In-Context LearningMath	CodeCode Available
Reasoning in Large Language Models Through Symbolic Math Word Problems	Aug 3, 2023	Math	CodeCode Available
Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models	Aug 1, 2023	In-Context LearningMath	—Unverified
Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math Textbooks	Jul 30, 2023	MathOptical Character Recognition	CodeCode Available
A large language model-assisted education tool to provide feedback on open-ended responses	Jul 25, 2023	Language ModelingLanguage Modelling	CodeCode Available
ARB: Advanced Reasoning Benchmark for Large Language Models	Jul 25, 2023	Math	—Unverified
Explaining Math Word Problem Solvers	Jul 24, 2023	Math	—Unverified
Controlling Equational Reasoning in Large Language Models with Prompt Interventions	Jul 19, 2023	HallucinationIn-Context Learning	—Unverified

Show:10 25 50

← PrevPage 49 of 64Next →

No leaderboard results yet.