Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 401–450 of 1596 papers

Title	Date	Tasks	Status	Hype
Large Language Models Are Neurosymbolic Reasoners	Jan 17, 2024	Common Sense ReasoningMath	CodeCode Available	1
Augmenting Math Word Problems via Iterative Question Composing	Jan 17, 2024	MathMathematical Reasoning	CodeCode Available	1
Evaluating LLMs' Mathematical and Coding Competency through Ontology-guided Interventions	Jan 17, 2024	Arithmetic ReasoningCode Generation	CodeCode Available	1
The Benefits of a Concise Chain of Thought on Problem-Solving in Large Language Models	Jan 11, 2024	MathMultiple-choice	CodeCode Available	1
Language Models Encode the Value of Numbers Linearly	Jan 8, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
MR-GSM8K: A Meta-Reasoning Benchmark for Large Language Model Evaluation	Dec 28, 2023	GSM8KLanguage Model Evaluation	CodeCode Available	1
An In-depth Look at Gemini's Language Abilities	Dec 18, 2023	Instruction FollowingMath	CodeCode Available	1
Math-Shepherd: Verify and Reinforce LLMs Step-by-step without Human Annotations	Dec 14, 2023	Arithmetic ReasoningGSM8K	CodeCode Available	1
Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent	Dec 14, 2023	Language ModelingLanguage Modelling	CodeCode Available	1
Get an A in Math: Progressive Rectification Prompting	Dec 11, 2023	Math	CodeCode Available	1
Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers	Dec 7, 2023	MathMultiple-choice	CodeCode Available	1
Eliciting Latent Knowledge from Quirky Language Models	Dec 2, 2023	Anomaly DetectionMath	CodeCode Available	1
MathGloss: Building mathematical glossaries from text	Nov 21, 2023	Math	CodeCode Available	1
DocMath-Eval: Evaluating Math Reasoning Capabilities of LLMs in Understanding Long and Specialized Documents	Nov 16, 2023	Math	CodeCode Available	1
FinanceMath: Knowledge-Intensive Math Reasoning in Finance Domains	Nov 16, 2023	MathMath Word Problem Solving	CodeCode Available	1
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving	Nov 15, 2023	Math	CodeCode Available	1
Towards Reasoning in Large Language Models via Multi-Agent Peer Review Collaboration	Nov 14, 2023	DiversityMath	CodeCode Available	1
Conic10K: A Challenging Math Problem Understanding and Reasoning Dataset	Nov 9, 2023	MathNatural Language Understanding	CodeCode Available	1
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs	Nov 8, 2023	FairnessMath	CodeCode Available	1
Implicit Chain of Thought Reasoning via Knowledge Distillation	Nov 2, 2023	Knowledge DistillationMath	CodeCode Available	1
Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations	Oct 31, 2023	GSM8KMath	CodeCode Available	1
Learning From Mistakes Makes LLM Better Reasoner	Oct 31, 2023	GSM8KMath	CodeCode Available	1
An Early Evaluation of GPT-4V(ision)	Oct 25, 2023	Math	CodeCode Available	1
Expression Syntax Information Bottleneck for Math Word Problems	Oct 24, 2023	Math	CodeCode Available	1
Plan, Verify and Switch: Integrated Reasoning with Diverse X-of-Thoughts	Oct 23, 2023	Logical ReasoningMath	CodeCode Available	1
Teaching Language Models to Self-Improve through Interactive Demonstrations	Oct 20, 2023	Math	CodeCode Available	1
Bridging the Novice-Expert Gap via Models of Decision-Making: A Case Study on Remediating Math Mistakes	Oct 16, 2023	Decision MakingMath	CodeCode Available	1
Don't Fine-Tune, Decode: Syntax Error-Free Tool Use via Constrained Decoding	Oct 10, 2023	Mathvalid	CodeCode Available	1
Retrieval-augmented Generation to Improve Math Question-Answering: Trade-offs Between Groundedness and Human Preference	Oct 4, 2023	MathQuestion Answering	CodeCode Available	1
A Dynamic LLM-Powered Agent Network for Task-Oriented Agent Collaboration	Oct 3, 2023	Arithmetic ReasoningCode Generation	CodeCode Available	1
SNIP: Bridging Mathematical Symbolic and Numeric Realms with Unified Pre-training	Oct 3, 2023	Contrastive LearningEquation Discovery	CodeCode Available	1
FELM: Benchmarking Factuality Evaluation of Large Language Models	Oct 1, 2023	BenchmarkingMath	CodeCode Available	1
NLPBench: Evaluating Large Language Models on Solving NLP Problems	Sep 27, 2023	BenchmarkingMath	CodeCode Available	1
Design of Chain-of-Thought in Math Problem Solving	Sep 20, 2023	DiversityGSM8K	CodeCode Available	1
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning	Sep 19, 2023	Instruction FollowingLanguage Modeling	CodeCode Available	1
Towards an AI to Win Ghana's National Science and Maths Quiz	Aug 8, 2023	MathQuestion Answering	CodeCode Available	1
Studying Large Language Model Generalization with Influence Functions	Aug 7, 2023	counterfactualLanguage Modeling	CodeCode Available	1
A Symbolic Character-Aware Model for Solving Geometry Problems	Aug 5, 2023	MathMulti-Label Classification	CodeCode Available	1
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step Reasoning	Aug 1, 2023	GSM8KMath	CodeCode Available	1
SIGHT: A Large Annotated Dataset on Student Insights Gathered from Higher Education Transcripts	Jun 15, 2023	Math	CodeCode Available	1
Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom Instruction	Jun 5, 2023	Math	CodeCode Available	1
Evaluating and Improving Tool-Augmented Computation-Intensive Math Reasoning	Jun 4, 2023	Math	CodeCode Available	1
MathChat: Converse to Tackle Challenging Math Problems with LLM Agents	Jun 2, 2023	Elementary MathematicsMath	CodeCode Available	1
Learning Multi-Step Reasoning by Solving Arithmetic Tasks	Jun 2, 2023	MathMathematical Reasoning	CodeCode Available	1
GRACE: Discriminator-Guided Chain-of-Thought Reasoning	May 24, 2023	GSM8KMath	CodeCode Available	1
The Art of SOCRATIC QUESTIONING: Recursive Thinking with Large Language Models	May 24, 2023	Language ModellingMath	CodeCode Available	1
MathDial: A Dialogue Tutoring Dataset with Rich Pedagogical Properties Grounded in Math Reasoning Problems	May 23, 2023	Language ModellingLarge Language Model	CodeCode Available	1
RetICL: Sequential Retrieval of In-Context Examples with Reinforcement Learning	May 23, 2023	In-Context LearningLanguage Modelling	CodeCode Available	1
CREATOR: Tool Creation for Disentangling Abstract and Concrete Reasoning of Large Language Models	May 23, 2023	2kMath	CodeCode Available	1
ChatCoT: Tool-Augmented Chain-of-Thought Reasoning on Chat-based Large Language Models	May 23, 2023	Math	CodeCode Available	1

Show:10 25 50

← PrevPage 9 of 32Next →

No leaderboard results yet.