SOTAVerified

Math

Papers

Showing 11011150 of 1596 papers

TitleStatusHype
Using Large Language Model to Solve and Explain Physics Word Problems Approaching Human Level0
MAmmoTH: Building Math Generalist Models through Hybrid Instruction TuningCode2
GPT Can Solve Mathematical Problems Without a CalculatorCode2
MathAttack: Attacking Large Language Models Towards Math Solving Ability0
Solving Math Word Problem with Problem Type ClassificationCode0
WizardMath: Empowering Mathematical Reasoning for Large Language Models via Reinforced Evol-InstructCode5
GraphReason: Enhancing Reasoning Capabilities of Large Language Models through A Graph-Based Verification Approach0
Solving Challenging Math Word Problems Using GPT-4 Code Interpreter with Code-based Self-VerificationCode2
Testing GPT-4 with Wolfram Alpha and Code Interpreter plug-ins on math and science problems0
Towards an AI to Win Ghana's National Science and Maths QuizCode1
NEOLAF, an LLM-powered neural-symbolic cognitive architecture0
Cumulative Reasoning with Large Language ModelsCode2
Scalable and Equitable Math Problem Solving Strategy Prediction in Big Educational DataCode0
Automated Distractor and Feedback Generation for Math Multiple-choice Questions via In-context LearningCode0
Studying Large Language Model Generalization with Influence FunctionsCode1
A Symbolic Character-Aware Model for Solving Geometry ProblemsCode1
MM-Vet: Evaluating Large Multimodal Models for Integrated CapabilitiesCode2
Reasoning in Large Language Models Through Symbolic Math Word ProblemsCode0
Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models0
SelfCheck: Using LLMs to Zero-Shot Check Their Own Step-by-Step ReasoningCode1
Augmented Math: Authoring AR-Based Explorable Explanations by Augmenting Static Math TextbooksCode0
A large language model-assisted education tool to provide feedback on open-ended responsesCode0
ARB: Advanced Reasoning Benchmark for Large Language Models0
Explaining Math Word Problem Solvers0
Controlling Equational Reasoning in Large Language Models with Prompt Interventions0
How is ChatGPT's behavior changing over time?Code4
A mixed policy to improve performance of language models on math problemsCode0
Math Agents: Computational Infrastructure, Mathematical Embedding, and Genomics0
MWPRanker: An Expression Similarity Based Math Word Problem Retriever0
CMATH: Can Your Language Model Pass Chinese Elementary School Math Test?0
LeanDojo: Theorem Proving with Retrieval-Augmented Language ModelsCode2
Let's Do a Thought Experiment: Using Counterfactuals to Improve Moral Reasoning0
Math Word Problem Solving by Generating Linguistic Variants of Problem StatementsCode0
A Survey on Multimodal Large Language Models0
Public Attitudes Toward ChatGPT on Twitter: Sentiments, Topics, and OccupationsCode0
DiversiGATE: A Comprehensive Framework for Reliable Large Language Models0
Learning by Analogy: Diverse Questions Generation in Math Word ProblemCode0
SIGHT: A Large Annotated Dataset on Student Insights Gathered from Higher Education TranscriptsCode1
A Neural Network Implementation for Free Energy Principle0
Investigating the Effectiveness of ChatGPT in Mathematical Reasoning and Problem Solving: Evidence from the Vietnamese National High School Graduation Examination0
PromptRobust: Towards Evaluating the Robustness of Large Language Models on Adversarial Prompts0
World Models for Math Story ProblemsCode0
Is ChatGPT a Good Teacher Coach? Measuring Zero-Shot Performance For Scoring and Providing Actionable Insights on Classroom InstructionCode1
Evaluating and Improving Tool-Augmented Computation-Intensive Math ReasoningCode1
Does ChatGPT Comprehend the Place Value in Numbers When Solving Math Word Problems?Code0
MathChat: Converse to Tackle Challenging Math Problems with LLM AgentsCode1
Learning Multi-Step Reasoning by Solving Arithmetic TasksCode1
AWQ: Activation-aware Weight Quantization for LLM Compression and AccelerationCode6
Inspecting Spoken Language Understanding from Kids for Basic Math Learning at Home0
Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions0
Show:102550
← PrevPage 23 of 32Next →

No leaderboard results yet.