SOTAVerified

Math

Papers

Showing 451475 of 1596 papers

TitleStatusHype
TheoremQA: A Theorem-driven Question Answering datasetCode1
Non-Autoregressive Math Word Problem Solver with Unified Tree StructureCode1
Solving Math Word Problems by Combining Language Models With Symbolic SolversCode1
From Zero to Hero: Convincing with Extremely Complicated MathCode1
How well do Large Language Models perform in Arithmetic tasks?Code1
SALSA PICANTE: a machine learning attack on LWE with binary secretsCode1
MathPrompter: Mathematical Reasoning using Large Language ModelsCode1
LEVER: Learning to Verify Language-to-Code Generation with ExecutionCode1
Tree-Based Representation and Generation of Natural and Mathematical LanguageCode1
A Categorical Archive of ChatGPT FailuresCode1
Large Language Models Can Be Easily Distracted by Irrelevant ContextCode1
Mathematical Capabilities of ChatGPTCode1
Can an AI Win Ghana's National Science and Maths Quiz? An AI Grand Challenge for EducationCode1
Large Language Models Are Latent Variable Models: Explaining and Finding Good Demonstrations for In-Context LearningCode1
UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical ExpressionCode1
Automatic Generation of Socratic Subquestions for Teaching Math Word ProblemsCode1
The NCTE Transcripts: A Dataset of Elementary Math Classroom TranscriptsCode1
Mining Mathematical Documents for Question Answering via Unsupervised Formula LabelingCode1
What is my math transformer doing? -- Three results on interpretability and generalizationCode1
Solving Math Word Problems via Cooperative Reasoning induced Language ModelsCode1
Broken Neural Scaling LawsCode1
A Causal Framework to Quantify the Robustness of Mathematical Reasoning with Language ModelsCode1
Dynamic Prompt Learning via Policy Gradient for Semi-structured Mathematical ReasoningCode1
FormulaNet: A Benchmark Dataset for Mathematical Formula DetectionCode1
CLEVR-Math: A Dataset for Compositional Language, Visual and Mathematical ReasoningCode1
Show:102550
← PrevPage 19 of 64Next →

No leaderboard results yet.