| Biased Programmers? Or Biased Data? A Field Experiment in Operationalizing AI Ethics | Dec 4, 2020 | Ethics, Math | —Unverified | 0 | 0 |
| The Tangent Search Engine: Improved Similarity Metrics and Scalability for Math Formula Search | Jul 22, 2015 | Information Retrieval, Math | —Unverified | 0 | 0 |
| How well do Computers Solve Math Word Problems? Large-Scale Dataset Construction and Evaluation | Aug 1, 2016 | Community Question Answering, Math | —Unverified | 0 | 0 |
| Weighted Polynomial Approximations: Limits for Learning and Pseudorandomness | Dec 8, 2014 | Math | —Unverified | 0 | 0 |
| How You See Me | Nov 20, 2018 | Math | —Unverified | 0 | 0 |
| Human Learning about AI | Jun 8, 2024 | Math | —Unverified | 0 | 0 |
| Hydrodynamics of Markets: Hidden Links Between Physics and Finance | Mar 14, 2024 | Math | —Unverified | 0 | 0 |
| HyperCLOVA X Technical Report | Apr 2, 2024 | Instruction Following, Machine Translation | —Unverified | 0 | 0 |
| Hypothesis-Driven Theory-of-Mind Reasoning for Large Language Models | Feb 17, 2025 | Math | —Unverified | 0 | 0 |
| Identifying equivalent Calabi--Yau topologies: A discrete challenge from math and physics for machine learning | Feb 15, 2022 | BIG-bench Machine Learning, Math | —Unverified | 0 | 0 |
| Illinois Math Solver: Math Reasoning on the Web | Jun 1, 2016 | Math, Math Word Problem Solving | —Unverified | 0 | 0 |
| The Unreasonable Effectiveness of Model Merging for Cross-Lingual Transfer in LLMs | May 23, 2025 | Cross-Lingual Transfer, Math | —Unverified | 0 | 0 |
| Improve Mathematical Reasoning in Language Models by Automated Process Supervision | Jun 5, 2024 | GSM8K, Math | —Unverified | 0 | 0 |
| Improving Academic Plagiarism Detection for STEM Documents by Analyzing Mathematical Content and Citations | Jun 27, 2019 | Math | —Unverified | 0 | 0 |
| Improving Assessment of Tutoring Practices using Retrieval-Augmented Generation | Feb 4, 2024 | Hallucination, Math | —Unverified | 0 | 0 |
| Improving Automated Distractor Generation for Math Multiple-choice Questions with Overgenerate-and-rank | Apr 19, 2024 | Distractor Generation, Math | —Unverified | 0 | 0 |
| Improving Complex Reasoning with Dynamic Prompt Corruption: A soft prompt Optimization Approach | Mar 17, 2025 | GSM8K, Math | —Unverified | 0 | 0 |
| Improving Equation Set Problems with Label Augmentation | Nov 16, 2021 | Decoder, Math | —Unverified | 0 | 0 |
| Improving Large Language Model Fine-tuning for Solving Math Problems | Oct 16, 2023 | Language Modeling, Language Modelling | —Unverified | 0 | 0 |
| Improving LLM Reasoning through Scaling Inference Computation with Collaborative Verification | Oct 5, 2024 | GSM8K, Math | —Unverified | 0 | 0 |
| Improving Math Problem Solving in Large Language Models Through Categorization and Strategy Tailoring | Oct 29, 2024 | Math | —Unverified | 0 | 0 |
| Improving Math Word Problems with Pre-trained Knowledge and Hierarchical Reasoning | Nov 1, 2021 | Math, Sentence | —Unverified | 0 | 0 |
| Improving Multilingual Math Reasoning for African Languages | May 26, 2025 | Math, Mathematical Reasoning | —Unverified | 0 | 0 |
| The Word is Mightier than the Label: Learning without Pointillistic Labels using Data Programming | Aug 24, 2021 | Math, text-classification | —Unverified | 0 | 0 |
| In between myth and reality: AI for math -- a case study in category theory | Apr 17, 2025 | Math | —Unverified | 0 | 0 |