| Unlocking Temporal Question Answering for Large Language Models with Tailor-Made Reasoning Logic | May 24, 2023 | Logical Reasoning, Math | Code Available | 0 |
| Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation | May 22, 2023 | Knowledge Tracing, Math | Unverified | 0 |
| Cognitive network science reveals bias in GPT-3, ChatGPT, and GPT-4 mirroring math anxiety in high-school students | May 22, 2023 | Math, Text Generation | Unverified | 0 |
| TEIMMA: The First Content Reuse Annotator for Text, Images, and Math | May 22, 2023 | Math | Code Available | 0 |
| Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate | May 22, 2023 | Benchmarking, Math | Unverified | 0 |
| Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs | May 19, 2023 | Arithmetic Reasoning, GSM8K | Unverified | 0 |
| A quantitative study of NLP approaches to question difficulty estimation | May 17, 2023 | Math, Multiple-choice | Code Available | 0 |
| Learning Non-linguistic Skills without Sacrificing Linguistic Proficiency | May 14, 2023 | Arithmetic Reasoning, Math | Code Available | 0 |
| CodeT5+: Open Code Large Language Models for Code Understanding and Generation | May 13, 2023 | Arithmetic Reasoning, Code Completion | Code Available | 0 |
| Parameterized Approximation for Robust Clustering in Discrete Geometric Spaces | May 12, 2023 | Clustering, Fairness | Unverified | 0 |
| Algebra Error Classification with Large Language Models | May 8, 2023 | Classification, Math | Code Available | 0 |
| AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays | Apr 24, 2023 | Math | Unverified | 0 |
| Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers | Apr 21, 2023 | Math, Multiple-choice | Unverified | 0 |
| Enhancing Textbooks with Visuals from the Web for Improved Learning | Apr 18, 2023 | Math | Code Available | 0 |
| What Makes a Good Dataset for Symbol Description Reading? | Apr 17, 2023 | document understanding, Math | Unverified | 0 |
| Metric-agnostic Ranking Optimization | Apr 17, 2023 | Information Retrieval, Learning-To-Rank | Unverified | 0 |
| Gamifying Math Education using Object Detection | Apr 13, 2023 | Math, Object | Unverified | 0 |
| Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task | Apr 11, 2023 | Deep Reinforcement Learning, Explainable artificial intelligence | Unverified | 0 |
| Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases | Mar 26, 2023 | Math | Unverified | 0 |
| Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval | Mar 22, 2023 | Adversarial Robustness, Deep Hashing | Unverified | 0 |
| Mind meets machine: Unravelling GPT-4's cognitive psychology | Mar 20, 2023 | Common Sense Reasoning, Decision Making | Unverified | 0 |
| OntoMath^PRO 2.0 Ontology: Updates of the Formal Model | Mar 17, 2023 | Management, Math | Unverified | 0 |
| Self-reinforced polynomial approximation methods for concentrated probability densities | Mar 5, 2023 | Math | Unverified | 0 |
| On the existence of minimizers in shallow residual ReLU neural network optimization landscapes | Feb 28, 2023 | Math | Unverified | 0 |
| An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP) | Feb 23, 2023 | Language Modeling, Language Modelling | Code Available | 0 |