| AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays | Apr 24, 2023 | Math | Unverified | 0 |
| Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers | Apr 21, 2023 | Math, Multiple-choice | Unverified | 0 |
| Progressive-Hint Prompting Improves Reasoning in Large Language Models | Apr 19, 2023 | Arithmetic Reasoning, GSM8K | Code Available | 2 |
| Enhancing Textbooks with Visuals from the Web for Improved Learning | Apr 18, 2023 | Math | Code Available | 0 |
| Metric-agnostic Ranking Optimization | Apr 17, 2023 | Information Retrieval, Learning-To-Rank | Unverified | 0 |
| What Makes a Good Dataset for Symbol Description Reading? | Apr 17, 2023 | document understanding, Math | Unverified | 0 |
| Solving Math Word Problems by Combining Language Models With Symbolic Solvers | Apr 16, 2023 | GSM8K, Language Modeling | Code Available | 1 |
| Gamifying Math Education using Object Detection | Apr 13, 2023 | Math, Object | Unverified | 0 |
| AGIEval: A Human-Centric Benchmark for Evaluating Foundation Models | Apr 13, 2023 | Decision Making, Math | Code Available | 2 |
| Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task | Apr 11, 2023 | Deep Reinforcement Learning, Explainable artificial intelligence | Unverified | 0 |