| Efficient Tool Use with Chain-of-Abstraction Reasoning | Jan 30, 2024 | Math, Mathematical Reasoning | Unverified | 0 |
| Taxonomy of Mathematical Plagiarism | Jan 30, 2024 | Math, Question Answering | Code Available | 0 |
| ReGAL: Refactoring Programs to Discover Generalizable Abstractions | Jan 29, 2024 | Date Understanding, Math | Code Available | 1 |
| GAPS: Geometry-Aware Problem Solver | Jan 29, 2024 | Geometry Problem Solving, Math | Unverified | 0 |
| YODA: Teacher-Student Progressive Learning for Language Models | Jan 28, 2024 | GSM8K, Math | Unverified | 0 |
| Exploring Educational Equity: A Machine Learning Approach to Unravel Achievement Disparities in Georgia | Jan 25, 2024 | Math | Unverified | 0 |
| Can AI Assistants Know What They Don't Know? | Jan 24, 2024 | Math, Open-Domain Question Answering | Code Available | 2 |
| TroVE: Inducing Verifiable and Efficient Toolboxes for Solving Programmatic Tasks | Jan 23, 2024 | Math, Question Answering | Code Available | 1 |
| Using Java Geometry Expert as Guide in the Preparations for Math Contests | Jan 22, 2024 | Math | Unverified | 0 |
| SuperCLUE-Math6: Graded Multi-Step Math Reasoning Benchmark for LLMs in Chinese | Jan 22, 2024 | Diversity, GSM8K | Code Available | 2 |