What makes math problems hard for reinforcement learning: a case study

2024-08-27

Ali Shehper, Anibal M. Medina-Mardones, Lucas Fagan, Bartłomiej Lewandowski, Angus Gruen, Yang Qiu, Piotr Kucharski, Zhenghan Wang, Sergei Gukov

Code

github.com/shehper/AC-Solver
Official, in paper, PyTorch, ★ 32

Abstract

Using a long-standing conjecture from combinatorial group theory, we explore, from multiple perspectives, the challenges of finding rare instances carrying disproportionately high rewards. Based on lessons learned in the context defined by the Andrews-Curtis conjecture, we propose algorithmic enhancements and a topological hardness measure with implications for a broad class of search problems. As part of our study, we also address several open mathematical questions. Notably, we demonstrate the length reducibility of all but two presentations in the Akbulut-Kirby series (1981), and resolve various potential counterexamples in the Miller-Schupp series (1991), including three infinite subfamilies.
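
The search problem in the abstract can be made concrete with a minimal sketch, which is not taken from the AC-Solver repository. It assumes the standard form of the Akbulut-Kirby presentations, AK(n) = ⟨x, y | x^n = y^(n+1), xyx = yxy⟩, and the usual three Andrews-Curtis moves (multiply one relator by the other, invert a relator, conjugate a relator by a generator). The integer encoding of words and all function names are illustrative assumptions, and the paper's exact move set and reduction rules may differ.

```python
from typing import List, Tuple

# Assumed encoding: 1 = x, -1 = x^-1, 2 = y, -2 = y^-1.
Word = List[int]
Presentation = Tuple[Word, Word]


def free_reduce(word: Word) -> Word:
    """Cancel adjacent inverse pairs such as x x^-1 until none remain."""
    out: Word = []
    for g in word:
        if out and out[-1] == -g:
            out.pop()
        else:
            out.append(g)
    return out


def inverse(word: Word) -> Word:
    """AC move: replace a relator r by r^-1."""
    return [-g for g in reversed(word)]


def concatenate(r_i: Word, r_j: Word) -> Word:
    """AC move: replace r_i by r_i r_j (r_j is the other relator)."""
    return free_reduce(r_i + r_j)


def conjugate(r: Word, g: int) -> Word:
    """AC move: replace r by g r g^-1 for a generator (or its inverse) g."""
    return free_reduce([g] + r + [-g])


def akbulut_kirby(n: int) -> Presentation:
    """Relators of AK(n): x^n y^-(n+1) and x y x y^-1 x^-1 y^-1."""
    r1 = [1] * n + [-2] * (n + 1)
    r2 = [1, 2, 1, -2, -1, -2]
    return r1, r2


def total_length(p: Presentation) -> int:
    """Sum of relator lengths; the trivial presentation <x, y | x, y> has 2."""
    return sum(len(r) for r in p)


if __name__ == "__main__":
    print(total_length(akbulut_kirby(3)))  # 13

    # Toy trivialization of <x, y | xy, y>: multiply r1 by r2^-1.
    r1, r2 = [1, 2], [2]
    r1 = concatenate(r1, inverse(r2))
    print((r1, r2), total_length((r1, r2)))  # ([1], [2]) 2
```

In this picture, "length reducibility" means that some sequence of such moves lowers the total relator length. A presentation is trivialized when that length reaches 2, and those rare successful sequences are the high-reward instances a search or RL agent is looking for.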

Tasks

Math, Reinforcement Learning (RL)
