Reinforced Large Language Model is a formal theorem prover

2025-02-13Code Available0· sign in to hype

Zhiling Luo

Code Available — Be the first to reproduce this paper.

Code

github.com/zhilingluo/theorem_prover
Officialnone★ 3

Abstract

To take advantage of Large Language Model in theorem formalization and proof, we propose a reinforcement learning framework to iteratively optimize the pretrained LLM by rolling out next tactics and comparing them with the expected ones. The experiment results show that it helps to achieve a higher accuracy compared with directly fine-tuned LLM.

Tasks

Language Modeling Language Modelling Large Language Model reinforcement-learning Reinforcement Learning

Reinforced Large Language Model is a formal theorem prover

Code

Abstract

Tasks

Reproductions