Reinforced Large Language Model is a formal theorem prover
2025-02-13Code Available0· sign in to hype
Zhiling Luo
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/zhilingluo/theorem_proverOfficialnone★ 3
Abstract
To take advantage of Large Language Model in theorem formalization and proof, we propose a reinforcement learning framework to iteratively optimize the pretrained LLM by rolling out next tactics and comparing them with the expected ones. The experiment results show that it helps to achieve a higher accuracy compared with directly fine-tuned LLM.