Automated Theorem Proving

The goal of Automated Theorem Proving is to automatically generate a proof, given a conjecture (the target theorem) and a knowledge base of known facts, all expressed in a formal language. Automated Theorem Proving is useful in a wide range of applications, including the verification and synthesis of software and hardware systems.

Source: Learning to Prove Theorems by Learning to Generate Theorems

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 151–200 of 288 papers

Title	Date	Tasks	Status	Hype
Autoformalization with Large Language Models	May 25, 2022	Automated Theorem ProvingProgram Synthesis	—Unverified	0
HyperTree Proof Search for Neural Theorem Proving	May 23, 2022	Automated Theorem Proving	—Unverified	0
From Width-Based Model Checking to Width-Based Automated Theorem Proving	May 23, 2022	Automated Theorem Provingvalid	—Unverified	0
Thor: Wielding Hammers to Integrate Language Models and Automated Theorem Provers	May 22, 2022	Automated Theorem Proving	—Unverified	0
The Isabelle ENIGMA	May 4, 2022	Automated Theorem Proving	CodeCode Available	0
Logically Consistent Adversarial Attacks for Soft Theorem Provers	Apr 29, 2022	Automated Theorem Proving	CodeCode Available	0
Adversarial Learning to Reason in an Arbitrary Logic	Apr 6, 2022	Automated Theorem Proving	—Unverified	0
The Proof is in the Pudding: Using Automated Theorem Proving to Generate Cooking Recipes	Mar 5, 2022	Automated Theorem ProvingText Generation	CodeCode Available	0
Automated Reasoning in Non-classical Logics in the TPTP World	Feb 20, 2022	Automated Theorem ProvingPhilosophy	—Unverified	0
Selection Strategies for Commonsense Knowledge	Feb 18, 2022	Automated Theorem ProvingWord Embeddings	—Unverified	0
From the String Landscape to the Mathematical Landscape: a Machine-Learning Outlook	Feb 12, 2022	Automated Theorem ProvingBIG-bench Machine Learning	—Unverified	0
Vehicle: Interfacing Neural Network Verifiers with Interactive Theorem Provers	Feb 10, 2022	Automated Theorem Proving	—Unverified	0
Formal Mathematics Statement Curriculum Learning	Feb 3, 2022	Automated Theorem ProvingLanguage Modeling	CodeCode Available	2
Proceedings 10th International Workshop on Theorem Proving Components for Educational Software	Feb 2, 2022	Automated Theorem Proving	—Unverified	0
Proceedings of the 13th International Conference on Automated Deduction in Geometry	Dec 28, 2021	Automated Theorem Proving	—Unverified	0
Proving Theorems using Incremental Learning and Hindsight Experience Replay	Dec 20, 2021	Automated Theorem ProvingIncremental Learning	—Unverified	0
Linear algebra with transformers	Dec 3, 2021	Automated Theorem ProvingFew-Shot Learning	CodeCode Available	1
Learning Symbolic Rules for Reasoning in Quasi-Natural Language	Nov 23, 2021	Automated Theorem ProvingFormal Logic	CodeCode Available	0
Logically Sound Arguments for the Effectiveness of ML Safety Measures	Nov 4, 2021	Automated Theorem Proving	—Unverified	0
Applying Second-Order Quantifier Elimination in Inspecting Gödel's Ontological Proof	Oct 21, 2021	Automated Theorem Proving	—Unverified	0
Proof Extraction for Logical Neural Networks	Oct 8, 2021	Automated Theorem Proving	—Unverified	0
An energy-based model for neuro-symbolic reasoning on knowledge graphs	Oct 4, 2021	Automated Theorem ProvingGraph Embedding	CodeCode Available	1
Linear algebra with transformers	Sep 29, 2021	Automated Theorem ProvingFew-Shot Learning	—Unverified	0
Neural Unification for Logic Reasoning over Natural Language	Sep 17, 2021	Automated Theorem ProvingQuestion Answering	CodeCode Available	1
Proceedings 37th International Conference on Logic Programming (Technical Communications)	Sep 15, 2021	Automated Theorem ProvingData Integration	—Unverified	0
Conjectures, Tests and Proofs: An Overview of Theory Exploration	Sep 7, 2021	Automated Theorem ProvingMathematical Reasoning	—Unverified	0
AI Descartes: Combining Data and Theory for Derivable Scientific Discovery	Sep 3, 2021	Automated Theorem ProvingBIG-bench Machine Learning	CodeCode Available	1
MiniF2F: a cross-system benchmark for formal Olympiad-level mathematics	Aug 31, 2021	Automated Theorem Proving	CodeCode Available	1
The Horn Non-Clausal Class and its Polynomiality	Aug 31, 2021	Automated Theorem Proving	—Unverified	0
ProoFVer: Natural Logic Theorem Proving for Fact Verification	Aug 25, 2021	Automated Theorem Provingcounterfactual	CodeCode Available	1
Graph Contrastive Pre-training for Effective Theorem Reasoning	Aug 24, 2021	Automated Theorem ProvingContrastive Learning	—Unverified	0
Learning Theorem Proving Components	Jul 21, 2021	Automated Theorem ProvingGraph Neural Network	CodeCode Available	1
Learning to Guide a Saturation-Based Theorem Prover	Jun 7, 2021	Automated Theorem ProvingGraph Neural Network	—Unverified	0
The Role of Entropy in Guiding a Connection Prover	May 31, 2021	Automated Theorem ProvingDecision Making	—Unverified	0
NaturalProofs: Mathematical Theorem Proving in Natural Language	Mar 24, 2021	Automated Theorem ProvingDomain Generalization	CodeCode Available	1
Training a First-Order Theorem Prover from Synthetic Data	Mar 5, 2021	Automated Theorem ProvingBIG-bench Machine Learning	—Unverified	0
TacticZero: Learning to Prove Theorems from Scratch with Deep Reinforcement Learning	Feb 19, 2021	Automated Theorem ProvingDeep Reinforcement Learning	—Unverified	0
Proof Artifact Co-training for Theorem Proving with Language Models	Feb 11, 2021	Automated Theorem ProvingImitation Learning	CodeCode Available	1
Learning Equational Theorem Proving	Feb 10, 2021	Automated Theorem ProvingDeep Reinforcement Learning	—Unverified	0
Learning to Match Mathematical Statements with Proofs	Feb 3, 2021	ArticlesAutomated Theorem Proving	CodeCode Available	0
A Study of Continuous Vector Representationsfor Theorem Proving	Jan 22, 2021	Automated Theorem Proving	—Unverified	0
A Curious New Result of Resolution Strategies in Negation-Limited Inverters Problem	Nov 2, 2020	Automated Theorem ProvingNegation	—Unverified	0
Learning as Abduction: Trainable Natural Logic Theorem Prover for Natural Language Inference	Oct 29, 2020	Automated Theorem ProvingNatural Language Inference	CodeCode Available	1
Proceedings 9th International Workshop on Theorem Proving Components for Educational Software	Oct 28, 2020	Automated Theorem Proving	—Unverified	0
Measuring Systematic Generalization in Neural Proof Generation with Transformers	Sep 30, 2020	Automated Theorem ProvingLogical Reasoning	CodeCode Available	1
Deriving Theorems in Implicational Linear Logic, Declaratively	Sep 22, 2020	Automated Theorem Proving	—Unverified	0
Proceedings 36th International Conference on Logic Programming (Technical Communications)	Sep 19, 2020	Automated Theorem ProvingData Integration	—Unverified	0
Generative Language Modeling for Automated Theorem Proving	Sep 7, 2020	Automated Theorem ProvingLanguage Modeling	—Unverified	0
INT: An Inequality Benchmark for Evaluating Generalization in Theorem Proving	Jul 6, 2020	Automated Theorem Proving	CodeCode Available	1
Modelling Value-oriented Legal Reasoning in LogiKEy	Jun 23, 2020	Automated Theorem ProvingLegal Reasoning	—Unverified	0

Show:10 25 50

← PrevPage 4 of 6Next →

All datasets miniF2F-test miniF2F-valid HolStep (Conditional)HOList benchmark HolStep (Unconditional)Metamath set.mm miniF2F-curriculum CompCert CoqGym

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Kimina-Prover-Preview	cumulative	80.74	—	Unverified
2	ProofAug	cumulative	66	—	Unverified
3	DeepSeek-Prover-V1.5	cumulative	63.5	—	Unverified
4	Subgoal-XL	cumulative	56.1	—	Unverified
5	DeepSeek-Prover	cumulative	52	—	Unverified
6	Lyra + GPT-4	cumulative	47.1	—	Unverified
7	LEGO-Prover ChatGPT	cumulative	47.1	—	Unverified
8	Decomposing the Enigma	cumulative	45.5	—	Unverified
9	Evariste	cumulative	41	—	Unverified
10	Evariste-7d	cumulative	40.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Evariste	Pass@64	58.6	—	Unverified
2	LEGO-Prover ChatGPT	Pass@100	57	—	Unverified
3	Lyra + GPT-4	Pass@100	52	—	Unverified
4	Evariste-7d	Pass@64	47.5	—	Unverified
5	GPT-f	Pass@64	47.3	—	Unverified
6	Evariste-1d	Pass@64	46.7	—	Unverified
7	DSP (62B Minerva informal)	Pass@100	43.9	—	Unverified
8	Lean GPT-f	Pass@8	29.3	—	Unverified
9	Lean tidy	Pass@1	16.8	—	Unverified
10	Metamath GPT-f	Pass@8	2	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MPNN-DagLSTM	Classification Accuracy	0.92	—	Unverified
2	FormulaNet	Classification Accuracy	0.9	—	Unverified
3	FormulaNet-basic	Classification Accuracy	0.89	—	Unverified
4	Siamese 1D CNN-LSTM	Classification Accuracy	0.83	—	Unverified
5	Siamese 1D CNN	Classification Accuracy	0.82	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	4-hop GNN, sub-expression sharing	Percentage correct	49.95	—	Unverified
2	Tactic Dependent Loop	Percentage correct	38.88	—	Unverified
3	BoW2 (extra -ves)	Percentage correct	36.55	—	Unverified
4	Deeper Wider WaveNet	Percentage correct	32.65	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	FormulaNet	Classification Accuracy	0.9	—	Unverified
2	FormulaNet-basic	Classification Accuracy	0.89	—	Unverified
3	1D CNN	Classification Accuracy	0.83	—	Unverified
4	1D CNN-LSTM	Classification Accuracy	0.83	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Evariste	Pass@32	72.4	—	Unverified
2	GPT-f	Percentage correct	56.2	—	Unverified
3	MetaGen-IL + Holophrasm	Percentage correct	22.1	—	Unverified
4	Holophrasm	Percentage correct	14.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Evariste-7d	Pass@64	42.5	—	Unverified
2	Evariste-1d	Pass@64	33.6	—	Unverified
3	Evariste	Pass@64	32.1	—	Unverified
4	GPT-f	Pass@64	30.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Proverbot9001	Percentage correct	19.36	—	Unverified
2	CoqGym/ASTactic	Percentage correct	4.99	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ASTactic	Percentage correct	12.2	—	Unverified