SOTAVerified

Automated Theorem Proving

The goal of Automated Theorem Proving is to automatically generate a proof, given a conjecture (the target theorem) and a knowledge base of known facts, all expressed in a formal language. Automated Theorem Proving is useful in a wide range of applications, including the verification and synthesis of software and hardware systems.

Source: Learning to Prove Theorems by Learning to Generate Theorems

Papers

Showing 251288 of 288 papers

TitleStatusHype
Artificial Neural Networks that Learn to Satisfy Logic Constraints0
Proceedings of the Fifth Workshop on Proof eXchange for Theorem Proving0
Interactive, Intelligent Tutoring for Auxiliary Constructions in Geometry Proofs0
Premise Selection for Theorem Proving by Deep Graph EmbeddingCode0
Evaluating an Automata Approach to Query Containment0
LangPro: Natural Language Theorem ProverCode1
Robust Computer Algebra, Theorem Proving, and Oracle AI0
Count-Invariance Including Exponentials0
COVER: Covering the Semantically Tractable Questions0
On-demand Injection of Lexical Knowledge for Recognising Textual EntailmentCode0
HolStep: A Machine Learning Dataset for Higher-order Logic Theorem ProvingCode0
Theorem Proving Based on Semantics of DNA Strand Graph0
jsCoq: Towards Hybrid Theorem Proving Interfaces0
Semantic Parsing of Mathematics by Context-based Learning from Aligned Corpora and Theorem Proving0
Blocking and Other Enhancements for Bottom-Up Model Generation Methods0
BliStrTune: Hierarchical Invention of Theorem Proving Strategies0
Monte Carlo Tableau Proof Search0
DeepAlgebra - an outline of a program0
Social Network Processes in the Isabelle and Coq Theorem Proving Communities0
Verifier Theory and Unverifiability0
Holophrasm: a neural Automated Theorem Prover for higher-order logicCode0
Natural Solution to FraCaS Entailment Problems0
DeepMath - Deep Sequence Models for Premise SelectionCode0
Extracting Higher-Order Goals from the Mizar Mathematical Library0
GeoGebra Tools with Proof Capabilities0
Analysis of Algorithms and Partial Algorithms0
Solving Geometry Problems: Combining Text and Diagram InterpretationCode0
Rare Speed-up in Automatic Theorem Proving Reveals Tradeoff Between Computational Time and Information Value0
Towards Formal Fault Tree Analysis using Theorem Proving0
GraATP: A Graph Theoretic Approach for Automated Theorem Proving in Plane Geometry0
Performance Guarantees for Schatten-p Quasi-Norm Minimization in Recovery of Low-Rank Matrices0
Learning-assisted Theorem Proving with Millions of Lemmas0
Machine Learner for Automated Reasoning 0.4 and 0.50
A state vector algebra for algorithmic implementation of second-order logic0
HOL(y)Hammer: Online ATP Service for HOL LightCode0
READ-EVAL-PRINT in Parallel and Asynchronous Proof-checking0
Modeling in OWL 2 without Restrictions0
Lectures on Jacques Herbrand as a LogicianCode0
Show:102550
← PrevPage 6 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Kimina-Prover-Previewcumulative80.74Unverified
2ProofAugcumulative66Unverified
3DeepSeek-Prover-V1.5cumulative63.5Unverified
4Subgoal-XLcumulative56.1Unverified
5DeepSeek-Provercumulative52Unverified
6Lyra + GPT-4cumulative47.1Unverified
7LEGO-Prover ChatGPTcumulative47.1Unverified
8Decomposing the Enigmacumulative45.5Unverified
9Evaristecumulative41Unverified
10Evariste-7dcumulative40.6Unverified
#ModelMetricClaimedVerifiedStatus
1EvaristePass@6458.6Unverified
2LEGO-Prover ChatGPTPass@10057Unverified
3Lyra + GPT-4Pass@10052Unverified
4Evariste-7dPass@6447.5Unverified
5GPT-fPass@6447.3Unverified
6Evariste-1dPass@6446.7Unverified
7DSP (62B Minerva informal)Pass@10043.9Unverified
8Lean GPT-fPass@829.3Unverified
9Lean tidyPass@116.8Unverified
10Metamath GPT-fPass@82Unverified
#ModelMetricClaimedVerifiedStatus
1MPNN-DagLSTMClassification Accuracy0.92Unverified
2FormulaNetClassification Accuracy0.9Unverified
3FormulaNet-basicClassification Accuracy0.89Unverified
4Siamese 1D CNN-LSTMClassification Accuracy0.83Unverified
5Siamese 1D CNNClassification Accuracy0.82Unverified
#ModelMetricClaimedVerifiedStatus
14-hop GNN, sub-expression sharingPercentage correct49.95Unverified
2Tactic Dependent LoopPercentage correct38.88Unverified
3BoW2 (extra -ves)Percentage correct36.55Unverified
4Deeper Wider WaveNetPercentage correct32.65Unverified
#ModelMetricClaimedVerifiedStatus
1FormulaNetClassification Accuracy0.9Unverified
2FormulaNet-basicClassification Accuracy0.89Unverified
31D CNN-LSTMClassification Accuracy0.83Unverified
41D CNNClassification Accuracy0.83Unverified
#ModelMetricClaimedVerifiedStatus
1EvaristePass@3272.4Unverified
2GPT-fPercentage correct56.2Unverified
3MetaGen-IL + HolophrasmPercentage correct22.1Unverified
4HolophrasmPercentage correct14.3Unverified
#ModelMetricClaimedVerifiedStatus
1Evariste-7dPass@6442.5Unverified
2Evariste-1dPass@6433.6Unverified
3EvaristePass@6432.1Unverified
4GPT-fPass@6430.6Unverified
#ModelMetricClaimedVerifiedStatus
1Proverbot9001Percentage correct19.36Unverified
2CoqGym/ASTacticPercentage correct4.99Unverified
#ModelMetricClaimedVerifiedStatus
1ASTacticPercentage correct12.2Unverified