A Bayesian Flow Network Framework for Chemistry Tasks

2024-07-28

Nianze Tao, Minori Abe

Abstract

In this work, we introduce ChemBFN, a language model for chemistry tasks based on Bayesian flow networks operating on discrete data. We propose a new accuracy schedule that improves sampling quality by significantly reducing the reconstruction loss. We present evidence that our method generates molecules with satisfactory diversity even when a smaller number of sampling steps is used. A classifier-free guidance method is adapted for conditional generation. Notably, after generative training, our model can be fine-tuned on regression and classification tasks with state-of-the-art performance, which opens the door to building all-in-one models in a single-module style. Our model is open-sourced at https://github.com/Augus1999/bayesian-flow-network-for-chemistry.
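The abstract mentions classifier-free guidance for conditional generation without giving its form. As a rough illustration only, the sketch below shows the generic classifier-free guidance blend applied to per-token logits over a discrete vocabulary. It is not taken from the ChemBFN code base: the function names, the `guidance_strength` parameter, and the toy logits are all hypothetical, and the exact formulation used in the paper may differ.

```python
import math


def softmax(logits):
    """Convert a list of logits into a probability distribution."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def guided_logits(cond_logits, uncond_logits, guidance_strength):
    """Generic classifier-free guidance blend (an assumption, not ChemBFN's API).

    guidance_strength = 0 returns the conditional logits unchanged;
    larger values push the output further away from the unconditional
    prediction and towards the conditional one.
    """
    w = guidance_strength
    return [(1.0 + w) * c - w * u for c, u in zip(cond_logits, uncond_logits)]


# Toy logits over a three-token vocabulary (hypothetical values).
cond = [2.0, 0.5, -1.0]    # network output given the condition
uncond = [1.0, 1.0, 0.0]   # network output with the condition dropped

plain = softmax(guided_logits(cond, uncond, 0.0))
guided = softmax(guided_logits(cond, uncond, 1.5))
print(plain)
print(guided)
```

In this toy case the guided distribution puts more mass on the token favoured by the condition than the unguided one does, which is the intended effect of the guidance strength.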

Benchmark Results

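| Dataset       | Model   | Metric  | Claimed | Verified | Status     |
|---------------|---------|---------|---------|----------|------------|
| BACE          | ChemBFN | ROC-AUC | 73.56   | -        | Unverified |
| BBBP          | ChemBFN | ROC-AUC | 95.74   | -        | Unverified |
| clintox       | ChemBFN | ROC-AUC | 99.18   | -        | Unverified |
| ESOL          | ChemBFN | RMSE    | 0.88    | -        | Unverified |
| FreeSolv      | ChemBFN | RMSE    | 1.42    | -        | Unverified |
| HIV           | ChemBFN | ROC-AUC | 79.37   | -        | Unverified |
| HIV dataset   | ChemBFN | AUC     | 0.79    | -        | Unverified |
| Lipophilicity | ChemBFN | RMSE    | 0.75    | -        | Unverified |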
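The results above use two metrics: ROC-AUC for the classification datasets (higher is better) and RMSE for the regression datasets (lower is better). The ROC-AUC values appear to be reported as percentages in most rows and as a fraction in the "HIV dataset" row. The following is a minimal pure-Python sketch of how each metric is computed. The function names and toy data are hypothetical and are not taken from the paper's evaluation code.

```python
import math


def rmse(y_true, y_pred):
    """Root-mean-square error between targets and predictions."""
    squared = [(t - p) ** 2 for t, p in zip(y_true, y_pred)]
    return math.sqrt(sum(squared) / len(squared))


def roc_auc(labels, scores):
    """ROC-AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    This pairwise form is O(n_pos * n_neg), fine for small examples.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Toy data (hypothetical values, not from the paper).
auc = roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
err = rmse([1.0, 2.0, 3.0], [1.0, 2.0, 5.0])
print(f"ROC-AUC: {auc:.2f} (as a percentage: {100 * auc:.2f})")
print(f"RMSE: {err:.4f}")
```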