High Performance of Gradient Boosting in Binding Affinity Prediction

2022-05-14

Dmitrii Gavrilev, Nurlybek Amangeldiuly, Sergei Ivanov, Evgeny Burnaev

Abstract

Prediction of protein-ligand (PL) binding affinity remains key to drug discovery. Popular approaches in recent years involve graph neural networks (GNNs), which are used to learn the topology and geometry of PL complexes. However, GNNs are computationally heavy and scale poorly with graph size. In contrast, traditional machine learning (ML) approaches, such as gradient-boosted decision trees (GBDTs), are lightweight yet highly effective on tabular data. We propose using PL interaction features together with PL graph-level features as input to a GBDT. We show that this combination outperforms existing solutions.
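
The idea in the abstract, concatenating per-complex interaction features with graph-level features and fitting a gradient-boosted tree ensemble, can be illustrated with a minimal sketch. This is not the authors' implementation: the benchmark entries name LightGBM, whereas the code below is a dependency-free toy using depth-1 regression trees (stumps) and squared-error loss. The feature values are synthetic random numbers, and every name (`interaction`, `graph_level`, `fit_gbdt`, and so on) is hypothetical and introduced only for illustration.

```python
import math
import random


def fit_stump(X, residuals):
    """Fit a depth-1 regression tree: the single split minimizing squared error."""
    best = None  # (sse, feature index, threshold, left value, right value)
    for j in range(len(X[0])):
        values = sorted(set(row[j] for row in X))
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2.0
            left = [r for row, r in zip(X, residuals) if row[j] <= t]
            right = [r for row, r in zip(X, residuals) if row[j] > t]
            lv, rv = sum(left) / len(left), sum(right) / len(right)
            sse = sum((r - lv) ** 2 for r in left) + sum((r - rv) ** 2 for r in right)
            if best is None or sse < best[0]:
                best = (sse, j, t, lv, rv)
    if best is None:  # every feature is constant: fall back to the mean residual
        mean = sum(residuals) / len(residuals)
        return (0, math.inf, mean, mean)
    return best[1:]


def predict_stump(stump, row):
    j, t, lv, rv = stump
    return lv if row[j] <= t else rv


def fit_gbdt(X, y, n_rounds=50, learning_rate=0.1):
    """Gradient boosting for squared error: each stump fits the current residuals."""
    base = sum(y) / len(y)
    preds = [base] * len(y)
    stumps = []
    for _ in range(n_rounds):
        residuals = [yi - pi for yi, pi in zip(y, preds)]
        stump = fit_stump(X, residuals)
        stumps.append(stump)
        preds = [p + learning_rate * predict_stump(stump, row)
                 for p, row in zip(preds, X)]
    return base, stumps, learning_rate


def predict_gbdt(model, row):
    base, stumps, lr = model
    return base + lr * sum(predict_stump(s, row) for s in stumps)


def rmse(y_true, y_pred):
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(y_true, y_pred)) / len(y_true))


# Synthetic stand-ins (NOT real data): 3 interaction features and 2 graph-level
# features per complex, concatenated into one tabular row.
rng = random.Random(0)
interaction = [[rng.random() for _ in range(3)] for _ in range(60)]
graph_level = [[rng.random() for _ in range(2)] for _ in range(60)]
X = [a + b for a, b in zip(interaction, graph_level)]
# Toy target depending on one feature from each group, plus small noise.
y = [2.0 * row[0] + row[3] + 0.05 * rng.gauss(0, 1) for row in X]

model = fit_gbdt(X, y)
baseline_rmse = rmse(y, [sum(y) / len(y)] * len(y))
train_rmse = rmse(y, [predict_gbdt(model, row) for row in X])
```

On this toy target the boosted ensemble fits the training set better than a constant mean predictor, which is all the sketch is meant to show. A practical setup would use a library such as LightGBM and evaluate on held-out complexes.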

Benchmark Results

Dataset    Model     Metric  Claimed  Verified  Status
CSAR-HiQ   LightGBM  RMSE    1.73     -         Unverified
PDBbind    LightGBM  RMSE    1.32     -         Unverified
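
For reference, the RMSE metric reported here is conventionally defined as follows. The notation is an assumption of this note, not taken from the paper: N is the number of evaluated complexes, y_i the measured binding affinity, and \hat{y}_i the model's prediction. Lower values indicate better agreement.

```latex
\mathrm{RMSE} = \sqrt{\frac{1}{N} \sum_{i=1}^{N} \left( y_i - \hat{y}_i \right)^2}
```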