High Performance of Gradient Boosting in Binding Affinity Prediction

2022-05-14Unverified0· sign in to hype

Dmitrii Gavrilev, Nurlybek Amangeldiuly, Sergei Ivanov, Evgeny Burnaev

Unverified — Be the first to reproduce this paper.

Abstract

Prediction of protein-ligand (PL) binding affinity remains the key to drug discovery. Popular approaches in recent years involve graph neural networks (GNNs), which are used to learn the topology and geometry of PL complexes. However, GNNs are computationally heavy and have poor scalability to graph sizes. On the other hand, traditional machine learning (ML) approaches, such as gradient-boosted decision trees (GBDTs), are lightweight yet extremely efficient for tabular data. We propose to use PL interaction features along with PL graph-level features in GBDT. We show that this combination outperforms the existing solutions.

Tasks

Drug Discovery Protein-Ligand Affinity Prediction Vocal Bursts Intensity Prediction

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CSAR-HiQ	LightGBM	RMSE	1.73	—	Unverified
PDBbind	LightGBM	RMSE	1.32	—	Unverified

High Performance of Gradient Boosting in Binding Affinity Prediction

Abstract

Tasks

Benchmark Results

Reproductions