SOTAVerified

GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

2023-08-14 · Code Available

PengFei Liu, Yiming Ren, Jun Tao, Zhixiang Ren


Abstract

Large language models have made significant strides in natural language processing, enabling innovative applications in molecular science by processing textual representations of molecules. However, most existing language models cannot capture the rich information contained in complex molecular structures or images. In this paper, we introduce GIT-Mol, a multi-modal large language model that integrates Graph, Image, and Text information. To facilitate the integration of multi-modal molecular data, we propose GIT-Former, a novel architecture capable of aligning all modalities into a unified latent space. We achieve a 5%-10% accuracy increase in property prediction and a 20.2% boost in molecule generation validity compared to the baselines. With the any-to-language molecular translation strategy, our model has the potential to perform further downstream tasks, such as compound name recognition and chemical reaction prediction.
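To make the alignment idea in the abstract concrete, here is a minimal sketch of projecting three modality embeddings (graph, image, text) into one shared latent space so that any pair can be compared directly. The dimensions, the simple linear projections, and the random toy batch are all assumptions for illustration; this is not GIT-Former's actual architecture, which uses learned cross-attention rather than fixed projections.

```python
import numpy as np

rng = np.random.default_rng(0)

# Assumed per-modality embedding sizes and a shared latent size (illustrative only).
D_GRAPH, D_IMAGE, D_TEXT, D_LATENT = 300, 512, 768, 256

# One linear projection per modality into the unified latent space.
W_graph = rng.standard_normal((D_GRAPH, D_LATENT)) / np.sqrt(D_GRAPH)
W_image = rng.standard_normal((D_IMAGE, D_LATENT)) / np.sqrt(D_IMAGE)
W_text = rng.standard_normal((D_TEXT, D_LATENT)) / np.sqrt(D_TEXT)

def project(x, W):
    """Project a batch of embeddings and L2-normalize for cosine similarity."""
    z = x @ W
    return z / np.linalg.norm(z, axis=-1, keepdims=True)

# A toy batch of 4 molecules, each observed through all three modalities.
graph_emb = rng.standard_normal((4, D_GRAPH))
image_emb = rng.standard_normal((4, D_IMAGE))
text_emb = rng.standard_normal((4, D_TEXT))

z_g = project(graph_emb, W_graph)
z_i = project(image_emb, W_image)
z_t = project(text_emb, W_text)

# After projection, all modalities live in the same space, so cross-modal
# similarities (e.g. graph-to-text) reduce to a cosine-similarity matrix.
sim_graph_text = z_g @ z_t.T
print(sim_graph_text.shape)  # (4, 4)
```

In the paper's setting, the projections would be trained so that matching graph/image/text views of the same molecule score highest on the diagonal of such a similarity matrix, which is what enables the any-to-language translation tasks.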


Benchmark Results

| Dataset | Model | Metric | Claimed | Verified | Status |
|---------|-------|--------|---------|----------|--------|
| BACE | GIT-Mol(G+S) | AUC | 0.81 | | Unverified |
| BBBP | GIT-Mol(G+S) | AUC | 0.74 | | Unverified |
| ClinTox | GIT-Mol(G+S) | AUC | 0.88 | | Unverified |
| SIDER | GIT-Mol(G+S) | AUC | 0.63 | | Unverified |
| Tox21 | GIT-Mol(G+S) | AUC | 0.76 | | Unverified |
| ToxCast | GIT-Mol(G+S) | AUC | 0.67 | | Unverified |
