TREE-G: Decision Trees Contesting Graph Neural Networks

2022-07-06Code Available1· sign in to hype

Maya Bechler-Speicher, Amir Globerson, Ran Gilad-Bachrach

Code Available — Be the first to reproduce this paper.

Code

github.com/mayabechlerspeicher/tree-g
OfficialIn paperpytorch★ 13

Abstract

When dealing with tabular data, models based on decision trees are a popular choice due to their high accuracy on these data types, their ease of application, and explainability properties. However, when it comes to graph-structured data, it is not clear how to apply them effectively, in a way that incorporates the topological information with the tabular data available on the vertices of the graph. To address this challenge, we introduce TREE-G. TREE-G modifies standard decision trees, by introducing a novel split function that is specialized for graph data. Not only does this split function incorporate the node features and the topological information, but it also uses a novel pointer mechanism that allows split nodes to use information computed in previous splits. Therefore, the split function adapts to the predictive task and the graph at hand. We analyze the theoretical properties of TREE-G and demonstrate its benefits empirically on multiple graph and vertex prediction benchmarks. In these experiments, TREE-G consistently outperforms other tree-based models and often outperforms other graph-learning algorithms such as Graph Neural Networks (GNNs) and Graph Kernels, sometimes by large margins. Moreover, TREE-Gs models and their predictions can be explained and visualized

Tasks

Graph Classification Graph Learning Graph Regression Node Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
D&D	TREE-G	Accuracy	76.2	—	Unverified
ENZYMES	TREE-G	Accuracy	59.6	—	Unverified
HIV dataset	TREE-G	Accuracy	83.5	—	Unverified
IMDb-B	TREE-G	Accuracy	73	—	Unverified
IMDb-M	TREE-G	Accuracy	56.4	—	Unverified
MUTAG	TREE-G	Accuracy	91.1	—	Unverified
Mutagenicity	TREE-G	Accuracy	83	—	Unverified
NCI1	TREE-G	Accuracy	75.9	—	Unverified
PROTEINS	TREE-G	Accuracy	75.6	—	Unverified
PTC	TREE-G	Accuracy	59.1	—	Unverified

TREE-G: Decision Trees Contesting Graph Neural Networks

Code

Abstract

Tasks

Benchmark Results

Reproductions