Inductive Representation Learning on Large Graphs
William L. Hamilton, Rex Ying, Jure Leskovec
Code
- github.com/williamleif/GraphSAGE (official, TensorFlow, ★ 0)
- github.com/massquantity/LibRecommender (TensorFlow, ★ 474)
- github.com/IllinoisGraphBenchmark/IGB-Datasets (PyTorch, ★ 86)
- github.com/zxhhh97/ABot (PyTorch, ★ 18)
- github.com/isotlaboratory/ml4vrp (PyTorch, ★ 16)
- github.com/weiyinwei/huign (PyTorch, ★ 10)
- github.com/chatterjeeayan/upna (PyTorch, ★ 1)
- github.com/erfmah/answering_graph_queries (PyTorch, ★ 1)
- github.com/weiyinwei/mmgcn (PyTorch, ★ 0)
- github.com/qema/orca-py (PyTorch, ★ 0)
Abstract
Low-dimensional embeddings of nodes in large graphs have proved extremely useful in a variety of prediction tasks, from content recommendation to identifying protein functions. However, most existing approaches require that all nodes in the graph are present during training of the embeddings; these previous approaches are inherently transductive and do not naturally generalize to unseen nodes. Here we present GraphSAGE, a general, inductive framework that leverages node feature information (e.g., text attributes) to efficiently generate node embeddings for previously unseen data. Instead of training individual embeddings for each node, we learn a function that generates embeddings by sampling and aggregating features from a node's local neighborhood. Our algorithm outperforms strong baselines on three inductive node-classification benchmarks: we classify the category of unseen nodes in evolving information graphs based on citation and Reddit post data, and we show that our algorithm generalizes to completely unseen graphs using a multi-graph dataset of protein-protein interactions.
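The core idea in the abstract, learning a function that embeds a node by sampling and aggregating features from its local neighborhood, can be sketched in a few lines. This is a minimal illustrative sketch, not the paper's implementation: the mean aggregator, the fan-out `k=5`, the toy graph, and the function names are all assumptions made here for clarity.

```python
import numpy as np

rng = np.random.default_rng(0)

def sample_neighbors(adj, node, k):
    """Sample a fixed-size neighbor set (with replacement when the
    neighborhood is smaller than k); fall back to a self-loop for
    isolated nodes."""
    nbrs = adj[node]
    if len(nbrs) == 0:
        return [node]
    return list(rng.choice(nbrs, size=k, replace=len(nbrs) < k))

def graphsage_layer(features, adj, W, k=5):
    """One sample-and-aggregate step with a mean aggregator."""
    out = []
    for v in range(features.shape[0]):
        nbr_feats = features[sample_neighbors(adj, v, k)]
        agg = nbr_feats.mean(axis=0)                 # aggregate sampled neighborhood
        h = np.concatenate([features[v], agg])       # combine self and neighborhood
        h = np.maximum(W @ h, 0.0)                   # linear map + ReLU
        out.append(h / (np.linalg.norm(h) + 1e-12))  # normalize the embedding
    return np.stack(out)

# Toy graph: 4 nodes, 3-dim input features, 2-dim output embeddings.
features = rng.normal(size=(4, 3))
adj = {0: [1, 2], 1: [0, 3], 2: [0], 3: [1]}
W = rng.normal(size=(2, 6))  # maps concat(self, agg) -> embedding
emb = graphsage_layer(features, adj, W)
print(emb.shape)  # (4, 2)
```

Because the layer is a function of features rather than a lookup table of per-node embeddings, it applies unchanged to nodes (or whole graphs) never seen during training, which is what makes the approach inductive.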
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| CIFAR10 100k | GraphSAGE | Accuracy (%) | 66.08 | — | Unverified |