CoDEx: A Comprehensive Knowledge Graph Completion Benchmark

2020-09-16 · EMNLP 2020 · Code Available

Tara Safavi, Danai Koutra


Code

github.com/tsafavi/codex
Official · In paper · pytorch · ★ 173
github.com/facebookresearch/ssl-relation-prediction
pytorch · ★ 111

Abstract

We present CoDEx, a set of knowledge graph completion datasets extracted from Wikidata and Wikipedia that improve upon existing knowledge graph completion benchmarks in scope and level of difficulty. In terms of scope, CoDEx comprises three knowledge graphs varying in size and structure, multilingual descriptions of entities and relations, and tens of thousands of hard negative triples that are plausible but verified to be false. To characterize CoDEx, we contribute thorough empirical analyses and benchmarking experiments. First, we analyze each CoDEx dataset in terms of logical relation patterns. Next, we report baseline link prediction and triple classification results on CoDEx for five extensively tuned embedding models. Finally, we differentiate CoDEx from the popular FB15K-237 knowledge graph completion dataset by showing that CoDEx covers more diverse and interpretable content, and is a more difficult link prediction benchmark. Data, code, and pretrained models are available at https://bit.ly/2EPbrJs.

Tasks

Benchmarking, Knowledge Graph Completion, Knowledge Graphs, Link Prediction, Triple Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CoDEx Large	TransE	MRR	0.19	—	Unverified
CoDEx Large	ComplEx	MRR	0.29	—	Unverified
CoDEx Large	TuckER	MRR	0.31	—	Unverified
CoDEx Large	ConvE	MRR	0.3	—	Unverified
CoDEx Large	RESCAL	MRR	0.3	—	Unverified
CoDEx Medium	ConvE	MRR	0.32	—	Unverified
CoDEx Medium	ComplEx	MRR	0.34	—	Unverified
CoDEx Medium	TuckER	MRR	0.33	—	Unverified
CoDEx Medium	RESCAL	MRR	0.32	—	Unverified
CoDEx Medium	TransE	MRR	0.3	—	Unverified
CoDEx Small	TuckER	MRR	0.44	—	Unverified
CoDEx Small	TransE	MRR	0.35	—	Unverified
CoDEx Small	ComplEx	MRR	0.4	—	Unverified
CoDEx Small	RESCAL	MRR	0.4	—	Unverified
CoDEx Small	ConvE	MRR	0.44	—	Unverified
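
The MRR column above is mean reciprocal rank, the usual summary metric for link prediction: each test query contributes 1/rank of the correct entity, and the values are averaged. The sketch below illustrates that definition only. The function names `rank_of_true` and `mean_reciprocal_rank` are hypothetical, and the tie handling (counting only strictly higher scores) is an assumption; the evaluation protocol behind the claimed numbers is not described on this page.

```python
from typing import Sequence


def rank_of_true(scores: Sequence[float], true_index: int) -> int:
    """Return the 1-based rank of the correct candidate.

    Assumes a higher score means a more plausible triple. Ties are
    resolved optimistically (only strictly higher scores push the
    rank down), which is an illustrative choice.
    """
    true_score = scores[true_index]
    return 1 + sum(1 for s in scores if s > true_score)


def mean_reciprocal_rank(ranks: Sequence[int]) -> float:
    """Average of 1/rank over all queries; ranks must be >= 1."""
    if not ranks:
        raise ValueError("ranks must be non-empty")
    if any(r < 1 for r in ranks):
        raise ValueError("ranks are 1-based and must be >= 1")
    return sum(1.0 / r for r in ranks) / len(ranks)


# Toy example with made-up scores for three queries.
toy_queries = [
    ([0.9, 0.1, 0.3], 0),  # correct candidate scored highest
    ([0.2, 0.8, 0.5], 2),  # one candidate scores higher
    ([0.7, 0.6, 0.9, 0.1], 3),  # three candidates score higher
]
toy_ranks = [rank_of_true(scores, idx) for scores, idx in toy_queries]
toy_mrr = mean_reciprocal_rank(toy_ranks)
```

Because reciprocal rank decays quickly, MRR is dominated by queries where the correct entity lands near the top, so small differences between models in the table mostly reflect how often each model ranks the answer first or second.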
