Authorship clustering using multi-headed recurrent neural networks

2016-08-16Code Available0· sign in to hype

Douglas Bagnall

Code Available — Be the first to reproduce this paper.

Code

github.com/douglasbagnall/bog
OfficialIn papernone★ 0

Abstract

A recurrent neural network that has been trained to separately model the language of several documents by unknown authors is used to measure similarity between the documents. It is able to find clues of common authorship even when the documents are very short and about disparate topics. While it is easy to make statistically significant predictions regarding authorship, it is difficult to group documents into definite clusters with high accuracy.

Tasks

Clustering

Authorship clustering using multi-headed recurrent neural networks

Code

Abstract

Tasks

Reproductions