
CLFD: A Novel Vectorization Technique and Its Application in Fake News Detection

2020-05-01 · LREC 2020

Michail Mersinias, Stergos Afantenos, Georgios Chalkiadakis


Abstract

In recent years, fake news detection has been an emerging research area. In this paper, we put forward a novel statistical approach for the generation of feature vectors to describe a document. Our approach, the so-called class label frequency distance (clfd), is shown experimentally to provide an effective way of boosting the performance of machine learning methods. Specifically, our experiments, carried out in the fake news detection domain, verify that efficient traditional machine learning methods that use our vectorization approach consistently outperform deep learning methods that use word embeddings for small and medium-sized datasets, while the results are comparable for large datasets. In addition, we demonstrate that a novel hybrid method that utilizes both a clfd-boosted logistic regression classifier and a deep learning one clearly outperforms deep learning methods even in large datasets.
