Word Embeddings for Multi-label Document Classification

2017-09-01RANLP 2017Unverified0· sign in to hype

Ladislav Lenc, Pavel Kr{\'a}l

Unverified — Be the first to reproduce this paper.

Abstract

In this paper, we analyze and evaluate word embeddings for representation of longer texts in the multi-label classification scenario. The embeddings are used in three convolutional neural network topologies. The experiments are realized on the Czech CTK and English Reuters-21578 standard corpora. We compare the results of word2vec static and trainable embeddings with randomly initialized word vectors. We conclude that initialization does not play an important role for classification. However, learning of word vectors is crucial to obtain good results.

Tasks

Classification Document Classification General Classification Multi-Label Classification MUlTI-LABEL-ClASSIFICATION Sentiment Analysis Text Classification Word Embeddings

Word Embeddings for Multi-label Document Classification

Abstract

Tasks

Reproductions