Improving text classification with vectors of reduced precision

2017-06-20

Krzysztof Wróbel, Maciej Wielgosz, Marcin Pietroń, Michał Karwatowski, Aleksander Smywiński-Pohl

Code

github.com/kwrobel-nlp/precision-reduction
Official, in paper

Abstract

This paper presents an analysis of the impact of floating-point precision reduction on the quality of text classification. Reducing the precision of the vectors representing the data (e.g. the TF-IDF representation in our case) allows for a decrease in computing time and memory footprint on dedicated hardware platforms. The impact of precision reduction on classification quality was evaluated on 5 corpora, using 4 different classifiers. Dimensionality reduction was also taken into account. Results indicate that precision reduction improves classification accuracy in most cases (up to a 25% error reduction). In general, the reduction from 64 to 4 bits gives the best scores and ensures that the results will not be worse than with the full floating-point representation.
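
To make the idea of reducing vector precision concrete, here is a minimal sketch in Python. It is not taken from the paper or its repository: it assumes a simple uniform quantization of values in [0, 1] to 2^bits evenly spaced levels, and the function name `reduce_precision` and the toy TF-IDF values are hypothetical. The authors' actual reduction scheme may differ.

```python
import numpy as np


def reduce_precision(x, bits):
    """Round values in [0, 1] to the nearest of 2**bits evenly spaced levels.

    Assumption: plain uniform quantization, used here only for illustration.
    """
    levels = 2 ** bits - 1
    x = np.clip(np.asarray(x, dtype=np.float64), 0.0, 1.0)
    return np.round(x * levels) / levels


# Toy TF-IDF-like rows (hypothetical values), L2-normalised so that
# every entry lies in [0, 1].
tfidf = np.array([[0.0, 0.3, 0.9, 0.1],
                  [0.5, 0.0, 0.2, 0.7]])
tfidf = tfidf / np.linalg.norm(tfidf, axis=1, keepdims=True)

# Keep only 4 bits of resolution per feature value.
quantized = reduce_precision(tfidf, bits=4)

# The rounding error per entry is bounded by half a quantization step.
max_error = np.abs(quantized - tfidf).max()
```

With `bits=4`, each feature can take at most 16 distinct values, so a vector could in principle be stored in a fraction of the memory needed for 64-bit floats. Whether accuracy is preserved depends on the corpus and classifier, which is what the paper evaluates.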

Tasks

Classification, Dimensionality Reduction, General Classification, Text Classification
