
Word2Bits - Quantized Word Vectors

2018-03-15

Maximilian Lam

Abstract

Word vectors require significant amounts of memory and storage, posing issues for resource-limited devices like mobile phones and GPUs. We show that high-quality quantized word vectors using 1-2 bits per parameter can be learned by introducing a quantization function into Word2Vec. We furthermore show that training with the quantization function acts as a regularizer. We train word vectors on English Wikipedia (2017) and evaluate them on standard word similarity and analogy tasks and on question answering (SQuAD). Our quantized word vectors not only take 8-16x less space than full-precision (32-bit) word vectors but also outperform them on word similarity tasks and question answering.
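To make the idea concrete, below is a minimal sketch of training with a quantization function: the forward pass uses quantized vectors, while gradient updates are applied to an underlying full-precision copy (a straight-through-style scheme). The quantization level (±1/3) and the update step are illustrative assumptions, not the paper's exact implementation.

```python
import numpy as np

def quantize_1bit(w, scale=1.0 / 3.0):
    # Map every parameter to +scale or -scale: 1 bit of information
    # per value. The +-1/3 level is an assumed choice for illustration.
    return np.where(w >= 0, scale, -scale).astype(w.dtype)

# Hypothetical training step: the Word2Vec loss would be computed with
# the quantized vectors, but the gradient is applied to the underlying
# full-precision weights (straight-through style).
rng = np.random.default_rng(0)
w_full = rng.uniform(-0.5, 0.5, size=(10_000, 300)).astype(np.float32)

w_quant = quantize_1bit(w_full)  # used in the forward pass
grad = rng.standard_normal(w_full.shape).astype(np.float32)  # stand-in gradient
w_full -= 0.05 * grad            # update the full-precision copy
```

After training, only the quantized values need to be stored; since each parameter takes one of a small set of levels, the vectors can be bit-packed, which is where the abstract's space savings over 32-bit floats come from.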
