Gzip versus bag-of-words for text classification
2023-07-27
Juri Opitz
Code
- github.com/flipz357/npc_gzip_exp (official, in paper, PyTorch, ★ 9)
Abstract
The effectiveness of compression in text classification ('gzip') has recently attracted considerable attention. In this note, we show that 'bag-of-words' approaches can achieve similar or better results and are more efficient.
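
To make the comparison concrete, the following is a minimal sketch of the two kinds of text distance the abstract contrasts, each plugged into a 1-nearest-neighbour classifier. It is not the code from the linked repository: the function names (`ncd`, `bow_distance`, `predict_1nn`), the toy training set, and the particular bag-of-words distance (one minus multiset overlap) are all assumptions chosen for illustration. The gzip side uses the commonly cited normalized compression distance.

```python
import gzip
from collections import Counter


def ncd(a: str, b: str) -> float:
    """Normalized compression distance with gzip (illustrative formulation)."""
    ca = len(gzip.compress(a.encode("utf-8")))
    cb = len(gzip.compress(b.encode("utf-8")))
    cab = len(gzip.compress((a + " " + b).encode("utf-8")))
    return (cab - min(ca, cb)) / max(ca, cb)


def bow_distance(a: str, b: str) -> float:
    """One minus the multiset overlap of lowercased whitespace tokens.

    This is an assumed, deliberately simple bag-of-words distance,
    not necessarily the one used in the paper.
    """
    wa, wb = Counter(a.lower().split()), Counter(b.lower().split())
    shared = sum((wa & wb).values())  # multiset intersection size
    total = sum((wa | wb).values())   # multiset union size
    return 1.0 - shared / total if total else 0.0


def predict_1nn(query, train, distance):
    """Return the label of the training text closest to the query."""
    _, nearest_label = min(train, key=lambda pair: distance(query, pair[0]))
    return nearest_label


# Hypothetical toy data, for illustration only.
train = [
    ("the team won the football match", "sports"),
    ("the striker scored a late goal", "sports"),
    ("the central bank raised interest rates", "finance"),
    ("stock markets fell after the earnings report", "finance"),
]
query = "the bank cut interest rates"

print("bag-of-words:", predict_1nn(query, train, bow_distance))
print("gzip (NCD):  ", predict_1nn(query, train, ncd))
```

Both classifiers share the same nearest-neighbour step and differ only in the distance function. The gzip variant compresses the query together with every training text at prediction time, whereas the bag-of-words variant only counts tokens, which is one plausible reading of the efficiency claim.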