Unishox: A hybrid encoder for Short Unicode Strings
2022-01-18The Journal of Open Source Software 2022Code Available2· sign in to hype
Arundale Ramanathan
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/siara-cc/unishoxOfficialIn papernone★ 237
- github.com/siara-cc/Unishox2In papernone★ 237
Abstract
Unishox is a hybrid encoding technique with which short unicode strings could be compressed using context aware pre-mapped codes and delta coding resulting in surprisingly good ratios. Space occupied by short strings become significant in memory constrained environments such as Arduino Uno and ESP8266. Text exchange in Chat applications and social media posts is another area where cost savings could be seen using such compression. It is also possible to achieve savings in bandwidth and storage cost by storing and retrieving independent strings in Cloud databases.