SOTAVerified

Unishox: A hybrid encoder for Short Unicode Strings

2022-01-18The Journal of Open Source Software 2022Code Available2· sign in to hype

Arundale Ramanathan

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Unishox is a hybrid encoding technique with which short unicode strings could be compressed using context aware pre-mapped codes and delta coding resulting in surprisingly good ratios. Space occupied by short strings become significant in memory constrained environments such as Arduino Uno and ESP8266. Text exchange in Chat applications and social media posts is another area where cost savings could be seen using such compression. It is also possible to achieve savings in bandwidth and storage cost by storing and retrieving independent strings in Cloud databases.

Reproductions