SOTAVerified

Irrational Complex Rotations Empower Low-bit Optimizers

2025-01-22Unverified0· sign in to hype

Zhen Tian, Wayne Xin Zhao, Ji-Rong Wen

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

In this paper, we propose a novel optimizer state compression algorithm, namely -Quant, which leverages the properties of irrational numbers (e.g., ) for memory-efficient training. The core idea is based on our mathematical findings, which show that a pair of parameters can be represented by a single rotation angle using the complex rotation scheme. Building on this insight, we map the parameters into a complex space and perform quantization using the corresponding rotation angles. To efficiently integrate it into optimization process, we develop an efficient system of geometric equations that computes the precise rotation angles with linear complexity. We evaluate -Quant on a wide range of tasks. Our experiments show that it can reduce the bit-width of parameters to 3.32-bit, achieving a 75% reduction in parameter scale and a 40% decrease in GPU memory usage, all while maintaining full accuracy.

Tasks

Reproductions