AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval

2024-04-09Code Available2· sign in to hype

Kento Tatsuno, Daisuke Miyashita, Taiga Ikeda, Kiyoshi Ishiyama, Kazunari Sumiyoshi, Jun Deguchi

Code Available — Be the first to reproduce this paper.

Code

github.com/cwida/pdx
none★ 83

Abstract

Graph-based approximate nearest neighbor search (ANNS) algorithms work effectively against large-scale vector retrieval. Among such methods, DiskANN achieves good recall-speed tradeoffs using both DRAM and storage. DiskANN adopts product quantization (PQ) to reduce memory usage, which is still proportional to the scale of datasets. In this paper, we propose All-in-Storage ANNS with Product Quantization (AiSAQ), which offloads compressed vectors to the SSD index. Our method achieves 10 MB memory usage in query search with billion-scale datasets without critical latency degradation. AiSAQ also reduces the index load time for query search preparation, which enables fast switch between muitiple billion-scale indices.This method can be applied to retrievers of retrieval-augmented generation (RAG) and be scaled out with multiple-server systems for emerging datasets. Our DiskANN-based implementation is available on GitHub.

Tasks

All Information Retrieval Quantization RAG Retrieval Retrieval-augmented Generation

AiSAQ: All-in-Storage ANNS with Product Quantization for DRAM-free Information Retrieval

Code

Abstract

Tasks

Reproductions