
Differential Privacy of Cross-Attention with Provable Guarantee

2024-07-20

Yingyu Liang, Zhenmei Shi, Zhao Song, Yufa Zhou


Abstract

Cross-attention has become a fundamental module in many important artificial intelligence applications, e.g., retrieval-augmented generation (RAG), system prompts, guided stable diffusion, and many more. Ensuring cross-attention privacy is crucial and urgently needed because its key and value matrices may contain sensitive information about model providers and their users. In this work, we design a novel differential privacy (DP) data structure to address the privacy of cross-attention with a theoretical guarantee. In detail, let n be the input token length of the system prompt/RAG data, d be the feature dimension, 0 < α < 1 be the relative error parameter, R be the maximum value of the query and key matrices, R_w be the maximum value of the value matrix, and r, s, ε_s be parameters of the polynomial kernel method. Then, our data structure requires O(ndr^2) memory consumption with O(nr^2) initialization time complexity and O(α^-1 r^2) query time complexity for a single token query. In addition, our data structure can guarantee that the process of answering user queries satisfies (ε, δ)-DP with O(n^-1 ε^-1 α^-1/2 R^{2s} R_w r^2) additive error and n^-1 (α + ε_s) relative error between our output and the true answer. Furthermore, our result is robust to adaptive queries in which users can intentionally attack the cross-attention system. To our knowledge, this is the first work to provide DP for cross-attention, and it is promising to inspire more privacy algorithm design in large generative models (LGMs).
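The abstract describes answering cross-attention queries from a precomputed data structure: the private key/value matrices are collapsed into summaries via a polynomial kernel feature map, noise is injected for privacy, and each token query is then answered from the noisy summaries alone. The sketch below illustrates that general idea only; it is not the paper's data structure. The Taylor feature map `poly_features`, the flat `noise_scale`, and the function names are assumptions, and no sensitivity calibration is performed, so this code does not by itself achieve an (ε, δ)-DP guarantee.

```python
import numpy as np

rng = np.random.default_rng(0)

def poly_features(x, s):
    """Degree-s Taylor feature map phi with phi(q) @ phi(k) = sum_{t<=s} (q@k)^t / t!,
    i.e. a polynomial-kernel approximation of exp(q @ k). The feature dimension
    grows like d^s, so this is only illustrative for tiny d and s."""
    feats, cur = [np.ones(1)], np.ones(1)
    for t in range(1, s + 1):
        cur = np.outer(cur, x).ravel() / np.sqrt(t)  # tensor power of x, scaled by 1/sqrt(t!)
        feats.append(cur)
    return np.concatenate(feats)

def build_dp_summaries(K, V, noise_scale, s):
    """One-time 'initialization': collapse the private K (n x d) and V (n x d_v)
    into two noisy summaries. Later queries read only these summaries, so with
    noise_scale calibrated to their sensitivity (NOT done here) the whole
    query-answering process would inherit a DP guarantee."""
    Phi = np.stack([poly_features(k, s) for k in K])                     # (n, r)
    A = Phi.T @ V + rng.normal(0.0, noise_scale, (Phi.shape[1], V.shape[1]))
    b = Phi.sum(axis=0) + rng.normal(0.0, noise_scale, Phi.shape[1])
    return A, b

def dp_cross_attention(q, A, b, s):
    """Answer a single token query from the noisy summaries alone:
    approximately softmax(q @ K.T) @ V, i.e. one row of cross-attention."""
    phi = poly_features(q, s)
    return (phi @ A) / max(phi @ b, 1e-8)
```

With `noise_scale=0` and small-magnitude inputs this closely matches exact softmax cross-attention, which corresponds to the relative-error part of the stated guarantee; the additive-error term in the abstract comes from the injected noise.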
