Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting

2024-10-01Code Available0· sign in to hype

Stephen Meisenbacher, Florian Matthes

Code Available — Be the first to reproduce this paper.

Code

github.com/sjmeis/DPNONDP
Officialpytorch★ 1

Abstract

The field of privacy-preserving Natural Language Processing has risen in popularity, particularly at a time when concerns about privacy grow with the proliferation of Large Language Models. One solution consistently appearing in recent literature has been the integration of Differential Privacy (DP) into NLP techniques. In this paper, we take these approaches into critical view, discussing the restrictions that DP integration imposes, as well as bring to light the challenges that such restrictions entail. To accomplish this, we focus on DP-Prompt, a recent method for text privatization leveraging language models to rewrite texts. In particular, we explore this rewriting task in multiple scenarios, both with DP and without DP. To drive the discussion on the merits of DP in NLP, we conduct empirical utility and privacy experiments. Our results demonstrate the need for more discussion on the usability of DP in NLP and its benefits over non-DP approaches.

Tasks

Language Modeling Language Modelling Privacy Preserving

Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting

Code

Abstract

Tasks

Reproductions