Context-Aware Membership Inference Attacks against Pre-trained Large Language Models

2024-09-11

Hongyan Chang, Ali Shahin Shamsabadi, Kleomenis Katevas, Hamed Haddadi, Reza Shokri


Abstract

Prior Membership Inference Attacks (MIAs) on pre-trained Large Language Models (LLMs), adapted from attacks on classification models, fail because they ignore the generative process of LLMs across token sequences. In this paper, we present a novel attack that adapts MIA statistical tests to the perplexity dynamics of subsequences within a data point. Our method significantly outperforms prior loss-based approaches, revealing context-dependent memorization patterns in pre-trained LLMs.
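To make "perplexity dynamics of subsequences" concrete, the sketch below tracks how perplexity changes as a sequence is read one token at a time. It is only an illustration, not the attack from the paper: the function names, the choice of growing prefixes as the subsequences, and the least-squares slope used as a summary feature are all assumptions made here. The input is assumed to be a list of per-token natural-log probabilities already obtained from some language model.

```python
import math


def prefix_perplexities(token_log_probs):
    """Perplexity of each prefix x_1..x_t, given per-token natural-log probabilities.

    Hypothetical helper: the per-token log-probabilities are assumed to come
    from whatever model is being audited.
    """
    perplexities = []
    running_nll = 0.0  # cumulative negative log-likelihood of the prefix
    for t, log_prob in enumerate(token_log_probs, start=1):
        running_nll -= log_prob
        perplexities.append(math.exp(running_nll / t))
    return perplexities


def perplexity_slope(perplexities):
    """Least-squares slope of perplexity against token position.

    Illustrative feature only (an assumption, not the paper's statistic):
    a negative slope means the model grows more confident as context accumulates.
    """
    n = len(perplexities)
    if n < 2:
        return 0.0
    mean_x = (n - 1) / 2
    mean_y = sum(perplexities) / n
    cov = sum((i - mean_x) * (p - mean_y) for i, p in enumerate(perplexities))
    var = sum((i - mean_x) ** 2 for i in range(n))
    return cov / var


# Toy log-probabilities: the model becomes more confident token by token.
toy_log_probs = [math.log(0.1), math.log(0.5), math.log(0.9)]
toy_trajectory = prefix_perplexities(toy_log_probs)
toy_slope = perplexity_slope(toy_trajectory)
```

A loss-based attack would collapse the sequence into its final average loss, whereas the trajectory keeps the per-position information that a context-aware test could exploit.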

Tasks

Memorization
