Char-mander Use mBackdoor! A Study of Cross-lingual Backdoor Attacks in Multilingual LLMs

2025-02-24Code Available0· sign in to hype

Himanshu Beniwal, Sailesh Panda, Birudugadda Srivibhav, Mayank Singh

Code Available — Be the first to reproduce this paper.

Code

github.com/himanshubeniwal/x-bat
OfficialIn paperpytorch★ 1

Abstract

We explore Cross-lingual Backdoor ATtacks (X-BAT) in multilingual Large Language Models (mLLMs), revealing how backdoors inserted in one language can automatically transfer to others through shared embedding spaces. Using toxicity classification as a case study, we demonstrate that attackers can compromise multilingual systems by poisoning data in a single language, with rare and high-occurring tokens serving as specific, effective triggers. Our findings expose a critical vulnerability that influences the model's architecture, resulting in a concealed backdoor effect during the information flow. Our code and data are publicly available https://github.com/himanshubeniwal/X-BAT.

Tasks

Cross-Lingual Transfer

Char-mander Use mBackdoor! A Study of Cross-lingual Backdoor Attacks in Multilingual LLMs

Code

Abstract

Tasks

Reproductions