SOTAVerified

LLM Jailbreak

Papers

Showing 110 of 24 papers

TitleStatusHype
CAVGAN: Unifying Jailbreak and Defense of LLMs via Generative Adversarial Attacks on their Internal RepresentationsCode0
LLM Jailbreak Oracle0
SecurityLingua: Efficient Defense of LLM Jailbreak Attacks via Security-Aware Prompt Compression0
PandaGuard: Systematic Evaluation of LLM Safety against Jailbreaking AttacksCode2
Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt Generation for Enhanced LLM Content ModerationCode0
CySecBench: Generative AI-based CyberSecurity-focused Prompt Dataset for Benchmarking Large Language ModelsCode1
DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak0
POEX: Understanding and Mitigating Policy Executable Jailbreak Attacks against Embodied AI0
SATA: A Paradigm for LLM Jailbreak via Simple Assistive Task LinkageCode0
SMILES-Prompting: A Novel Approach to LLM Jailbreak Attacks in Chemical SynthesisCode0
Show:102550
← PrevPage 1 of 3Next →

No leaderboard results yet.