| SMILES-Prompting: A Novel Approach to LLM Jailbreak Attacks in Chemical Synthesis | Oct 21, 2024 | LLM JailbreakRed Teaming | CodeCode Available | 0 | 5 |
| SATA: A Paradigm for LLM Jailbreak via Simple Assistive Task Linkage | Dec 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response | May 22, 2024 | LLM JailbreakSafety Alignment | —Unverified | 0 | 0 |
| DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak | Dec 23, 2024 | DenoisingDiversity | —Unverified | 0 | 0 |
| Efficient Indirect LLM Jailbreak via Multimodal-LLM Jailbreak | May 30, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks | Oct 5, 2024 | LLM Jailbreak | —Unverified | 0 | 0 |
| Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack | Apr 2, 2024 | LLM Jailbreak | —Unverified | 0 | 0 |
| Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Carrier Articles | Aug 20, 2024 | ArticlesLanguage Modeling | —Unverified | 0 | 0 |
| HSF: Defending against Jailbreak Attacks with Hidden State Filtering | Aug 31, 2024 | LLM Jailbreak | —Unverified | 0 | 0 |
| LLM Jailbreak Oracle | Jun 17, 2025 | LLM Jailbreak | —Unverified | 0 | 0 |