| Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks | Oct 5, 2024 | LLM Jailbreak | —Unverified | 0 |
| Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack | Apr 2, 2024 | LLM Jailbreak | —Unverified | 0 |
| Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Carrier Articles | Aug 20, 2024 | ArticlesLanguage Modeling | —Unverified | 0 |
| HSF: Defending against Jailbreak Attacks with Hidden State Filtering | Aug 31, 2024 | LLM Jailbreak | —Unverified | 0 |
| LLM Jailbreak Oracle | Jun 17, 2025 | LLM Jailbreak | —Unverified | 0 |
| POEX: Understanding and Mitigating Policy Executable Jailbreak Attacks against Embodied AI | Dec 21, 2024 | LLM JailbreakRed Teaming | —Unverified | 0 |
| SecurityLingua: Efficient Defense of LLM Jailbreak Attacks via Security-Aware Prompt Compression | Jun 15, 2025 | LLM JailbreakSafety Alignment | —Unverified | 0 |
| Self-Deception: Reverse Penetrating the Semantic Firewall of Large Language Models | Aug 16, 2023 | LLM Jailbreak | —Unverified | 0 |
| SelfDefend: LLMs Can Defend Themselves against Jailbreaking in a Practical Manner | Jun 8, 2024 | Adversarial AttackLLM Jailbreak | —Unverified | 0 |
| Graph of Attacks with Pruning: Optimizing Stealthy Jailbreak Prompt Generation for Enhanced LLM Content Moderation | Jan 28, 2025 | LLM Jailbreak | CodeCode Available | 0 |