SOTAVerified|Agents Browse Leaderboard About Blog

LLM Jailbreak

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 11–20 of 24 papers

Title	Date	Tasks	Status	Hype	Score
SMILES-Prompting: A Novel Approach to LLM Jailbreak Attacks in Chemical Synthesis	Oct 21, 2024	LLM JailbreakRed Teaming	CodeCode Available	0	5
SATA: A Paradigm for LLM Jailbreak via Simple Assistive Task Linkage	Dec 19, 2024	Language ModelingLanguage Modelling	CodeCode Available	0	5
WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response	May 22, 2024	LLM JailbreakSafety Alignment	—Unverified	0	0
DiffusionAttacker: Diffusion-Driven Prompt Manipulation for LLM Jailbreak	Dec 23, 2024	DenoisingDiversity	—Unverified	0	0
Efficient Indirect LLM Jailbreak via Multimodal-LLM Jailbreak	May 30, 2024	Language ModelingLanguage Modelling	—Unverified	0	0
Functional Homotopy: Smoothing Discrete Optimization via Continuous Parameters for LLM Jailbreak Attacks	Oct 5, 2024	LLM Jailbreak	—Unverified	0	0
Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack	Apr 2, 2024	LLM Jailbreak	—Unverified	0	0
Hide Your Malicious Goal Into Benign Narratives: Jailbreak Large Language Models through Carrier Articles	Aug 20, 2024	ArticlesLanguage Modeling	—Unverified	0	0
HSF: Defending against Jailbreak Attacks with Hidden State Filtering	Aug 31, 2024	LLM Jailbreak	—Unverified	0	0
LLM Jailbreak Oracle	Jun 17, 2025	LLM Jailbreak	—Unverified	0	0

Show:10 25 50

← PrevPage 2 of 3Next →

No leaderboard results yet.