SOTAVerified
LLM Jailbreak
Papers
Showing 21–24 of 24 papers
| Title | Date | Tasks | Code | Status | Hype |
|---|---|---|---|---|---|
| WordGame: Efficient & Effective LLM Jailbreak via Simultaneous Obfuscation in Query and Response | May 22, 2024 | LLM Jailbreak, Safety Alignment | — | Unverified | 0 |
| Efficient LLM Jailbreak via Adaptive Dense-to-sparse Constrained Optimization | May 15, 2024 | LLM Jailbreak | Code | Code Available | 0 |
| Great, Now Write an Article About That: The Crescendo Multi-Turn LLM Jailbreak Attack | Apr 2, 2024 | LLM Jailbreak | — | Unverified | 0 |
| Self-Deception: Reverse Penetrating the Semantic Firewall of Large Language Models | Aug 16, 2023 | LLM Jailbreak | — | Unverified | 0 |
No leaderboard results yet.