SOTAVerified

Red Teaming

Papers

Showing 4150 of 251 papers

TitleStatusHype
RAG LLMs are Not Safer: A Safety Analysis of Retrieval-Augmented Generation for Large Language Models0
Understanding and Mitigating Risks of Generative AI in Financial Services0
RainbowPlus: Enhancing Adversarial Prompt Generation via Evolutionary Quality-Diversity SearchCode1
ELAB: Extensive LLM Alignment Benchmark in Persian Language0
X-Teaming: Multi-Turn Jailbreaks and Defenses with Adaptive Multi-Agents0
The Structural Safety Generalization ProblemCode0
Multi-lingual Multi-turn Automated Red Teaming for LLMs0
Strategize Globally, Adapt Locally: A Multi-Turn Red Teaming Agent with Dual-Level Learning0
sudo rm -rf agentic_securityCode1
Red Teaming with Artificial Intelligence-Driven Cyberattacks: A Scoping Review0
Show:102550
← PrevPage 5 of 26Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SUDOAttack Success Rate41Unverified