| Red-Teaming Text-to-Image Systems by Rule-based Preference Modeling | May 27, 2025 | Red Teaming | —Unverified | 0 |
| Red Teaming the Mind of the Machine: A Systematic Evaluation of Prompt Injection and Jailbreak Vulnerabilities in LLMs | May 7, 2025 | Red Teaming | —Unverified | 0 |
| Red-Teaming the Stable Diffusion Safety Filter | Oct 3, 2022 | Image Generation, Red Teaming | —Unverified | 0 |
| Red Teaming Visual Language Models | Jan 23, 2024 | Fairness, Red Teaming | —Unverified | 0 |
| Red Teaming with Artificial Intelligence-Driven Cyberattacks: A Scoping Review | Mar 25, 2025 | Articles, Red Teaming | —Unverified | 0 |
| Reinforced Diffuser for Red Teaming Large Vision-Language Models | Mar 8, 2025 | Large Language Model, Red Teaming | —Unverified | 0 |
| RRTL: Red Teaming Reasoning Large Language Models in Tool Learning | May 21, 2025 | Red Teaming | —Unverified | 0 |
| Ruby Teaming: Improving Quality Diversity Search with Memory for Automated Red Teaming | Jun 17, 2024 | Diversity, Red Teaming | —Unverified | 0 |
| SafeCOMM: What about Safety Alignment in Fine-Tuned Telecom Large Language Models? | May 29, 2025 | Diagnostic, Red Teaming | —Unverified | 0 |
| Safety Alignment for Vision Language Models | May 22, 2024 | Red Teaming, Safety Alignment | —Unverified | 0 |