
Adversarial Text

Adversarial Text refers to a specialised text sequence designed specifically to influence the prediction of a language model. Adversarial Text attacks are generally carried out against Large Language Models (LLMs). Research into the different adversarial approaches helps us build effective defence mechanisms that detect malicious text input, and ultimately build more robust language models.
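As a minimal illustration of the idea (not a method from any specific paper listed below), the sketch below applies a simple character-level perturbation, swapping Latin characters for visually similar Cyrillic ones. Real attacks search for substitutions that maximally change a target model's prediction; this hypothetical `perturb` function only shows the perturbation step itself.

```python
# Minimal sketch of a character-level adversarial text perturbation.
# The mapping and function are illustrative assumptions, not a real attack.

HOMOGLYPHS = {"a": "\u0430", "e": "\u0435", "o": "\u043e"}  # Latin -> Cyrillic look-alikes

def perturb(text: str, budget: int = 3) -> str:
    """Replace up to `budget` characters with visually similar ones."""
    out = []
    used = 0
    for ch in text:
        if used < budget and ch in HOMOGLYPHS:
            out.append(HOMOGLYPHS[ch])
            used += 1
        else:
            out.append(ch)
    return "".join(out)

clean = "adversarial text example"
adv = perturb(clean)
print(clean == adv)            # False: the code points differ
print(len(clean) == len(adv))  # True: same length, visually near-identical
```

To a human reader the perturbed string looks unchanged, but a model's tokenizer sees different code points, which can flip its prediction; defences along these lines (e.g. SHAP-based detection, in the papers below) aim to catch exactly such inputs.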

Papers

Showing 51-60 of 114 papers

Title | Hype
Adversarial Training: A simple and efficient technique to Improving NLP Robustness | 0
A Grey-box Text Attack Framework using Explainable AI | 0
A survey on text generation using generative adversarial networks | 0
Autonomous LLM-Enhanced Adversarial Attack for Text-to-Motion | 0
PBI-Attack: Prior-Guided Bimodal Interactive Black-Box Jailbreak Attack for Toxicity Maximization | 0
CAT-Gen: Improving Robustness in NLP Models via Controlled Adversarial Text Generation | 0
Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense? | 0
Continuous Adversarial Text Representation Learning for Affective Recognition | 0
Data-Driven Mitigation of Adversarial Text Perturbation | 0
Detecting Adversarial Text Attacks via SHapley Additive exPlanations | 0
Page 6 of 12

No leaderboard results yet.