
Adversarial Text

Adversarial text is a specialised text sequence designed to influence the prediction of a language model. Adversarial text attacks are generally carried out on large language models (LLMs). Research into different adversarial approaches can help build effective defense mechanisms that detect malicious text input, and can help build more robust language models.
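
As a rough illustration of the idea, the sketch below perturbs an input so that a classifier changes its prediction. It is a toy under stated assumptions: `toy_classifier`, `SPAM_WORDS`, `SUBSTITUTIONS`, and `perturb` are hypothetical names invented for this example, and the keyword filter stands in for a real language model. It is not taken from any of the papers listed on this page.

```python
# Toy illustration only: a hypothetical keyword-based spam filter,
# used here as a stand-in for a real language model.
SPAM_WORDS = {"free", "winner", "prize"}

def toy_classifier(text: str) -> str:
    """Label text 'spam' if any token (minus punctuation) is a known spam keyword."""
    tokens = text.lower().split()
    return "spam" if any(tok.strip(".,!?") in SPAM_WORDS for tok in tokens) else "ham"

# Visually similar character swaps (an assumed, minimal perturbation set).
SUBSTITUTIONS = {"e": "3", "i": "1", "o": "0"}

def perturb(text: str) -> str:
    """Swap one character in each spam keyword so the keyword no longer matches."""
    out = []
    for tok in text.split():
        if tok.strip(".,!?").lower() in SPAM_WORDS:
            for ch, sub in SUBSTITUTIONS.items():
                if ch in tok:
                    tok = tok.replace(ch, sub, 1)  # replace first occurrence only
                    break
        out.append(tok)
    return " ".join(out)

original = "You are a winner, claim your free prize!"
adversarial = perturb(original)

print(toy_classifier(original))     # prediction on the clean input
print(adversarial)                  # perturbed text, still readable by a human
print(toy_classifier(adversarial))  # prediction on the perturbed input
```

The perturbed sentence stays legible to a person while the toy filter's label changes. Attacks on real models rely on the same principle, though they typically search for perturbations using model feedback rather than a fixed substitution table.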

Papers


Showing 41–50 of 114 papers

Title	Date	Tasks	Status	Hype
Towards a Robust Detection of Language Model Generated Text: Is ChatGPT that Easy to Detect?	Jun 9, 2023	Adversarial Text, Language Modeling	Unverified	0
VoteTRANS: Detecting Adversarial Text without Training by Voting on Hard Labels of Transformations	Jun 2, 2023	Adversarial Text	Code Available	0
How do humans perceive adversarial text? A reality check on the validity and naturalness of word-based adversarial attacks	May 24, 2023	Adversarial Text	Unverified	0
Iterative Adversarial Attack on Image-guided Story Ending Generation	May 16, 2023	Adversarial Attack, Adversarial Robustness	Unverified	0
Less is More: Removing Text-regions Improves CLIP Training Efficiency and Robustness	May 8, 2023	Adversarial Text, Retrieval	Code Available	0
Towards Imperceptible Document Manipulations against Neural Ranking Models	May 3, 2023	Adversarial Text, Language Modeling	Unverified	0
A Pilot Study of Query-Free Adversarial Attack against Stable Diffusion	Mar 29, 2023	Adversarial Attack, Adversarial Robustness	Code Available	1
Frauds Bargain Attack: Generating Adversarial Text Samples via Word Manipulation Process	Mar 1, 2023	Adversarial Text, Sentence	Code Available	0
Improved Training of Mixture-of-Experts Language GANs	Feb 23, 2023	Adversarial Text, Image Generation	Unverified	0
RETVec: Resilient and Efficient Text Vectorizer	Feb 18, 2023	Adversarial Text, Metric Learning	Code Available	2



No leaderboard results yet.