SOTAVerified

Adversarial Robustness

Adversarial Robustness evaluates the vulnerabilities of machine learning models under various types of adversarial attacks.

Papers

Showing 101125 of 1746 papers

TitleStatusHype
Improved Diffusion-based Generative Model with Better Adversarial RobustnessCode0
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences0
Towards Optimal Adversarial Robust Reinforcement Learning with Infinity Measurement ErrorCode1
Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear InterpolationCode0
Probabilistic Robustness in Deep Learning: A Concise yet Comprehensive Guide0
Generalization Certificates for Adversarially Robust Bayesian Linear Regression0
Adversarial Alignment for LLMs Requires Simpler, Reproducible, and More Measurable Objectives0
Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives0
On the Promise for Assurance of Differentiable Neurosymbolic Reasoning Paradigms0
General Coded Computing: Adversarial Settings0
RoMA: Robust Malware Attribution via Byte-level Adversarial Training with Global Perturbations and Adversarial Consistency Regularization0
A Survey on Explainable Deep Reinforcement Learning0
Adversarially-Robust TD Learning with Markovian Data: Finite-Time Rates and Fundamental Limits0
Confidence Elicitation: A New Attack Vector for Large Language ModelsCode0
Hierarchical Contextual Manifold Alignment for Structuring Latent Representations in Large Language Models0
Improving Adversarial Robustness via Phase and Amplitude-aware Prompting0
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach0
Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks0
Adversarial Reasoning at Jailbreaking TimeCode1
Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language ModelsCode1
Adversarial Robustness in Two-Stage Learning-to-Defer: Algorithms and Guarantees0
SecPE: Secure Prompt Ensembling for Private and Robust Large Language Models0
Boosting Adversarial Robustness and Generalization with Structural Prior0
Trading Inference-Time Compute for Adversarial Robustness0
Topological Signatures of Adversaries in Multimodal Alignments0
Show:102550
← PrevPage 5 of 70Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeBERTa (single model)Accuracy0.61Unverified
2ALBERT (single model)Accuracy0.59Unverified
3T5 (single model)Accuracy0.57Unverified
4SMART_RoBERTa (single model)Accuracy0.54Unverified
5FreeLB (single model)Accuracy0.5Unverified
6RoBERTa (single model)Accuracy0.5Unverified
7InfoBERT (single model)Accuracy0.46Unverified
8ELECTRA (single model)Accuracy0.42Unverified
9BERT (single model)Accuracy0.34Unverified
10SMART_BERT (single model)Accuracy0.3Unverified
#ModelMetricClaimedVerifiedStatus
1Mixed classifierAccuracy95.23Unverified
2Stochastic-LWTA/PGD/WideResNet-34-10Accuracy92.26Unverified
3Stochastic-LWTA/PGD/WideResNet-34-5Accuracy91.88Unverified
4GLOT-DRAccuracy84.13Unverified
5TRADES-ANCRA/ResNet18Accuracy81.7Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-50 (SGD, Cosine)Accuracy77.4Unverified
2ResNet-50 (SGD, Step)Accuracy76.9Unverified
3DeiT-S (AdamW, Cosine)Accuracy76.8Unverified
4ResNet-50 (AdamW, Cosine)Accuracy76.4Unverified
#ModelMetricClaimedVerifiedStatus
1DeiT-S (AdamW, Cosine)Accuracy12.2Unverified
2ResNet-50 (SGD, Cosine)Accuracy3.3Unverified
3ResNet-50 (SGD, Step)Accuracy3.2Unverified
4ResNet-50 (AdamW, Cosine)Accuracy3.1Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-50 (AdamW, Cosine)mean Corruption Error (mCE)59.3Unverified
2ResNet-50 (SGD, Step)mean Corruption Error (mCE)57.9Unverified
3ResNet-50 (SGD, Cosine)mean Corruption Error (mCE)56.9Unverified
4DeiT-S (AdamW, Cosine)mean Corruption Error (mCE)48Unverified
#ModelMetricClaimedVerifiedStatus
1DeiT-S (AdamW, Cosine)Accuracy13Unverified
2ResNet-50 (SGD, Cosine)Accuracy8.4Unverified
3ResNet-50 (SGD, Step)Accuracy8.3Unverified
4ResNet-50 (AdamW, Cosine)Accuracy8.1Unverified
#ModelMetricClaimedVerifiedStatus
1Mixed ClassifierClean Accuracy85.21Unverified
2ResNet18/MART-ANCRAClean Accuracy60.1Unverified