SOTAVerified

Adversarial Robustness

Adversarial Robustness evaluates the vulnerabilities of machine learning models under various types of adversarial attacks.

Papers

Showing 401450 of 1746 papers

TitleStatusHype
Survey of Adversarial Robustness in Multimodal Large Language Models0
Evolution-based Region Adversarial Prompt Learning for Robustness Enhancement in Vision-Language ModelsCode0
Robust Dataset Distillation by Matching Adversarial Trajectories0
Robustness Tokens: Towards Adversarial Robustness of TransformersCode0
Quantitative Analysis of Deeply Quantized Tiny Neural Networks Robust to Adversarial Attacks0
FairDeFace: Evaluating the Fairness and Adversarial Robustness of Face Obfuscation MethodsCode0
MMARD: Improving the Min-Max Optimization Process in Adversarial Robustness Distillation0
Long-tailed Adversarial Training with Self-Distillation0
Life-Cycle Routing Vulnerabilities of LLM Router0
Exploring Adversarial Transferability between Kolmogorov-arnold Networks0
Adversarial Robustness of Discriminative Self-Supervised Learning in Vision0
Transformer Meets Twicing: Harnessing Unattended Residual InformationCode0
Adversarial Robustness in Parameter-Space Classifiers0
Evaluation of Hate Speech Detection Using Large Language Models and Geographical ContextualizationCode0
Improved Diffusion-based Generative Model with Better Adversarial RobustnessCode0
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences0
Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear InterpolationCode0
Probabilistic Robustness in Deep Learning: A Concise yet Comprehensive Guide0
Generalization Certificates for Adversarially Robust Bayesian Linear Regression0
Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives0
Adversarial Alignment for LLMs Requires Simpler, Reproducible, and More Measurable Objectives0
On the Promise for Assurance of Differentiable Neurosymbolic Reasoning Paradigms0
General Coded Computing: Adversarial Settings0
RoMA: Robust Malware Attribution via Byte-level Adversarial Training with Global Perturbations and Adversarial Consistency Regularization0
A Survey on Explainable Deep Reinforcement Learning0
Confidence Elicitation: A New Attack Vector for Large Language ModelsCode0
Adversarially-Robust TD Learning with Markovian Data: Finite-Time Rates and Fundamental Limits0
Hierarchical Contextual Manifold Alignment for Structuring Latent Representations in Large Language Models0
Improving Adversarial Robustness via Phase and Amplitude-aware Prompting0
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach0
Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks0
Adversarial Robustness in Two-Stage Learning-to-Defer: Algorithms and Guarantees0
Boosting Adversarial Robustness and Generalization with Structural Prior0
SecPE: Secure Prompt Ensembling for Private and Robust Large Language Models0
Trading Inference-Time Compute for Adversarial Robustness0
CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius MaximizationCode0
Topological Signatures of Adversaries in Multimodal Alignments0
Adversarial Masked Autoencoder Purifier with Defense Transferability0
Adversarial Robustness of Partitioned Quantum Classifiers0
Adversarially Robust Bloom Filters: Privacy, Reductions, and Open Problems0
VideoPure: Diffusion-based Adversarial Purification for Video RecognitionCode0
Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning0
A Note on Implementation Errors in Recent Adaptive Attacks Against Multi-Resolution Self-Ensembles0
Defending against Adversarial Malware Attacks on ML-based Android Malware Detection Systems0
Framework for Progressive Knowledge Fusion in Large Language Models Through Structured Conceptual Redundancy Analysis0
A margin-based replacement for cross-entropy loss0
With Great Backbones Comes Great Adversarial Transferability0
Double Visual Defense: Adversarial Pre-training and Instruction Tuning for Improving Vision-Language Model Robustness0
I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution0
An Empirical Study of Accuracy-Robustness Tradeoff and Training Efficiency in Self-Supervised LearningCode0
Show:102550
← PrevPage 9 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeBERTa (single model)Accuracy0.61Unverified
2ALBERT (single model)Accuracy0.59Unverified
3T5 (single model)Accuracy0.57Unverified
4SMART_RoBERTa (single model)Accuracy0.54Unverified
5FreeLB (single model)Accuracy0.5Unverified
6RoBERTa (single model)Accuracy0.5Unverified
7InfoBERT (single model)Accuracy0.46Unverified
8ELECTRA (single model)Accuracy0.42Unverified
9BERT (single model)Accuracy0.34Unverified
10SMART_BERT (single model)Accuracy0.3Unverified
#ModelMetricClaimedVerifiedStatus
1Mixed classifierAccuracy95.23Unverified
2Stochastic-LWTA/PGD/WideResNet-34-10Accuracy92.26Unverified
3Stochastic-LWTA/PGD/WideResNet-34-5Accuracy91.88Unverified
4GLOT-DRAccuracy84.13Unverified
5TRADES-ANCRA/ResNet18Accuracy81.7Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-50 (SGD, Cosine)Accuracy77.4Unverified
2ResNet-50 (SGD, Step)Accuracy76.9Unverified
3DeiT-S (AdamW, Cosine)Accuracy76.8Unverified
4ResNet-50 (AdamW, Cosine)Accuracy76.4Unverified
#ModelMetricClaimedVerifiedStatus
1DeiT-S (AdamW, Cosine)Accuracy12.2Unverified
2ResNet-50 (SGD, Cosine)Accuracy3.3Unverified
3ResNet-50 (SGD, Step)Accuracy3.2Unverified
4ResNet-50 (AdamW, Cosine)Accuracy3.1Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-50 (AdamW, Cosine)mean Corruption Error (mCE)59.3Unverified
2ResNet-50 (SGD, Step)mean Corruption Error (mCE)57.9Unverified
3ResNet-50 (SGD, Cosine)mean Corruption Error (mCE)56.9Unverified
4DeiT-S (AdamW, Cosine)mean Corruption Error (mCE)48Unverified
#ModelMetricClaimedVerifiedStatus
1DeiT-S (AdamW, Cosine)Accuracy13Unverified
2ResNet-50 (SGD, Cosine)Accuracy8.4Unverified
3ResNet-50 (SGD, Step)Accuracy8.3Unverified
4ResNet-50 (AdamW, Cosine)Accuracy8.1Unverified
#ModelMetricClaimedVerifiedStatus
1Mixed ClassifierClean Accuracy85.21Unverified
2ResNet18/MART-ANCRAClean Accuracy60.1Unverified