SOTAVerified

Adversarial Robustness

Adversarial Robustness evaluates the vulnerabilities of machine learning models under various types of adversarial attacks.

Papers

Showing 101150 of 1746 papers

TitleStatusHype
Improved Diffusion-based Generative Model with Better Adversarial RobustnessCode0
Emoti-Attack: Zero-Perturbation Adversarial Attacks on NLP Systems via Emoji Sequences0
Towards Optimal Adversarial Robust Reinforcement Learning with Infinity Measurement ErrorCode1
Mixup Model Merge: Enhancing Model Merging Performance through Randomized Linear InterpolationCode0
Probabilistic Robustness in Deep Learning: A Concise yet Comprehensive Guide0
Generalization Certificates for Adversarially Robust Bayesian Linear Regression0
Adversarial Alignment for LLMs Requires Simpler, Reproducible, and More Measurable Objectives0
Rethinking Audio-Visual Adversarial Vulnerability from Temporal and Modality Perspectives0
On the Promise for Assurance of Differentiable Neurosymbolic Reasoning Paradigms0
General Coded Computing: Adversarial Settings0
RoMA: Robust Malware Attribution via Byte-level Adversarial Training with Global Perturbations and Adversarial Consistency Regularization0
A Survey on Explainable Deep Reinforcement Learning0
Adversarially-Robust TD Learning with Markovian Data: Finite-Time Rates and Fundamental Limits0
Confidence Elicitation: A New Attack Vector for Large Language ModelsCode0
Hierarchical Contextual Manifold Alignment for Structuring Latent Representations in Large Language Models0
Improving Adversarial Robustness via Phase and Amplitude-aware Prompting0
Optimizing Robustness and Accuracy in Mixture of Experts: A Dual-Model Approach0
Uncertainty Quantification for Collaborative Object Detection Under Adversarial Attacks0
Adversarial Reasoning at Jailbreaking TimeCode1
Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language ModelsCode1
Adversarial Robustness in Two-Stage Learning-to-Defer: Algorithms and Guarantees0
SecPE: Secure Prompt Ensembling for Private and Robust Large Language Models0
Boosting Adversarial Robustness and Generalization with Structural Prior0
Trading Inference-Time Compute for Adversarial Robustness0
CAMP in the Odyssey: Provably Robust Reinforcement Learning with Certified Radius MaximizationCode0
Topological Signatures of Adversaries in Multimodal Alignments0
Adversarial Robustness of Partitioned Quantum Classifiers0
Adversarial Masked Autoencoder Purifier with Defense Transferability0
Adversarially Robust Bloom Filters: Privacy, Reductions, and Open Problems0
Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning0
VideoPure: Diffusion-based Adversarial Purification for Video RecognitionCode0
A Note on Implementation Errors in Recent Adaptive Attacks Against Multi-Resolution Self-Ensembles0
Framework for Progressive Knowledge Fusion in Large Language Models Through Structured Conceptual Redundancy Analysis0
Defending against Adversarial Malware Attacks on ML-based Android Malware Detection Systems0
A margin-based replacement for cross-entropy loss0
With Great Backbones Comes Great Adversarial Transferability0
Double Visual Defense: Adversarial Pre-training and Instruction Tuning for Improving Vision-Language Model Robustness0
An Adaptive Orthogonal Convolution Scheme for Efficient and Flexible CNN ArchitecturesCode1
I Can Find You in Seconds! Leveraging Large Language Models for Code Authorship Attribution0
An Empirical Study of Accuracy-Robustness Tradeoff and Training Efficiency in Self-Supervised LearningCode0
On the Adversarial Robustness of Benjamini Hochberg0
Tougher Text, Smarter Models: Raising the Bar for Adversarial Defence BenchmarksCode0
Towards Robust and Accurate Stability Estimation of Local Surrogate Models in Text-based Explainable AI0
Rerouting LLM Routers0
On the Zero-shot Adversarial Robustness of Vision-Language Models: A Truly Zero-shot and Training-free Approach0
Advancing Adversarial Robustness in GNeRFs: The IL2-NeRF AttackCode0
Rethinking the Adversarial Robustness of Multi-Exit Neural Networks in an Attack-Defense Game0
RobustBlack: Challenging Black-Box Adversarial Attacks on State-of-the-Art Defenses0
On Adversarial Robustness of Language Models in Transfer Learning0
A Robust Adversarial Ensemble with Causal (Feature Interaction) Interpretations for Image Classification0
Show:102550
← PrevPage 3 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeBERTa (single model)Accuracy0.61Unverified
2ALBERT (single model)Accuracy0.59Unverified
3T5 (single model)Accuracy0.57Unverified
4SMART_RoBERTa (single model)Accuracy0.54Unverified
5FreeLB (single model)Accuracy0.5Unverified
6RoBERTa (single model)Accuracy0.5Unverified
7InfoBERT (single model)Accuracy0.46Unverified
8ELECTRA (single model)Accuracy0.42Unverified
9BERT (single model)Accuracy0.34Unverified
10SMART_BERT (single model)Accuracy0.3Unverified
#ModelMetricClaimedVerifiedStatus
1Mixed classifierAccuracy95.23Unverified
2Stochastic-LWTA/PGD/WideResNet-34-10Accuracy92.26Unverified
3Stochastic-LWTA/PGD/WideResNet-34-5Accuracy91.88Unverified
4GLOT-DRAccuracy84.13Unverified
5TRADES-ANCRA/ResNet18Accuracy81.7Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-50 (SGD, Cosine)Accuracy77.4Unverified
2ResNet-50 (SGD, Step)Accuracy76.9Unverified
3DeiT-S (AdamW, Cosine)Accuracy76.8Unverified
4ResNet-50 (AdamW, Cosine)Accuracy76.4Unverified
#ModelMetricClaimedVerifiedStatus
1DeiT-S (AdamW, Cosine)Accuracy12.2Unverified
2ResNet-50 (SGD, Cosine)Accuracy3.3Unverified
3ResNet-50 (SGD, Step)Accuracy3.2Unverified
4ResNet-50 (AdamW, Cosine)Accuracy3.1Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-50 (AdamW, Cosine)mean Corruption Error (mCE)59.3Unverified
2ResNet-50 (SGD, Step)mean Corruption Error (mCE)57.9Unverified
3ResNet-50 (SGD, Cosine)mean Corruption Error (mCE)56.9Unverified
4DeiT-S (AdamW, Cosine)mean Corruption Error (mCE)48Unverified
#ModelMetricClaimedVerifiedStatus
1DeiT-S (AdamW, Cosine)Accuracy13Unverified
2ResNet-50 (SGD, Cosine)Accuracy8.4Unverified
3ResNet-50 (SGD, Step)Accuracy8.3Unverified
4ResNet-50 (AdamW, Cosine)Accuracy8.1Unverified
#ModelMetricClaimedVerifiedStatus
1Mixed ClassifierClean Accuracy85.21Unverified
2ResNet18/MART-ANCRAClean Accuracy60.1Unverified