SOTAVerified

Adversarial Robustness

Adversarial Robustness evaluates the vulnerabilities of machine learning models under various types of adversarial attacks.

Papers

Showing 351400 of 1746 papers

TitleStatusHype
Erasing Concepts, Steering Generations: A Comprehensive Survey of Concept Suppression0
Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains0
Curvature Dynamic Black-box Attack: revisiting adversarial robustness via dynamic curvature estimation0
Enhancing Adversarial Robustness of Vision Language Models via Adversarial Mixture Prompt Tuning0
Experimental robustness benchmark of quantum neural network on a superconducting quantum processor0
Beyond Classification: Evaluating Diffusion Denoised Smoothing for Security-Utility Trade off0
Few-Shot Adversarial Low-Rank Fine-Tuning of Vision-Language Models0
Lessons from Defending Gemini Against Indirect Prompt Injections0
Recommender Systems for Democracy: Toward Adversarial Robustness in Voting Advice Applications0
Adversarial Robustness for Unified Multi-Modal Encoders via Efficient Calibration0
CARES: Comprehensive Evaluation of Safety and Adversarial Robustness in Medical LLMs0
Adversarially Robust Spiking Neural Networks with Sparse Connectivity0
Evaluating the Robustness of Adversarial Defenses in Malware Detection SystemsCode0
Dynamical Low-Rank Compression of Neural Networks with Robustness under Adversarial Attacks0
Unpacking Robustness in Inflectional Languages: Adversarial Evaluation and Mechanistic Insights0
ALMA: Aggregated Lipschitz Maximization Attack on Auto-encoders0
Adversarial Robustness Analysis of Vision-Language Models in Medical Image SegmentationCode0
Adversarial Robustness of Deep Learning Models for Inland Water Body Segmentation from SAR ImagesCode0
Quantum Support Vector Regression for Robust Anomaly Detection0
Towards Robust LLMs: an Adversarial Robustness Measurement FrameworkCode0
aiXamine: Simplified LLM Safety and Security0
Fast Adversarial Training with Weak-to-Strong Spatial-Temporal Consistency in the Frequency Domain on Videos0
Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends0
Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models0
RDI: An adversarial robustness evaluation metric for deep neural networks based on model statistical featuresCode0
How to Enhance Downstream Adversarial Robustness (almost) without Touching the Pre-Trained Foundation Model?0
The Sword of Damocles in ViTs: Computational Redundancy Amplifies Adversarial Transferability0
Beyond Worst-Case Online Classification: VC-Based Regret Bounds for Relaxed Benchmarks0
Adversarial Examples in Environment Perception for Automated Driving (Review)0
Toward Spiking Neural Network Local Learning Modules Resistant to Adversarial Attacks0
Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-JudgeCode0
A Domain-Based Taxonomy of Jailbreak Vulnerabilities in Large Language Models0
Secure Diagnostics: Adversarial Robustness Meets Clinical Interpretability0
Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models0
A Study on Adversarial Robustness of Discriminative Prototypical LearningCode0
Bridging the Theoretical Gap in Randomized SmoothingCode0
AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization0
Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks0
Lipschitz Constant Meets Condition Number: Learning Robust and Compact Deep Neural Networks0
Feature Statistics with Uncertainty Help Adversarial RobustnessCode0
ATP: Adaptive Threshold Pruning for Efficient Data Encoding in Quantum Neural Networks0
Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent0
Masks and Mimicry: Strategic Obfuscation and Impersonation Attacks on Authorship Verification0
When is dataset cartography ineffective? Using training dynamics does not improve robustness against Adversarial SQuAD0
Robustness of deep learning classification to adversarial input on GPUs: asynchronous parallel accumulation is a source of vulnerability0
Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers0
Narrowing Class-Wise Robustness Gaps in Adversarial Training0
On the Robustness Tradeoff in Fine-Tuning0
MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models0
Survey of Adversarial Robustness in Multimodal Large Language Models0
Show:102550
← PrevPage 8 of 35Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1DeBERTa (single model)Accuracy0.61Unverified
2ALBERT (single model)Accuracy0.59Unverified
3T5 (single model)Accuracy0.57Unverified
4SMART_RoBERTa (single model)Accuracy0.54Unverified
5FreeLB (single model)Accuracy0.5Unverified
6RoBERTa (single model)Accuracy0.5Unverified
7InfoBERT (single model)Accuracy0.46Unverified
8ELECTRA (single model)Accuracy0.42Unverified
9BERT (single model)Accuracy0.34Unverified
10SMART_BERT (single model)Accuracy0.3Unverified
#ModelMetricClaimedVerifiedStatus
1Mixed classifierAccuracy95.23Unverified
2Stochastic-LWTA/PGD/WideResNet-34-10Accuracy92.26Unverified
3Stochastic-LWTA/PGD/WideResNet-34-5Accuracy91.88Unverified
4GLOT-DRAccuracy84.13Unverified
5TRADES-ANCRA/ResNet18Accuracy81.7Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-50 (SGD, Cosine)Accuracy77.4Unverified
2ResNet-50 (SGD, Step)Accuracy76.9Unverified
3DeiT-S (AdamW, Cosine)Accuracy76.8Unverified
4ResNet-50 (AdamW, Cosine)Accuracy76.4Unverified
#ModelMetricClaimedVerifiedStatus
1DeiT-S (AdamW, Cosine)Accuracy12.2Unverified
2ResNet-50 (SGD, Cosine)Accuracy3.3Unverified
3ResNet-50 (SGD, Step)Accuracy3.2Unverified
4ResNet-50 (AdamW, Cosine)Accuracy3.1Unverified
#ModelMetricClaimedVerifiedStatus
1ResNet-50 (AdamW, Cosine)mean Corruption Error (mCE)59.3Unverified
2ResNet-50 (SGD, Step)mean Corruption Error (mCE)57.9Unverified
3ResNet-50 (SGD, Cosine)mean Corruption Error (mCE)56.9Unverified
4DeiT-S (AdamW, Cosine)mean Corruption Error (mCE)48Unverified
#ModelMetricClaimedVerifiedStatus
1DeiT-S (AdamW, Cosine)Accuracy13Unverified
2ResNet-50 (SGD, Cosine)Accuracy8.4Unverified
3ResNet-50 (SGD, Step)Accuracy8.3Unverified
4ResNet-50 (AdamW, Cosine)Accuracy8.1Unverified
#ModelMetricClaimedVerifiedStatus
1Mixed ClassifierClean Accuracy85.21Unverified
2ResNet18/MART-ANCRAClean Accuracy60.1Unverified