| Fast and Low-Cost Genomic Foundation Models via Outlier Removal | May 1, 2025 | Adversarial AttackAdversarial Robustness | CodeCode Available | 1 |
| OET: Optimization-based prompt injection Evaluation Toolkit | May 1, 2025 | Adversarial RobustnessNatural Language Understanding | CodeCode Available | 1 |
| Towards Robust LLMs: an Adversarial Robustness Measurement Framework | Apr 24, 2025 | Adversarial RobustnessComputational Efficiency | CodeCode Available | 0 |
| Multimodal Large Language Models for Enhanced Traffic Safety: A Comprehensive Review and Future Trends | Apr 21, 2025 | Adversarial RobustnessDecision Making | —Unverified | 0 |
| Fast Adversarial Training with Weak-to-Strong Spatial-Temporal Consistency in the Frequency Domain on Videos | Apr 21, 2025 | Adversarial RobustnessVideo Recognition | —Unverified | 0 |
| aiXamine: Simplified LLM Safety and Security | Apr 21, 2025 | 2kAdversarial Robustness | —Unverified | 0 |
| Hydra: An Agentic Reasoning Approach for Enhancing Adversarial Robustness and Mitigating Hallucinations in Vision-Language Models | Apr 19, 2025 | Adversarial AttackAdversarial Defense | —Unverified | 0 |
| RDI: An adversarial robustness evaluation metric for deep neural networks based on model statistical features | Apr 16, 2025 | Adversarial RobustnessComputational Efficiency | CodeCode Available | 0 |
| The Sword of Damocles in ViTs: Computational Redundancy Amplifies Adversarial Transferability | Apr 15, 2025 | Adversarial Robustness | —Unverified | 0 |
| R-TPT: Improving Adversarial Robustness of Vision-Language Models through Test-Time Prompt Tuning | Apr 15, 2025 | Adversarial Robustness | CodeCode Available | 1 |
| How to Enhance Downstream Adversarial Robustness (almost) without Touching the Pre-Trained Foundation Model? | Apr 15, 2025 | Adversarial RobustnessContrastive Learning | —Unverified | 0 |
| Beyond Worst-Case Online Classification: VC-Based Regret Bounds for Relaxed Benchmarks | Apr 14, 2025 | Adversarial RobustnessBinary Classification | —Unverified | 0 |
| Toward Spiking Neural Network Local Learning Modules Resistant to Adversarial Attacks | Apr 11, 2025 | Adversarial AttackAdversarial Robustness | —Unverified | 0 |
| Adversarial Examples in Environment Perception for Automated Driving (Review) | Apr 11, 2025 | Adversarial Robustness | —Unverified | 0 |
| Benchmarking Adversarial Robustness to Bias Elicitation in Large Language Models: Scalable Automated Assessment with LLM-as-a-Judge | Apr 10, 2025 | Adversarial RobustnessBenchmarking | CodeCode Available | 0 |
| Secure Diagnostics: Adversarial Robustness Meets Clinical Interpretability | Apr 7, 2025 | Adversarial AttackAdversarial Robustness | —Unverified | 0 |
| A Domain-Based Taxonomy of Jailbreak Vulnerabilities in Large Language Models | Apr 7, 2025 | Adversarial Robustness | —Unverified | 0 |
| Two is Better than One: Efficient Ensemble Defense for Robust and Compact Models | Apr 7, 2025 | Adversarial RobustnessDiversity | —Unverified | 0 |
| A Study on Adversarial Robustness of Discriminative Prototypical Learning | Apr 3, 2025 | Adversarial Robustness | CodeCode Available | 0 |
| Bridging the Theoretical Gap in Randomized Smoothing | Apr 3, 2025 | Adversarial Robustness | CodeCode Available | 0 |
| AdPO: Enhancing the Adversarial Robustness of Large Vision-Language Models with Preference Optimization | Apr 2, 2025 | Adversarial DefenseAdversarial Robustness | —Unverified | 0 |
| Robust Unsupervised Domain Adaptation for 3D Point Cloud Segmentation Under Source Adversarial Attacks | Apr 2, 2025 | Adversarial RobustnessDecoder | —Unverified | 0 |
| ATP: Adaptive Threshold Pruning for Efficient Data Encoding in Quantum Neural Networks | Mar 26, 2025 | Adversarial RobustnessComputational Efficiency | —Unverified | 0 |
| Lipschitz Constant Meets Condition Number: Learning Robust and Compact Deep Neural Networks | Mar 26, 2025 | Adversarial RobustnessNetwork Pruning | —Unverified | 0 |
| Feature Statistics with Uncertainty Help Adversarial Robustness | Mar 26, 2025 | Adversarial Robustness | CodeCode Available | 0 |
| Stop Walking in Circles! Bailing Out Early in Projected Gradient Descent | Mar 25, 2025 | Adversarial Robustness | —Unverified | 0 |
| Masks and Mimicry: Strategic Obfuscation and Impersonation Attacks on Authorship Verification | Mar 24, 2025 | Adversarial RobustnessAuthorship Verification | —Unverified | 0 |
| When is dataset cartography ineffective? Using training dynamics does not improve robustness against Adversarial SQuAD | Mar 24, 2025 | Adversarial RobustnessExtractive Question-Answering | —Unverified | 0 |
| Principal Eigenvalue Regularization for Improved Worst-Class Certified Robustness of Smoothed Classifiers | Mar 21, 2025 | Adversarial RobustnessFairness | —Unverified | 0 |
| Robustness of deep learning classification to adversarial input on GPUs: asynchronous parallel accumulation is a source of vulnerability | Mar 21, 2025 | Adversarial RobustnessBayesian Optimization | —Unverified | 0 |
| Narrowing Class-Wise Robustness Gaps in Adversarial Training | Mar 20, 2025 | Adversarial RobustnessClass-Specific Performance | —Unverified | 0 |
| On the Robustness Tradeoff in Fine-Tuning | Mar 19, 2025 | Adversarial Robustness | —Unverified | 0 |
| MMDT: Decoding the Trustworthiness and Safety of Multimodal Foundation Models | Mar 19, 2025 | Adversarial RobustnessAutonomous Driving | —Unverified | 0 |
| Unveiling the Role of Randomization in Multiclass Adversarial Classification: Insights from Graph Theory | Mar 18, 2025 | Adversarial RobustnessBinary Classification | —Unverified | 0 |
| Survey of Adversarial Robustness in Multimodal Large Language Models | Mar 18, 2025 | Adversarial RobustnessSurvey | —Unverified | 0 |
| Evolution-based Region Adversarial Prompt Learning for Robustness Enhancement in Vision-Language Models | Mar 17, 2025 | Adversarial RobustnessPrompt Learning | CodeCode Available | 0 |
| Robust Dataset Distillation by Matching Adversarial Trajectories | Mar 15, 2025 | Adversarial RobustnessDataset Distillation | —Unverified | 0 |
| Robustness Tokens: Towards Adversarial Robustness of Transformers | Mar 13, 2025 | Adversarial Robustness | CodeCode Available | 0 |
| Quantitative Analysis of Deeply Quantized Tiny Neural Networks Robust to Adversarial Attacks | Mar 12, 2025 | Adversarial RobustnessQuantization | —Unverified | 0 |
| FairDeFace: Evaluating the Fairness and Adversarial Robustness of Face Obfuscation Methods | Mar 11, 2025 | Adversarial RobustnessFace Detection | CodeCode Available | 0 |
| Life-Cycle Routing Vulnerabilities of LLM Router | Mar 9, 2025 | Adversarial Robustness | —Unverified | 0 |
| MMARD: Improving the Min-Max Optimization Process in Adversarial Robustness Distillation | Mar 9, 2025 | Adversarial Robustness | —Unverified | 0 |
| Long-tailed Adversarial Training with Self-Distillation | Mar 9, 2025 | Adversarial Robustness | —Unverified | 0 |
| Exploring Adversarial Transferability between Kolmogorov-arnold Networks | Mar 8, 2025 | Adversarial RobustnessKolmogorov-Arnold Networks | —Unverified | 0 |
| Adversarial Robustness of Discriminative Self-Supervised Learning in Vision | Mar 8, 2025 | Adversarial RobustnessLinear evaluation | —Unverified | 0 |
| CLIP is Strong Enough to Fight Back: Test-time Counterattacks towards Zero-shot Adversarial Robustness of CLIP | Mar 5, 2025 | Adversarial RobustnessImage-text matching | CodeCode Available | 1 |
| TAET: Two-Stage Adversarial Equalization Training on Long-Tailed Distributions | Mar 2, 2025 | Adversarial RobustnessComputational Efficiency | CodeCode Available | 1 |
| Transformer Meets Twicing: Harnessing Unattended Residual Information | Mar 2, 2025 | Adversarial Robustnessimage-classification | CodeCode Available | 0 |
| Adversarial Robustness in Parameter-Space Classifiers | Feb 27, 2025 | Adversarial Robustness | —Unverified | 0 |
| Evaluation of Hate Speech Detection Using Large Language Models and Geographical Contextualization | Feb 26, 2025 | Adversarial RobustnessBinary Classification | CodeCode Available | 0 |