| Differentially Private Steering for Large Language Model Alignment | Jan 30, 2025 | HallucinationInference Attack | CodeCode Available | 0 | 5 |
| Investigating Membership Inference Attacks under Data Dependencies | Oct 23, 2020 | BIG-bench Machine LearningInference Attack | CodeCode Available | 0 | 5 |
| DUCK: Distance-based Unlearning via Centroid Kinematics | Dec 4, 2023 | Inference AttackMachine Unlearning | CodeCode Available | 0 | 5 |
| Differentially Private Integrated Decision Gradients (IDG-DP) for Radar-based Human Activity Recognition | Nov 4, 2024 | Activity RecognitionHuman Activity Recognition | CodeCode Available | 0 | 5 |
| Membership Inference Attacks Against Object Detection Models | Jan 12, 2020 | Inference AttackMembership Inference Attack | CodeCode Available | 0 | 5 |
| Apollo: A Posteriori Label-Only Membership Inference Attack Towards Machine Unlearning | Jun 11, 2025 | Inference AttackMachine Unlearning | CodeCode Available | 0 | 5 |
| Membership Inference Attacks against Machine Learning Models | Oct 18, 2016 | BIG-bench Machine LearningGeneral Classification | CodeCode Available | 0 | 5 |
| Automatic Calibration for Membership Inference Attack on Large Language Models | May 6, 2025 | Inference AttackMembership Inference Attack | CodeCode Available | 0 | 5 |
| Generated Distributions Are All You Need for Membership Inference Attacks Against Generative Models | Oct 30, 2023 | AllInference Attack | CodeCode Available | 0 | 5 |
| GLiRA: Black-Box Membership Inference Attack via Knowledge Distillation | May 13, 2024 | image-classificationImage Classification | CodeCode Available | 0 | 5 |