| MUSE: Machine Unlearning Six-Way Evaluation for Language Models | Jul 8, 2024 | Articles, Machine Unlearning | Code Available | 4 |
| SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders | Jan 29, 2025 | Adversarial Attack, Denoising | Code Available | 2 |
| Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench | Oct 29, 2024 | Language Modeling | Code Available | 2 |
| LLM Defenses Are Not Robust to Multi-Turn Human Jailbreaks Yet | Aug 27, 2024 | Language Modeling | Code Available | 2 |
| Machine Unlearning in Generative AI: A Survey | Jul 30, 2024 | Machine Unlearning, Survey | Code Available | 2 |
| RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models | Jun 16, 2024 | Adversarial Attack, Benchmarking | Code Available | 2 |
| Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models | May 24, 2024 | Image Generation, Machine Unlearning | Code Available | 2 |
| Machine Unlearning of Pre-trained Large Language Models | Feb 23, 2024 | Machine Unlearning | Code Available | 2 |
| UnlearnCanvas: Stylized Image Dataset for Enhanced Machine Unlearning Evaluation in Diffusion Models | Feb 19, 2024 | Image Generation, Machine Unlearning | Code Available | 2 |
| Detecting Pretraining Data from Large Language Models | Oct 25, 2023 | Machine Unlearning | Code Available | 2 |
| Machine Unlearning: Solutions and Challenges | Aug 14, 2023 | Machine Unlearning | Code Available | 2 |
| A Survey of Machine Unlearning | Sep 6, 2022 | Attribute, Machine Unlearning | Code Available | 2 |
| Rectifying Privacy and Efficacy Measurements in Machine Unlearning: A New Inference Attack Perspective | Jun 16, 2025 | Inference Attack, Machine Unlearning | Code Available | 1 |
| Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble | Jun 16, 2025 | Machine Unlearning | Code Available | 1 |
| Unlearning Isn't Invisible: Detecting Unlearning Traces in LLMs from Model Outputs | Jun 16, 2025 | Machine Unlearning | Code Available | 1 |
| Certified Unlearning for Neural Networks | Jun 8, 2025 | Machine Unlearning | Code Available | 1 |
| Rethinking Machine Unlearning in Image Generation Models | Jun 3, 2025 | Benchmarking, Image Generation | Code Available | 1 |
| Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs | May 22, 2025 | Diagnostic, Machine Unlearning | Code Available | 1 |
| "Alexa, can you forget me?" Machine Unlearning Benchmark in Spoken Language Understanding | May 21, 2025 | Machine Unlearning, Spoken Language Understanding | Code Available | 1 |
| A Survey on Unlearnable Data | Mar 30, 2025 | Machine Unlearning, Survey | Code Available | 1 |
| LoTUS: Large-Scale Machine Unlearning with a Taste of Uncertainty | Mar 24, 2025 | Machine Unlearning, Memorization | Code Available | 1 |
| Safety Mirage: How Spurious Correlations Undermine VLM Safety Fine-tuning | Mar 14, 2025 | Machine Unlearning | Code Available | 1 |
| Data Unlearning in Diffusion Models | Mar 2, 2025 | Machine Unlearning, Memorization | Code Available | 1 |
| Erasing Without Remembering: Implicit Knowledge Forgetting in Large Language Models | Feb 27, 2025 | Machine Unlearning | Code Available | 1 |
| MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Language Models | Feb 16, 2025 | Language Modeling | Code Available | 1 |
| Technical Report for the Forgotten-by-Design Project: Targeted Obfuscation for Machine Learning | Jan 20, 2025 | Inference Attack, Machine Unlearning | Code Available | 1 |
| Benchmarking Vision Language Model Unlearning via Fictitious Facial Identity Dataset | Nov 5, 2024 | Benchmarking, Language Modeling | Code Available | 1 |
| Identify Backdoored Model in Federated Learning via Individual Unlearning | Nov 1, 2024 | Anomaly Detection, Federated Learning | Code Available | 1 |
| CLEAR: Character Unlearning in Textual and Visual Modalities | Oct 23, 2024 | Machine Unlearning | Code Available | 1 |
| Scalability of memorization-based machine unlearning | Oct 21, 2024 | Machine Unlearning, Memorization | Code Available | 1 |
| Catastrophic Failure of LLM Unlearning via Quantization | Oct 21, 2024 | Machine Unlearning, Quantization | Code Available | 1 |
| Evaluating Deep Unlearning in Large Language Models | Oct 19, 2024 | Machine Unlearning | Code Available | 1 |
| A Closer Look at Machine Unlearning for Large Language Models | Oct 10, 2024 | Diversity, Machine Unlearning | Code Available | 1 |
| NegMerge: Consensual Weight Negation for Strong Machine Unlearning | Oct 8, 2024 | Image Classification | Code Available | 1 |
| Mitigating Memorization In Language Models | Oct 3, 2024 | Machine Unlearning, Memorization | Code Available | 1 |
| Unified Gradient-Based Machine Unlearning with Remain Geometry Enhancement | Sep 29, 2024 | Machine Unlearning | Code Available | 1 |
| An Adversarial Perspective on Machine Unlearning for AI Safety | Sep 26, 2024 | Machine Unlearning | Code Available | 1 |
| Alternate Preference Optimization for Unlearning Factual Knowledge in Large Language Models | Sep 20, 2024 | Machine Unlearning | Code Available | 1 |
| CURE4Rec: A Benchmark for Recommendation Unlearning with Deeper Influence | Aug 26, 2024 | Fairness, Machine Unlearning | Code Available | 1 |
| Towards Robust and Parameter-Efficient Knowledge Unlearning for LLMs | Aug 13, 2024 | Machine Unlearning, Memorization | Code Available | 1 |
| Preserving Privacy in Large Language Models: A Survey on Current Threats and Solutions | Aug 10, 2024 | Machine Unlearning | Code Available | 1 |
| Unlearning Targeted Information via Single Layer Unlearning Gradient | Jul 16, 2024 | Machine Unlearning | Code Available | 1 |
| Learning to Refuse: Towards Mitigating Privacy Risks in LLMs | Jul 14, 2024 | Machine Unlearning | Code Available | 1 |
| On Large Language Model Continual Unlearning | Jul 14, 2024 | Disentanglement, Language Modeling | Code Available | 1 |
| Composable Interventions for Language Models | Jul 9, 2024 | knowledge editing, Machine Unlearning | Code Available | 1 |
| Soft Prompting for Unlearning in Large Language Models | Jun 17, 2024 | In-Context Learning, Machine Unlearning | Code Available | 1 |
| What makes unlearning hard and what to do about it | Jun 3, 2024 | Machine Unlearning | Code Available | 1 |
| Automatic Jailbreaking of the Text-to-Image Generative AI Systems | May 26, 2024 | Image Generation, Information Retrieval | Code Available | 1 |
| Machine Unlearning in Large Language Models | May 24, 2024 | Machine Unlearning, TruthfulQA | Code Available | 1 |
| Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient | May 24, 2024 | Image Generation, Machine Unlearning | Code Available | 1 |