SOTAVerified

Model extraction

Model extraction attacks, also known as model stealing attacks, aim to recover the parameters or functionality of a target model, typically by querying it. Ideally, the adversary obtains a replica whose performance closely matches that of the target model.
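As a rough illustration of the idea, the sketch below simulates a label-only extraction attack. The "target" is a hypothetical linear classifier hidden behind a prediction API (all names here are illustrative assumptions, not from any specific paper); the attacker queries it on random inputs and trains a surrogate on the stolen labels.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical target: a secret linear classifier behind a label-only API.
secret_w = np.array([1.5, -2.0, 0.5])

def target_api(X):
    """Black-box prediction API: returns only hard labels."""
    return (X @ secret_w > 0).astype(int)

# Attacker step 1: query the API on random inputs to build a labeled dataset.
X_query = rng.normal(size=(2000, 3))
y_query = target_api(X_query)

# Attacker step 2: train a surrogate (a simple perceptron) on the stolen labels.
w = np.zeros(3)
for _ in range(20):
    for x, y in zip(X_query, y_query):
        pred = int(x @ w > 0)
        w += (y - pred) * x  # perceptron update

# Fidelity: how often the surrogate agrees with the target on fresh inputs.
X_test = rng.normal(size=(1000, 3))
agreement = np.mean((X_test @ w > 0).astype(int) == target_api(X_test))
print(f"surrogate/target agreement: {agreement:.2f}")
```

Real attacks in the papers listed below replace the random queries and linear surrogate with far more sample-efficient strategies (active sampling, synthetic data, side channels), but the query-then-fit loop is the common core.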

Papers

Showing 151–160 of 176 papers

Title | Status | Hype
Stealing Machine Learning Models via Prediction APIs | Code | 0
Model Reconstruction Using Counterfactual Explanations: A Perspective From Polytope Theory | Code | 0
Stealing and Evading Malware Classifiers and Antivirus at Low False Positive Conditions | Code | 0
The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement Learning | Code | 0
Safe and Robust Watermark Injection with a Single OoD Image | Code | 0
VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces | Code | 0
Army of Thieves: Enhancing Black-Box Model Extraction via Ensemble based sample selection | Code | 0
Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data | Code | 0
Knowledge Distillation-Based Model Extraction Attack using GAN-based Private Counterfactual Explanations | Code | 0
A Hard-Label Cryptanalytic Extraction of Non-Fully Connected Deep Neural Networks using Side-Channel Attacks | Code | 0
Page 16 of 18

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | three-step-original | Exact Match | 0.17 | | Unverified