Model extraction

Model extraction attacks, also known as model stealing attacks, aim to extract the parameters of a target model. Ideally, the adversary ends up with a replica whose performance closely matches that of the target.
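As a rough illustration of the idea, the sketch below extracts the parameters of a toy linear scorer through queries alone. Everything in it is an assumption made for illustration: the target `make_target`, the helper names, and the choice of a linear model are hypothetical and are not taken from any paper listed here. Attacks on real neural networks are far more involved.

```python
import random


def make_target(dim, seed=0):
    """Hypothetical black-box target: a linear scorer with hidden parameters."""
    rng = random.Random(seed)
    weights = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
    bias = rng.uniform(-1.0, 1.0)

    def query(x):
        # The adversary only ever sees this output, never weights or bias.
        return sum(w * xi for w, xi in zip(weights, x)) + bias

    return query, weights, bias


def extract_linear(query, dim):
    """Recover (weights, bias) of a linear target using dim + 1 queries."""
    bias = query([0.0] * dim)  # f(0) = b
    weights = []
    for i in range(dim):
        unit = [0.0] * dim
        unit[i] = 1.0
        weights.append(query(unit) - bias)  # f(e_i) - b = w_i
    return weights, bias


def make_surrogate(weights, bias):
    """Build the replica model from the stolen parameters."""
    return lambda x: sum(w * xi for w, xi in zip(weights, x)) + bias


# The true parameters are returned only so the result can be checked.
query, true_w, true_b = make_target(dim=4)
stolen_w, stolen_b = extract_linear(query, dim=4)
surrogate = make_surrogate(stolen_w, stolen_b)

# Compare target and replica on fresh random inputs.
rng = random.Random(1)
probes = [[rng.uniform(-1.0, 1.0) for _ in range(4)] for _ in range(100)]
max_gap = max(abs(query(x) - surrogate(x)) for x in probes)
print(f"largest disagreement on 100 probes: {max_gap:.2e}")
```

For a linear target, `dim + 1` well-chosen queries are enough to recover the parameters up to floating-point error. Nonlinear targets generally need many more queries and yield only an approximate replica.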

Papers

Showing 101–150 of 176 papers

Title	Date	Tasks	Status	Hype
On the Difficulty of Defending Self-Supervised Learning against Model Extraction	May 16, 2022	Model extraction, Self-Supervised Learning	Code Available	0
DualCF: Efficient Model Extraction Attack from Counterfactual Explanations	May 13, 2022	counterfactual, Counterfactual Explanation	Unverified	0
Stealing and Evading Malware Classifiers and Antivirus at Low False Positive Conditions	Apr 13, 2022	Active Learning, Malware Detection	Code Available	0
Split HE: Fast Secure Inference Combining Split Learning and Homomorphic Encryption	Feb 27, 2022	Model extraction	Unverified	0
On the Effectiveness of Dataset Watermarking in Adversarial Settings	Feb 25, 2022	Model extraction	Code Available	0
Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations	Feb 17, 2022	Contrastive Learning, Model extraction	Unverified	0
Increasing the Cost of Model Extraction with Calibrated Proof of Work	Jan 23, 2022	BIG-bench Machine Learning, Model extraction	Unverified	0
Protecting Intellectual Property of Language Generation APIs with Lexical Watermark	Dec 5, 2021	Document Summarization, Image Captioning	Code Available	0
Efficiently Learning One Hidden Layer ReLU Networks From Queries	Dec 1, 2021	Model extraction, PAC learning	Unverified	0
Efficiently Learning Any One Hidden Layer ReLU Network From Queries	Nov 8, 2021	Model extraction	Unverified	0
DeepSteal: Advanced Model Extractions Leveraging Efficient Weight Stealing in Memories	Nov 8, 2021	Model extraction	Unverified	0
Watermarking Graph Neural Networks based on Backdoor Attacks	Oct 21, 2021	Classification, Graph Classification	Unverified	0
Process Extraction from Text: Benchmarking the State of the Art and Paving the Way for Future Challenges	Oct 7, 2021	Benchmarking, Model extraction	Code Available	0
First to Possess His Statistics: Data-Free Model Extraction Attack on Tabular Data	Sep 30, 2021	Medical Diagnosis, Model extraction	Unverified	0
HODA: Protecting DNNs Against Model Extraction Attacks via Hardness of Samples	Sep 29, 2021	Model extraction	Unverified	0
A Novel Watermarking Framework for Ownership Verification of DNN Architectures	Sep 29, 2021	Model extraction, Neural Architecture Search	Unverified	0
NASPY: Automated Extraction of Automated Machine Learning Models	Sep 29, 2021	BIG-bench Machine Learning, Model extraction	Unverified	0
Was my Model Stolen? Feature Sharing for Robust and Transferable Watermarks	Sep 29, 2021	Model extraction	Unverified	0
Emerging AI Security Threats for Autonomous Cars -- Case Studies	Sep 10, 2021	Autonomous Vehicles, Model extraction	Unverified	0
Black-Box Attacks on Sequential Recommenders via Data-Free Model Extraction	Sep 1, 2021	Data Poisoning, Knowledge Distillation	Code Available	1
Student Surpasses Teacher: Imitation Attack for Black-Box NLP APIs	Aug 29, 2021	Domain Adaptation, Model extraction	Unverified	0
Power-Based Attacks on Spatial DNN Accelerators	Aug 28, 2021	Model extraction	Unverified	0
MEGEX: Data-Free Model Extraction Attack against Gradient-Based Explainable AI	Jul 19, 2021	Explainable artificial intelligence, Model extraction	Unverified	0
Stateful Detection of Model Extraction Attacks	Jul 12, 2021	BIG-bench Machine Learning, model	Code Available	0
HODA: Hardness-Oriented Detection of Model Extraction Attacks	Jun 21, 2021	model, Model extraction	Unverified	0
Model Extraction and Adversarial Attacks on Neural Networks using Switching Power Information	Jun 15, 2021	Model extraction	Unverified	0
Killing One Bird with Two Stones: Model Extraction and Attribute Inference Attacks against BERT-based APIs	May 23, 2021	Attribute, Inference Attack	Unverified	0
An Exact Poly-Time Membership-Queries Algorithm for Extraction a three-Layer ReLU Network	May 20, 2021	BIG-bench Machine Learning, Model extraction	Unverified	0
A Review of Confidentiality Threats Against Embedded Neural Network Models	May 4, 2021	Medical Diagnosis, Model extraction	Unverified	0
Good Artists Copy, Great Artists Steal: Model Extraction Attacks Against Image Translation Models	Apr 26, 2021	Generative Adversarial Network, image-classification	Unverified	0
Thief, Beware of What Get You There: Towards Understanding Model Extraction Attack	Apr 13, 2021	Deep Reinforcement Learning, Model extraction	Unverified	0
Using Python for Model Inference in Deep Learning	Apr 1, 2021	Deep Learning, model	Unverified	0
Model Extraction and Adversarial Transferability, Your BERT is Vulnerable!	Mar 18, 2021	Model extraction, text-classification	Code Available	1
BODAME: Bilevel Optimization for Defense Against Model Extraction	Mar 11, 2021	Bilevel Optimization, model	Unverified	0
Model Extraction and Defenses on Generative Adversarial Networks	Jan 6, 2021	model, Model extraction	Unverified	0
EXPLORING VULNERABILITIES OF BERT-BASED APIS	Jan 1, 2021	Attribute, Inference Attack	Unverified	0
Grey-box Extraction of Natural Language Models	Jan 1, 2021	Model extraction	Unverified	0
MEME: Generating RNN Model Explanations via Model Extraction	Dec 13, 2020	Decision Making, model	Code Available	1
Sparsity-driven Digital Terrain Model Extraction	Dec 7, 2020	model, Model extraction	Unverified	0
Data-Free Model Extraction	Nov 30, 2020	model, Model extraction	Code Available	1
A Knowledge Representation Approach to Automated Mathematical Modelling	Nov 12, 2020	Combinatorial Optimization, Model extraction	Unverified	0
Monitoring-based Differential Privacy Mechanism Against Query-Flooding Parameter Duplication Attack	Nov 1, 2020	Model extraction	Unverified	0
Leveraging Extracted Model Adversaries for Improved Black Box Attacks	Oct 30, 2020	Model extraction, Question Answering	Unverified	0
Now You See Me (CME): Concept-based Model Extraction	Oct 25, 2020	Model extraction	Code Available	1
Model Extraction Attacks on Graph Neural Networks: Taxonomy and Realization	Oct 24, 2020	Anomaly Detection, Model extraction	Code Available	0
MEME: Generating RNN Model Explanations via Model Extraction	Oct 15, 2020	Decision Making, model	Code Available	1
Model extraction from counterfactual explanations	Sep 3, 2020	counterfactual, model	Code Available	0
Stealing Deep Reinforcement Learning Models for Fun and Profit	Jun 9, 2020	Decision Making, Deep Reinforcement Learning	Unverified	0
MARLeME: A Multi-Agent Reinforcement Learning Model Extraction Library	Apr 16, 2020	Model extraction, Multi-agent Reinforcement Learning	Code Available	1
Cryptanalytic Extraction of Neural Network Models	Mar 10, 2020	Model extraction	Code Available	1

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	three-step-original	Exact Match	0.17	—	Unverified