SOTAVerified

Model extraction

Model extraction attacks, also known as model stealing attacks, aim to extract the parameters of a target model through query access alone. Ideally, the adversary replicates a surrogate model whose performance closely matches that of the target model.
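As a minimal sketch of the idea, the snippet below simulates the simplest setting: the target is a secret linear model the adversary can only query as a black box, and the attacker fits a surrogate of the same family to its query/response pairs. The secret parameters, query budget, and learning rate here are illustrative assumptions, not taken from any specific paper above.

```python
import random

# Hypothetical black-box target: a secret linear model the adversary
# cannot inspect. In a real attack this would be a remote prediction API.
SECRET_W = [2.0, -1.0]
SECRET_B = 0.5

def query_target(x):
    """Oracle access only: returns the target's output for input x."""
    return SECRET_W[0] * x[0] + SECRET_W[1] * x[1] + SECRET_B

# Step 1: the adversary samples inputs and records the oracle's answers.
random.seed(0)
X = [[random.uniform(-1, 1), random.uniform(-1, 1)] for _ in range(200)]
y = [query_target(x) for x in X]

# Step 2: train a surrogate on the query/response pairs by gradient
# descent on squared error -- no access to the target's weights needed.
w, b = [0.0, 0.0], 0.0
lr = 0.1
for _ in range(500):
    gw, gb = [0.0, 0.0], 0.0
    for x, t in zip(X, y):
        err = (w[0] * x[0] + w[1] * x[1] + b) - t
        gw[0] += err * x[0]
        gw[1] += err * x[1]
        gb += err
    n = len(X)
    w[0] -= lr * gw[0] / n
    w[1] -= lr * gw[1] / n
    b -= lr * gb / n

# The surrogate's parameters converge to the secret ones.
print(round(w[0], 2), round(w[1], 2), round(b, 2))  # → 2.0 -1.0 0.5
```

Real attacks target nonlinear models whose family is unknown, so the surrogate only approximates the target's behavior; many of the papers listed below study how to make such extraction query-efficient, or how to detect and watermark against it.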

Papers

Showing 101–150 of 176 papers

Title | Status | Hype
Stealing Deep Reinforcement Learning Models for Fun and Profit | - | 0
Thief, Beware of What Get You There: Towards Understanding Model Extraction Attack | - | 0
Three-dimensional planar model estimation using multi-constraint knowledge based on k-means and RANSAC | - | 0
Towards dialogue based, computer aided software requirements elicitation | - | 0
Towards Few-Call Model Stealing via Active Self-Paced Knowledge Distillation and Diffusion-Based Image Generation | - | 0
Towards Security Threats of Deep Learning Systems: A Survey | - | 0
Unraveling Attacks in Machine Learning-based IoT Ecosystems: A Survey and the Open Libraries Behind Them | - | 0
Using Python for Model Inference in Deep Learning | - | 0
Was my Model Stolen? Feature Sharing for Robust and Transferable Watermarks | - | 0
Watermarking Graph Neural Networks based on Backdoor Attacks | - | 0
Few-shot Model Extraction Attacks against Sequential Recommender Systems | - | 0
Fingerprinting Deep Neural Networks Globally via Universal Adversarial Perturbations | - | 0
First to Possess His Statistics: Data-Free Model Extraction Attack on Tabular Data | - | 0
FRAME: Forward Recursive Adaptive Model Extraction -- A Technique for Advance Feature Selection | - | 0
Fraternal Twins: Unifying Attacks on Machine Learning and Digital Watermarking | - | 0
GENIE: Watermarking Graph Neural Networks for Link Prediction | - | 0
Good Artists Copy, Great Artists Steal: Model Extraction Attacks Against Image Translation Models | - | 0
Grey-box Extraction of Natural Language Models | - | 0
GrOVe: Ownership Verification of Graph Neural Networks using Embeddings | - | 0
HODA: Hardness-Oriented Detection of Model Extraction Attacks | - | 0
High Accuracy and High Fidelity Extraction of Neural Networks | - | 0
HODA: Protecting DNNs Against Model Extraction Attacks via Hardness of Samples | - | 0
HoneypotNet: Backdoor Attacks Against Model Extraction | - | 0
Increasing the Cost of Model Extraction with Calibrated Proof of Work | - | 0
Interpretability via Model Extraction | - | 0
Interpreting Blackbox Models via Model Extraction | - | 0
Killing One Bird with Two Stones: Model Extraction and Attribute Inference Attacks against BERT-based APIs | - | 0
Noisy Data Meets Privacy: Training Local Models with Post-Processed Remote Queries | - | 0
Learnable Linguistic Watermarks for Tracing Model Extraction Attacks on Large Language Models | - | 0
Leveraging Extracted Model Adversaries for Improved Black Box Attacks | - | 0
Like an Open Book? Read Neural Network Architecture with Simple Power Analysis on 32-bit Microcontrollers | - | 0
MaskedNet: The First Hardware Inference Engine Aiming Power Side-Channel Protection | - | 0
MEAOD: Model Extraction Attack against Object Detectors | - | 0
MEGEX: Data-Free Model Extraction Attack against Gradient-Based Explainable AI | - | 0
Mercury: An Automated Remote Side-channel Attack to Nvidia Deep Learning Accelerator | - | 0
Mitigating Query-Flooding Parameter Duplication Attack on Regression Models with High-Dimensional Gaussian Mechanism | - | 0
Model Extraction and Adversarial Attacks on Neural Networks using Switching Power Information | - | 0
Robust and Minimally Invasive Watermarking for EaaS | Code | 0
WARDEN: Multi-Directional Backdoor Watermarks for Embedding-as-a-Service Copyright Protection | Code | 0
Process Extraction from Text: Benchmarking the State of the Art and Paving the Way for Future Challenges | Code | 0
Weighted Automata Extraction and Explanation of Recurrent Neural Networks for Natural Language Tasks | Code | 0
FLuID: Mitigating Stragglers in Federated Learning using Invariant Dropout | Code | 0
Stateful Detection of Model Extraction Attacks | Code | 0
Protecting Intellectual Property of Language Generation APIs with Lexical Watermark | Code | 0
From Counterfactuals to Trees: Competitive Analysis of Model Extraction Attacks | Code | 0
Model Extraction Attacks on Graph Neural Networks: Taxonomy and Realization | Code | 0
ACTIVETHIEF: Model Extraction Using Active Learning and Unannotated Public Data | Code | 0
Watermarking Counterfactual Explanations | Code | 0
Model extraction from counterfactual explanations | Code | 0
GUIDO: A Hybrid Approach to Guideline Discovery & Ordering from Natural Language Texts | Code | 0
Page 3 of 4

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | three-step-original | Exact Match | 0.17 | - | Unverified