Model extraction

Model extraction attacks, also known as model stealing attacks, aim to recover the parameters of a target model. Ideally, the adversary ends up with a replica whose performance closely matches that of the target model.
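As a rough illustration of the idea, the sketch below extracts a toy linear model through queries alone. It is a minimal example under strong assumptions: the target is exactly linear, it returns raw real-valued outputs, and the attacker knows the input dimension. The names `make_target` and `extract_linear_model` are hypothetical and do not come from any paper listed here.

```python
import random


def make_target(weights, bias):
    """Return a black-box predictor; the attacker only ever sees its outputs."""
    def predict(x):
        return sum(w * xi for w, xi in zip(weights, x)) + bias
    return predict


def extract_linear_model(query, dim):
    """Recover weights and bias of a linear model with dim + 1 queries."""
    zero = [0.0] * dim
    bias = query(zero)                      # f(0) = b
    weights = []
    for i in range(dim):
        unit = zero.copy()
        unit[i] = 1.0
        weights.append(query(unit) - bias)  # f(e_i) - b = w_i
    return weights, bias


rng = random.Random(0)
secret_w = [rng.uniform(-1, 1) for _ in range(5)]  # hidden from the attacker
secret_b = rng.uniform(-1, 1)
target = make_target(secret_w, secret_b)

# The attacker interacts with the target only through queries.
stolen_w, stolen_b = extract_linear_model(target, dim=5)
surrogate = make_target(stolen_w, stolen_b)

# Compare target and surrogate on fresh inputs.
test_points = [[rng.uniform(-3, 3) for _ in range(5)] for _ in range(100)]
max_gap = max(abs(target(x) - surrogate(x)) for x in test_points)
print(f"max prediction gap: {max_gap:.2e}")
```

Real targets are nonlinear and often return only labels or truncated probabilities, so practical attacks typically train a surrogate on many query-response pairs instead of solving for the parameters directly.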

Papers

Showing 1–50 of 176 papers

Title	Date	Tasks	Status	Hype
Safety at Scale: A Comprehensive Survey of Large Model Safety	Feb 2, 2025	Autonomous Driving, Data Poisoning	Code Available	3
ATOM: A Framework of Detecting Query-Based Model Extraction Attacks for Graph Neural Networks	Mar 20, 2025	Model extraction	Code Available	1
Neural Honeytrace: A Robust Plug-and-Play Watermarking Framework against Model Extraction Attacks	Jan 16, 2025	Model extraction	Code Available	1
"Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation	Sep 4, 2024	Language Modeling, Language Modelling	Code Available	1
MEA-Defender: A Robust Watermark against Model Extraction Attack	Jan 26, 2024	Model extraction, Self-Supervised Learning	Code Available	1
Watermarking Vision-Language Pre-trained Models for Multi-modal Embedding as a Service	Nov 10, 2023	Model extraction	Code Available	1
Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark	May 17, 2023	Model extraction	Code Available	1
Protecting Language Generation Models via Invisible Watermarking	Feb 6, 2023	Model extraction, Text Generation	Code Available	1
FedRolex: Model-Heterogeneous Federated Learning with Rolling Sub-Model Extraction	Dec 3, 2022	Federated Learning, model	Code Available	1
Black-Box Attacks on Sequential Recommenders via Data-Free Model Extraction	Sep 1, 2021	Data Poisoning, Knowledge Distillation	Code Available	1
Model Extraction and Adversarial Transferability, Your BERT is Vulnerable!	Mar 18, 2021	Model extraction, text-classification	Code Available	1
MEME: Generating RNN Model Explanations via Model Extraction	Dec 13, 2020	Decision Making, model	Code Available	1
Data-Free Model Extraction	Nov 30, 2020	model, Model extraction	Code Available	1
Now You See Me (CME): Concept-based Model Extraction	Oct 25, 2020	Model extraction	Code Available	1
MEME: Generating RNN Model Explanations via Model Extraction	Oct 15, 2020	Decision Making, model	Code Available	1
MARLeME: A Multi-Agent Reinforcement Learning Model Extraction Library	Apr 16, 2020	Model extraction, Multi-agent Reinforcement Learning	Code Available	1
Cryptanalytic Extraction of Neural Network Models	Mar 10, 2020	Model extraction	Code Available	1
Entangled Watermarks as a Defense against Model Extraction	Feb 27, 2020	model	Code Available	1
Entangled Threats: A Unified Kill Chain Model for Quantum Machine Learning Security	Jul 11, 2025	Model extraction, Quantum Machine Learning	Unverified	0
CEGA: A Cost-Effective Approach for Graph-Based Model Extraction and Acquisition	Jun 21, 2025	Model extraction	Code Available	0
Navigating the Deep: Signature Extraction on Deep Neural Networks	Jun 20, 2025	Cryptanalysis, Model extraction	Unverified	0
GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors	Jun 9, 2025	Benchmarking, Model extraction	Unverified	0
Explore the vulnerability of black-box models via diffusion models	Jun 9, 2025	Image Generation, Model extraction	Unverified	0
MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models	Jun 3, 2025	Bilevel Optimization, Data Augmentation	Code Available	0
Evaluating Query Efficiency and Accuracy of Transfer Learning-based Model Extraction Attack in Federated Learning	May 25, 2025	Federated Learning, Model extraction	Unverified	0
On the interplay of Explainability, Privacy and Predictive Performance with Explanation-assisted Model Extraction	May 13, 2025	counterfactual, Model extraction	Unverified	0
Better Decisions through the Right Causal World Model	Apr 9, 2025	Causal Inference, Model extraction	Unverified	0
CopyQNN: Quantum Neural Network Extraction Attack under Varying Quantum Noise	Apr 1, 2025	Model extraction, Transfer Learning	Unverified	0
ProDiF: Protecting Domain-Invariant Features to Secure Pre-Trained Models Against Extraction	Mar 17, 2025	Model extraction	Unverified	0
A Survey of Model Extraction Attacks and Defenses in Distributed Computing Environments	Feb 22, 2025	Autonomous Vehicles, Distributed Computing	Unverified	0
Differentially private fine-tuned NF-Net to predict GI cancer type	Feb 17, 2025	Model extraction	Unverified	0
From Counterfactuals to Trees: Competitive Analysis of Model Extraction Attacks	Feb 7, 2025	counterfactual, Model extraction	Code Available	0
A Framework for Double-Blind Federated Adaptation of Foundation Models	Feb 3, 2025	Federated Learning, image-classification	Unverified	0
Data-Free Model-Related Attacks: Unleashing the Potential of Generative AI	Jan 28, 2025	Model extraction	Unverified	0
"FRAME: Forward Recursive Adaptive Model Extraction -- A Technique for Advance Feature Selection"	Jan 21, 2025	Computational Efficiency, feature selection	Unverified	0
HoneypotNet: Backdoor Attacks Against Model Extraction	Jan 2, 2025	Backdoor Attack, model	Unverified	0
Bounding-box Watermarking: Defense against Model Extraction Attacks on Object Detectors	Nov 20, 2024	Model extraction, object-detection	Unverified	0
Few-shot Model Extraction Attacks against Sequential Recommender Systems	Nov 18, 2024	Model extraction, Recommendation Systems	Unverified	0
A Hard-Label Cryptanalytic Extraction of Non-Fully Connected Deep Neural Networks using Side-Channel Attacks	Nov 15, 2024	Model extraction	Code Available	0
Your Semantic-Independent Watermark is Fragile: A Semantic Perturbation Attack against EaaS Watermark	Nov 14, 2024	Model extraction	Code Available	0
Robust and Minimally Invasive Watermarking for EaaS	Oct 23, 2024	Model extraction	Code Available	0
Efficient Model Extraction via Boundary Sampling	Oct 20, 2024	model, Model extraction	Unverified	0
Efficient and Effective Model Extraction	Sep 21, 2024	Benchmarking, model	Code Available	0
CaBaGe: Data-Free Model Extraction using ClAss BAlanced Generator Ensemble	Sep 16, 2024	Model extraction	Unverified	0
Protecting Copyright of Medical Pre-trained Language Models: Training-Free Backdoor Model Watermarking	Sep 14, 2024	Model extraction, Word Embeddings	Unverified	0
VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces	Aug 4, 2024	image-classification, Image Classification	Code Available	0
Enhancing TinyML Security: Study of Adversarial Attack Transferability	Jul 16, 2024	Adversarial Attack, Edge-computing	Unverified	0
QUEEN: Query Unlearning against Model Extraction	Jul 1, 2024	model, Model extraction	Unverified	0
Privacy Implications of Explainable AI in Data-Driven Systems	Jun 22, 2024	counterfactual, Decision Making	Unverified	0
Beyond Slow Signs in High-fidelity Model Extraction	Jun 14, 2024	Benchmarking, model	Code Available	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	three-step-original	Exact Match	0.17	—	Unverified