SOTAVerified

Model extraction

Model extraction attacks, also known as model stealing attacks, aim to extract the parameters of a target model, or to replicate its functionality, typically using only query access. Ideally, the adversary obtains a substitute model whose performance closely matches that of the target.
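The query-then-replicate workflow can be illustrated with a minimal, self-contained sketch. This is not the method of any specific paper listed below: the secret target weights, the query budget, and the choice of a perceptron surrogate are all illustrative assumptions. The adversary only calls the label oracle, then trains its own copy on the query transcript.

```python
import random

# Hypothetical black-box target: a secret linear classifier.
# The adversary never sees SECRET_W, only the labels it produces.
SECRET_W = [0.8, -0.5]

def query_target(x):
    """Label-only oracle access, the typical extraction threat model."""
    return 1 if sum(w * xi for w, xi in zip(SECRET_W, x)) > 0 else 0

random.seed(0)

# Step 1: sample probe inputs and collect the target's labels.
probes = [[random.uniform(-1, 1), random.uniform(-1, 1)] for _ in range(500)]
labels = [query_target(x) for x in probes]

# Step 2: fit a surrogate (here a simple perceptron) on the transcript.
w = [0.0, 0.0]
for _ in range(20):
    for x, y in zip(probes, labels):
        pred = 1 if sum(wi * xi for wi, xi in zip(w, x)) > 0 else 0
        if pred != y:  # perceptron update on mistakes only
            w = [wi + (y - pred) * xi for wi, xi in zip(w, x)]

# Step 3: measure agreement ("fidelity") with the target on fresh queries.
test = [[random.uniform(-1, 1), random.uniform(-1, 1)] for _ in range(200)]
agree = sum(
    (1 if sum(wi * xi for wi, xi in zip(w, x)) > 0 else 0) == query_target(x)
    for x in test
)
print(f"surrogate/target agreement: {agree / len(test):.0%}")
```

Because the surrogate is judged by agreement with the target rather than by ground-truth accuracy, high fidelity is achievable even when the adversary knows nothing about the target's training data distribution.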

Papers

Showing 150 of 176 papers

Title | Status | Hype
Entangled Threats: A Unified Kill Chain Model for Quantum Machine Learning Security | — | 0
CEGA: A Cost-Effective Approach for Graph-Based Model Extraction and Acquisition | Code | 0
Navigating the Deep: Signature Extraction on Deep Neural Networks | — | 0
Explore the vulnerability of black-box models via diffusion models | — | 0
GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors | — | 0
MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models | Code | 0
Evaluating Query Efficiency and Accuracy of Transfer Learning-based Model Extraction Attack in Federated Learning | — | 0
On the interplay of Explainability, Privacy and Predictive Performance with Explanation-assisted Model Extraction | — | 0
Better Decisions through the Right Causal World Model | — | 0
CopyQNN: Quantum Neural Network Extraction Attack under Varying Quantum Noise | — | 0
ATOM: A Framework of Detecting Query-Based Model Extraction Attacks for Graph Neural Networks | Code | 1
ProDiF: Protecting Domain-Invariant Features to Secure Pre-Trained Models Against Extraction | — | 0
A Survey of Model Extraction Attacks and Defenses in Distributed Computing Environments | — | 0
Differentially private fine-tuned NF-Net to predict GI cancer type | — | 0
From Counterfactuals to Trees: Competitive Analysis of Model Extraction Attacks | Code | 0
A Framework for Double-Blind Federated Adaptation of Foundation Models | — | 0
Safety at Scale: A Comprehensive Survey of Large Model Safety | Code | 3
Data-Free Model-Related Attacks: Unleashing the Potential of Generative AI | — | 0
FRAME: Forward Recursive Adaptive Model Extraction -- A Technique for Advance Feature Selection | — | 0
Neural Honeytrace: A Robust Plug-and-Play Watermarking Framework against Model Extraction Attacks | Code | 1
HoneypotNet: Backdoor Attacks Against Model Extraction | — | 0
Bounding-box Watermarking: Defense against Model Extraction Attacks on Object Detectors | — | 0
Few-shot Model Extraction Attacks against Sequential Recommender Systems | — | 0
A Hard-Label Cryptanalytic Extraction of Non-Fully Connected Deep Neural Networks using Side-Channel Attacks | Code | 0
Your Semantic-Independent Watermark is Fragile: A Semantic Perturbation Attack against EaaS Watermark | Code | 0
Robust and Minimally Invasive Watermarking for EaaS | Code | 0
Efficient Model Extraction via Boundary Sampling | — | 0
Efficient and Effective Model Extraction | Code | 0
CaBaGe: Data-Free Model Extraction using ClAss BAlanced Generator Ensemble | — | 0
Protecting Copyright of Medical Pre-trained Language Models: Training-Free Backdoor Model Watermarking | — | 0
"Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Code | 1
VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces | Code | 0
Enhancing TinyML Security: Study of Adversarial Attack Transferability | — | 0
QUEEN: Query Unlearning against Model Extraction | — | 0
Privacy Implications of Explainable AI in Data-Driven Systems | — | 0
Beyond Slow Signs in High-fidelity Model Extraction | Code | 0
GENIE: Watermarking Graph Neural Networks for Link Prediction | — | 0
Watermarking Counterfactual Explanations | Code | 0
Noisy Data Meets Privacy: Training Local Models with Post-Processed Remote Queries | — | 0
DeepNcode: Encoding-Based Protection against Bit-Flip Attacks on Neural Networks | — | 0
Model Reconstruction Using Counterfactual Explanations: A Perspective From Polytope Theory | Code | 0
Learnable Linguistic Watermarks for Tracing Model Extraction Attacks on Large Language Models | — | 0
Knowledge Distillation-Based Model Extraction Attack using GAN-based Private Counterfactual Explanations | Code | 0
QuantumLeak: Stealing Quantum Neural Networks from Cloud-based NISQ Machines | — | 0
Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data | Code | 0
Precise Extraction of Deep Learning Models via Side-Channel Attacks on Edge/Endpoint Devices | — | 0
WARDEN: Multi-Directional Backdoor Watermarks for Embedding-as-a-Service Copyright Protection | Code | 0
MEA-Defender: A Robust Watermark against Model Extraction Attack | Code | 1
Unraveling Attacks in Machine Learning-based IoT Ecosystems: A Survey and the Open Libraries Behind Them | — | 0
MEAOD: Model Extraction Attack against Object Detectors | — | 0
Page 1 of 4

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | three-step-original | Exact Match | 0.17 | — | Unverified