SOTAVerified

Model extraction

Model extraction attacks, also known as model stealing attacks, aim to extract the parameters of a target model. Ideally, the adversary obtains a replica whose performance closely matches that of the target model.
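The core loop of such an attack can be sketched in a few lines: query the target as a black box, record its labels, and fit a local surrogate on the stolen input/label pairs. The sketch below is illustrative only (the linear victim, perceptron surrogate, and query budget are assumptions, not taken from any paper listed here):

```python
import numpy as np

rng = np.random.default_rng(0)

# "Victim" model: a linear classifier the adversary can only query as a
# black box. Its weight vector is the secret the attack tries to replicate.
w_target = rng.normal(size=10)

def query_target(X):
    """Black-box prediction API: returns hard labels only."""
    return (X @ w_target > 0).astype(int)

# Adversary: draw synthetic queries, collect the target's labels, then fit
# a local surrogate (here, a simple perceptron) on the stolen pairs.
queries = rng.normal(size=(5000, 10))
labels = query_target(queries)

w_surrogate = np.zeros(10)
for _ in range(20):                      # a few perceptron epochs
    for x, yi in zip(queries, labels):
        pred = int(x @ w_surrogate > 0)
        w_surrogate += (yi - pred) * x   # standard perceptron update

# Fidelity: how often the surrogate agrees with the target on fresh inputs.
X_test = rng.normal(size=(2000, 10))
agree = ((X_test @ w_surrogate > 0).astype(int) == query_target(X_test)).mean()
print(f"surrogate/target agreement: {agree:.2f}")
```

Real attacks against neural networks follow the same pattern with larger query budgets and learned surrogates; the "fidelity" measured at the end is the agreement metric several of the papers below optimize for.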

Papers

Showing 121–130 of 176 papers

Title | Status | Hype
High Accuracy and High Fidelity Extraction of Neural Networks | | 0
HODA: Protecting DNNs Against Model Extraction Attacks via Hardness of Samples | | 0
HoneypotNet: Backdoor Attacks Against Model Extraction | | 0
Increasing the Cost of Model Extraction with Calibrated Proof of Work | | 0
Interpretability via Model Extraction | | 0
Interpreting Blackbox Models via Model Extraction | | 0
Killing One Bird with Two Stones: Model Extraction and Attribute Inference Attacks against BERT-based APIs | | 0
Noisy Data Meets Privacy: Training Local Models with Post-Processed Remote Queries | | 0
Learnable Linguistic Watermarks for Tracing Model Extraction Attacks on Large Language Models | | 0
Leveraging Extracted Model Adversaries for Improved Black Box Attacks | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | three-step-original | Exact Match | 0.17 | | Unverified