Model extraction

Model extraction attacks, also known as model stealing attacks, aim to extract the parameters of a target model. Ideally, the adversary ends up with a replica whose performance closely matches that of the target model.
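
As a rough illustration of the idea, the sketch below assumes the simplest possible target: a linear regression model exposed only through a prediction interface. Under that assumption, querying the origin and each unit vector recovers the bias and weights exactly. The names `make_target` and `extract_linear`, along with the secret weights, are hypothetical and not taken from any paper listed here; attacks on real neural networks generally need far more queries and yield only an approximate surrogate.

```python
from typing import Callable, List, Tuple

Predictor = Callable[[List[float]], float]


def make_target(weights: List[float], bias: float) -> Predictor:
    """Hypothetical black-box model: callers see predictions, never parameters."""
    def predict(x: List[float]) -> float:
        return sum(w * xi for w, xi in zip(weights, x)) + bias
    return predict


def extract_linear(predict: Predictor, dim: int) -> Tuple[List[float], float]:
    """Recover (weights, bias) of a linear model using dim + 1 queries."""
    # Querying the origin returns the bias alone.
    bias = predict([0.0] * dim)
    weights = []
    for i in range(dim):
        # Querying the i-th unit vector returns weights[i] + bias.
        unit = [0.0] * dim
        unit[i] = 1.0
        weights.append(predict(unit) - bias)
    return weights, bias


# Assumed secret parameters, known only to the model owner.
secret_w, secret_b = [2.0, -1.5, 0.25], 0.5
target = make_target(secret_w, secret_b)

# The adversary only calls target(...) and rebuilds a surrogate from the answers.
stolen_w, stolen_b = extract_linear(target, dim=3)
surrogate = make_target(stolen_w, stolen_b)

# Compare the surrogate with the target on an input that was never queried.
probe = [3.0, -2.0, 4.0]
gap = abs(target(probe) - surrogate(probe))
print("stolen weights:", stolen_w, "stolen bias:", stolen_b, "gap:", gap)
```

The query budget here is `dim + 1` because a linear model has exactly that many unknowns; this is what makes the toy case exact rather than approximate.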

Papers

Showing 126–150 of 176 papers

Title	Date	Tasks	Status
Interpreting Blackbox Models via Model Extraction	May 23, 2017	model, Model extraction	Unverified
Killing One Bird with Two Stones: Model Extraction and Attribute Inference Attacks against BERT-based APIs	May 23, 2021	Attribute, Inference Attack	Unverified
Noisy Data Meets Privacy: Training Local Models with Post-Processed Remote Queries	May 25, 2024	Knowledge Distillation, Model extraction	Unverified
Learnable Linguistic Watermarks for Tracing Model Extraction Attacks on Large Language Models	Apr 28, 2024	Model extraction	Unverified
Leveraging Extracted Model Adversaries for Improved Black Box Attacks	Oct 30, 2020	Model extraction, Question Answering	Unverified
Like an Open Book? Read Neural Network Architecture with Simple Power Analysis on 32-bit Microcontrollers	Nov 2, 2023	Model extraction	Unverified
MaskedNet: The First Hardware Inference Engine Aiming Power Side-Channel Protection	Oct 29, 2019	BIG-bench Machine Learning, Model extraction	Unverified
MEAOD: Model Extraction Attack against Object Detectors	Dec 22, 2023	Active Learning, model	Unverified
MEGEX: Data-Free Model Extraction Attack against Gradient-Based Explainable AI	Jul 19, 2021	Explainable artificial intelligence, Model extraction	Unverified
Mercury: An Automated Remote Side-channel Attack to Nvidia Deep Learning Accelerator	Aug 2, 2023	Model extraction	Unverified
Mitigating Query-Flooding Parameter Duplication Attack on Regression Models with High-Dimensional Gaussian Mechanism	Feb 6, 2020	Model extraction, regression	Unverified
Model Extraction and Adversarial Attacks on Neural Networks using Switching Power Information	Jun 15, 2021	Model extraction	Unverified
Robust and Minimally Invasive Watermarking for EaaS	Oct 23, 2024	Model extraction	Code Available
WARDEN: Multi-Directional Backdoor Watermarks for Embedding-as-a-Service Copyright Protection	Mar 3, 2024	Model extraction	Code Available
Process Extraction from Text: Benchmarking the State of the Art and Paving the Way for Future Challenges	Oct 7, 2021	Benchmarking, Model extraction	Code Available
Weighted Automata Extraction and Explanation of Recurrent Neural Networks for Natural Language Tasks	Jun 24, 2023	Data Augmentation, Model extraction	Code Available
FLuID: Mitigating Stragglers in Federated Learning using Invariant Dropout	Jul 5, 2023	Federated Learning, Model extraction	Code Available
Stateful Detection of Model Extraction Attacks	Jul 12, 2021	BIG-bench Machine Learning, model	Code Available
Protecting Intellectual Property of Language Generation APIs with Lexical Watermark	Dec 5, 2021	Document Summarization, Image Captioning	Code Available
From Counterfactuals to Trees: Competitive Analysis of Model Extraction Attacks	Feb 7, 2025	counterfactual, Model extraction	Code Available
Model Extraction Attacks on Graph Neural Networks: Taxonomy and Realization	Oct 24, 2020	Anomaly Detection, Model extraction	Code Available
ACTIVETHIEF: Model Extraction Using Active Learning and Unannotated Public Data	Feb 7, 2020	Active Learning, BIG-bench Machine Learning	Code Available
Watermarking Counterfactual Explanations	May 29, 2024	counterfactual, Explainable artificial intelligence	Code Available
Model extraction from counterfactual explanations	Sep 3, 2020	counterfactual, model	Code Available
GUIDO: A Hybrid Approach to Guideline Discovery & Ordering from Natural Language Texts	Jul 19, 2023	Dependency Parsing, Model extraction	Code Available

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	three-step-original	Exact Match	0.17	—	Unverified