SOTAVerified

Model extraction

Model extraction attacks, also known as model stealing attacks, aim to recover the parameters (or an approximation of the functionality) of a target model by querying it. Ideally, the adversary replicates a model whose performance closely matches that of the target model.
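The query-and-replicate idea above can be illustrated with a minimal sketch. This is a toy example (not the method of any paper listed below), assuming a linear target model exposed only as a black-box prediction API: the adversary submits chosen inputs, collects the outputs, and fits a surrogate by least squares.

```python
import numpy as np

rng = np.random.default_rng(0)
w_true = rng.normal(size=5)  # secret parameters of the target model

def target_predict(X):
    """Black-box API: returns scores; the parameters stay hidden."""
    return X @ w_true

# Attack step 1: query the API on adversary-chosen inputs.
X_query = rng.normal(size=(100, 5))
y_query = target_predict(X_query)

# Attack step 2: fit a surrogate to the (input, output) pairs.
w_stolen, *_ = np.linalg.lstsq(X_query, y_query, rcond=None)

# With enough linearly independent queries, the surrogate's
# predictions agree with the target's on unseen inputs.
X_test = rng.normal(size=(20, 5))
gap = np.max(np.abs(X_test @ w_stolen - target_predict(X_test)))
print(gap < 1e-8)
```

Real attacks face nonlinear targets, label-only outputs, and query budgets, which is what the papers below address; the sketch only shows the basic extraction loop.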

Papers

Showing 141–150 of 176 papers

| Title | Status | Hype |
|---|---|---|
| Weighted Automata Extraction and Explanation of Recurrent Neural Networks for Natural Language Tasks | Code | 0 |
| FLuID: Mitigating Stragglers in Federated Learning using Invariant Dropout | Code | 0 |
| Stateful Detection of Model Extraction Attacks | Code | 0 |
| Protecting Intellectual Property of Language Generation APIs with Lexical Watermark | Code | 0 |
| From Counterfactuals to Trees: Competitive Analysis of Model Extraction Attacks | Code | 0 |
| Model Extraction Attacks on Graph Neural Networks: Taxonomy and Realization | Code | 0 |
| ACTIVETHIEF: Model Extraction Using Active Learning and Unannotated Public Data | Code | 0 |
| Watermarking Counterfactual Explanations | Code | 0 |
| Model extraction from counterfactual explanations | Code | 0 |
| GUIDO: A Hybrid Approach to Guideline Discovery & Ordering from Natural Language Texts | Code | 0 |
Page 15 of 18

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | three-step-original | Exact Match | 0.17 | | Unverified |