SOTAVerified

Model extraction

Model extraction attacks, also known as model stealing attacks, aim to recover the parameters or functionality of a target model through query access. Ideally, the adversary replicates a model whose performance closely matches that of the target model.
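As a minimal sketch of the idea: the adversary sends chosen inputs to the target's prediction API, collects the returned labels, and trains a surrogate on those query/label pairs. Everything below is illustrative (the target model, query budget, and surrogate architecture are assumptions, not taken from any listed paper).

```python
# Hedged sketch of black-box model extraction. The "target" here is a
# stand-in model; a real adversary only sees its query interface.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.tree import DecisionTreeClassifier

rng = np.random.default_rng(0)

# Target model (trained on private data the adversary never sees).
X_private = rng.normal(size=(500, 4))
y_private = (X_private[:, 0] + X_private[:, 1] > 0).astype(int)
target = LogisticRegression().fit(X_private, y_private)

def query_api(x):
    """All the adversary observes: predicted labels for chosen inputs."""
    return target.predict(x)

# Attack: label adversary-chosen queries via the API, fit a surrogate.
X_query = rng.normal(size=(2000, 4))
y_query = query_api(X_query)
surrogate = DecisionTreeClassifier(max_depth=8).fit(X_query, y_query)

# Fidelity: agreement between surrogate and target on fresh inputs.
X_test = rng.normal(size=(1000, 4))
fidelity = float((surrogate.predict(X_test) == query_api(X_test)).mean())
print(f"surrogate-target agreement: {fidelity:.2f}")
```

The metric printed at the end is *fidelity* (agreement with the target) rather than accuracy on the true labels; high-fidelity extraction is exactly what defenses such as query unlearning and watermarking (several papers in the list below) try to detect or prevent.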

Papers

Showing 31–40 of 176 papers

Title | Status | Hype
"Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation | Code | 1
VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces | Code | 0
Enhancing TinyML Security: Study of Adversarial Attack Transferability | | 0
QUEEN: Query Unlearning against Model Extraction | | 0
Privacy Implications of Explainable AI in Data-Driven Systems | | 0
Beyond Slow Signs in High-fidelity Model Extraction | Code | 0
GENIE: Watermarking Graph Neural Networks for Link Prediction | | 0
Watermarking Counterfactual Explanations | Code | 0
Noisy Data Meets Privacy: Training Local Models with Post-Processed Remote Queries | | 0
DeepNcode: Encoding-Based Protection against Bit-Flip Attacks on Neural Networks | | 0
Page 4 of 18

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | three-step-original | Exact Match | 0.17 | – | Unverified