SOTAVerified

Model extraction

Model extraction attacks, also known as model stealing attacks, attempt to recover the parameters or functionality of a target model, typically by querying it. Ideally, the adversary ends up with a replica whose performance closely matches that of the target model.
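As a minimal sketch of the idea, the snippet below shows a query-based extraction attack in a toy setting: the adversary has only black-box label access to a hidden linear classifier, collects query-response pairs, and fits a surrogate that agrees with the target on most inputs. The setup (a linear target, random queries, a least-squares surrogate) is an illustrative assumption, not the method of any particular paper listed here.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hidden target model: the adversary cannot read these weights,
# only observe the labels returned by query_target().
W_secret = rng.normal(size=(5,))
b_secret = 0.3

def query_target(x):
    """Black-box prediction API exposed by the victim."""
    return int(x @ W_secret + b_secret > 0)

# Step 1: the adversary queries the target to build a labeled dataset.
X = rng.normal(size=(2000, 5))
y = np.array([query_target(x) for x in X])

# Step 2: fit a surrogate model on the stolen labels
# (least squares on {-1, +1} targets, with a bias column).
X_aug = np.hstack([X, np.ones((len(X), 1))])
w_sur, *_ = np.linalg.lstsq(X_aug, 2.0 * y - 1.0, rcond=None)

# Step 3: measure surrogate/target agreement on fresh inputs.
X_test = rng.normal(size=(1000, 5))
y_target = np.array([query_target(x) for x in X_test])
y_sur = (X_test @ w_sur[:-1] + w_sur[-1] > 0).astype(int)
agreement = (y_target == y_sur).mean()
print(f"surrogate/target agreement: {agreement:.2%}")
```

With enough queries the surrogate's decision boundary closely tracks the target's, which is why many of the defenses below (watermarking, query post-processing, encoding-based protection) focus on detecting or degrading exactly this kind of query access.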

Papers

Showing 51–60 of 176 papers

- GENIE: Watermarking Graph Neural Networks for Link Prediction (Hype: 0)
- Watermarking Counterfactual Explanations (Code available; Hype: 0)
- Noisy Data Meets Privacy: Training Local Models with Post-Processed Remote Queries (Hype: 0)
- DeepNcode: Encoding-Based Protection against Bit-Flip Attacks on Neural Networks (Hype: 0)
- Model Reconstruction Using Counterfactual Explanations: A Perspective From Polytope Theory (Code available; Hype: 0)
- Learnable Linguistic Watermarks for Tracing Model Extraction Attacks on Large Language Models (Hype: 0)
- Knowledge Distillation-Based Model Extraction Attack using GAN-based Private Counterfactual Explanations (Code available; Hype: 0)
- QuantumLeak: Stealing Quantum Neural Networks from Cloud-based NISQ Machines (Hype: 0)
- Not Just Change the Labels, Learn the Features: Watermarking Deep Neural Networks with Multi-View Data (Code available; Hype: 0)
- Precise Extraction of Deep Learning Models via Side-Channel Attacks on Edge/Endpoint Devices (Hype: 0)
Page 6 of 18

Benchmark Results

#  Model                Metric       Claimed  Verified  Status
1  three-step-original  Exact Match  0.17     –         Unverified