SOTAVerified

Model extraction

Model extraction attacks, also known as model stealing attacks, aim to recover the parameters or functionality of a target model, typically by querying it through its prediction interface. Ideally, the adversary obtains a replica whose performance closely matches that of the target model.

Papers

Showing 1–10 of 176 papers

Title | Status | Hype
Entangled Threats: A Unified Kill Chain Model for Quantum Machine Learning Security | — | 0
CEGA: A Cost-Effective Approach for Graph-Based Model Extraction and Acquisition | Code | 0
Navigating the Deep: Signature Extraction on Deep Neural Networks | — | 0
Explore the vulnerability of black-box models via diffusion models | — | 0
GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors | — | 0
MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models | Code | 0
Evaluating Query Efficiency and Accuracy of Transfer Learning-based Model Extraction Attack in Federated Learning | — | 0
On the interplay of Explainability, Privacy and Predictive Performance with Explanation-assisted Model Extraction | — | 0
Better Decisions through the Right Causal World Model | — | 0
CopyQNN: Quantum Neural Network Extraction Attack under Varying Quantum Noise | — | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | three-step-original | Exact Match | 0.17 | — | Unverified