SOTAVerified|Agents Browse Leaderboard About

Model extraction

Model extraction attacks, aka model stealing attacks, are used to extract the parameters from the target model. Ideally, the adversary will be able to steal and replicate a model that will have a very similar performance to the target model.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 21–30 of 176 papers

Title	Date	Tasks	Status
Navigating the Deep: Signature Extraction on Deep Neural Networks	Jun 20, 2025	CryptanalysisModel extraction	—Unverified
Explore the vulnerability of black-box models via diffusion models	Jun 9, 2025	Image GenerationModel extraction	—Unverified
GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors	Jun 9, 2025	BenchmarkingModel extraction	—Unverified
MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models	Jun 3, 2025	Bilevel OptimizationData Augmentation	CodeCode Available
Evaluating Query Efficiency and Accuracy of Transfer Learning-based Model Extraction Attack in Federated Learning	May 25, 2025	Federated LearningModel extraction	—Unverified
On the interplay of Explainability, Privacy and Predictive Performance with Explanation-assisted Model Extraction	May 13, 2025	counterfactualModel extraction	—Unverified
Better Decisions through the Right Causal World Model	Apr 9, 2025	Causal InferenceModel extraction	—Unverified
CopyQNN: Quantum Neural Network Extraction Attack under Varying Quantum Noise	Apr 1, 2025	Model extractionTransfer Learning	—Unverified
ProDiF: Protecting Domain-Invariant Features to Secure Pre-Trained Models Against Extraction	Mar 17, 2025	Model extraction	—Unverified
A Survey of Model Extraction Attacks and Defenses in Distributed Computing Environments	Feb 22, 2025	Autonomous VehiclesDistributed Computing	—Unverified

Show:10 25 50

← PrevPage 3 of 18Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	three-step-original	Exact Match	0.17	—	Unverified