Safety at Scale: A Comprehensive Survey of Large Model Safety Feb 2, 2025 Autonomous Driving Data Poisoning
Code Code Available 3ATOM: A Framework of Detecting Query-Based Model Extraction Attacks for Graph Neural Networks Mar 20, 2025 Model extraction
Code Code Available 1Neural Honeytrace: A Robust Plug-and-Play Watermarking Framework against Model Extraction Attacks Jan 16, 2025 Model extraction
Code Code Available 1"Yes, My LoRD." Guiding Language Model Extraction with Locality Reinforced Distillation Sep 4, 2024 Language Modeling Language Modelling
Code Code Available 1MEA-Defender: A Robust Watermark against Model Extraction Attack Jan 26, 2024 Model extraction Self-Supervised Learning
Code Code Available 1Watermarking Vision-Language Pre-trained Models for Multi-modal Embedding as a Service Nov 10, 2023 Model extraction
Code Code Available 1Are You Copying My Model? Protecting the Copyright of Large Language Models for EaaS via Backdoor Watermark May 17, 2023 Model extraction
Code Code Available 1Protecting Language Generation Models via Invisible Watermarking Feb 6, 2023 Model extraction Text Generation
Code Code Available 1FedRolex: Model-Heterogeneous Federated Learning with Rolling Sub-Model Extraction Dec 3, 2022 Federated Learning model
Code Code Available 1Black-Box Attacks on Sequential Recommenders via Data-Free Model Extraction Sep 1, 2021 Data Poisoning Knowledge Distillation
Code Code Available 1Model Extraction and Adversarial Transferability, Your BERT is Vulnerable! Mar 18, 2021 Model extraction text-classification
Code Code Available 1MEME: Generating RNN Model Explanations via Model Extraction Dec 13, 2020 Decision Making model
Code Code Available 1Data-Free Model Extraction Nov 30, 2020 model Model extraction
Code Code Available 1Now You See Me (CME): Concept-based Model Extraction Oct 25, 2020 Model extraction
Code Code Available 1MEME: Generating RNN Model Explanations via Model Extraction Oct 15, 2020 Decision Making model
Code Code Available 1MARLeME: A Multi-Agent Reinforcement Learning Model Extraction Library Apr 16, 2020 Model extraction Multi-agent Reinforcement Learning
Code Code Available 1Cryptanalytic Extraction of Neural Network Models Mar 10, 2020 Model extraction
Code Code Available 1Entangled Watermarks as a Defense against Model Extraction Feb 27, 2020 model
Code Code Available 1Entangled Threats: A Unified Kill Chain Model for Quantum Machine Learning Security Jul 11, 2025 Model extraction Quantum Machine Learning
— Unverified 0CEGA: A Cost-Effective Approach for Graph-Based Model Extraction and Acquisition Jun 21, 2025 Model extraction
Code Code Available 0Navigating the Deep: Signature Extraction on Deep Neural Networks Jun 20, 2025 Cryptanalysis Model extraction
— Unverified 0GradEscape: A Gradient-Based Evader Against AI-Generated Text Detectors Jun 9, 2025 Benchmarking Model extraction
— Unverified 0Explore the vulnerability of black-box models via diffusion models Jun 9, 2025 Image Generation Model extraction
— Unverified 0MISLEADER: Defending against Model Extraction with Ensembles of Distilled Models Jun 3, 2025 Bilevel Optimization Data Augmentation
Code Code Available 0Evaluating Query Efficiency and Accuracy of Transfer Learning-based Model Extraction Attack in Federated Learning May 25, 2025 Federated Learning Model extraction
— Unverified 0On the interplay of Explainability, Privacy and Predictive Performance with Explanation-assisted Model Extraction May 13, 2025 counterfactual Model extraction
— Unverified 0Better Decisions through the Right Causal World Model Apr 9, 2025 Causal Inference Model extraction
— Unverified 0CopyQNN: Quantum Neural Network Extraction Attack under Varying Quantum Noise Apr 1, 2025 Model extraction Transfer Learning
— Unverified 0ProDiF: Protecting Domain-Invariant Features to Secure Pre-Trained Models Against Extraction Mar 17, 2025 Model extraction
— Unverified 0A Survey of Model Extraction Attacks and Defenses in Distributed Computing Environments Feb 22, 2025 Autonomous Vehicles Distributed Computing
— Unverified 0Differentially private fine-tuned NF-Net to predict GI cancer type Feb 17, 2025 Model extraction
— Unverified 0From Counterfactuals to Trees: Competitive Analysis of Model Extraction Attacks Feb 7, 2025 counterfactual Model extraction
Code Code Available 0A Framework for Double-Blind Federated Adaptation of Foundation Models Feb 3, 2025 Federated Learning image-classification
— Unverified 0Data-Free Model-Related Attacks: Unleashing the Potential of Generative AI Jan 28, 2025 Model extraction
— Unverified 0"FRAME: Forward Recursive Adaptive Model Extraction -- A Technique for Advance Feature Selection" Jan 21, 2025 Computational Efficiency feature selection
— Unverified 0HoneypotNet: Backdoor Attacks Against Model Extraction Jan 2, 2025 Backdoor Attack model
— Unverified 0Bounding-box Watermarking: Defense against Model Extraction Attacks on Object Detectors Nov 20, 2024 Model extraction object-detection
— Unverified 0Few-shot Model Extraction Attacks against Sequential Recommender Systems Nov 18, 2024 Model extraction Recommendation Systems
— Unverified 0A Hard-Label Cryptanalytic Extraction of Non-Fully Connected Deep Neural Networks using Side-Channel Attacks Nov 15, 2024 Model extraction
Code Code Available 0Your Semantic-Independent Watermark is Fragile: A Semantic Perturbation Attack against EaaS Watermark Nov 14, 2024 Model extraction
Code Code Available 0Robust and Minimally Invasive Watermarking for EaaS Oct 23, 2024 Model extraction
Code Code Available 0Efficient Model Extraction via Boundary Sampling Oct 20, 2024 model Model extraction
— Unverified 0Efficient and Effective Model Extraction Sep 21, 2024 Benchmarking model
Code Code Available 0CaBaGe: Data-Free Model Extraction using ClAss BAlanced Generator Ensemble Sep 16, 2024 Model extraction
— Unverified 0Protecting Copyright of Medical Pre-trained Language Models: Training-Free Backdoor Model Watermarking Sep 14, 2024 Model extraction Word Embeddings
— Unverified 0VidModEx: Interpretable and Efficient Black Box Model Extraction for High-Dimensional Spaces Aug 4, 2024 image-classification Image Classification
Code Code Available 0Enhancing TinyML Security: Study of Adversarial Attack Transferability Jul 16, 2024 Adversarial Attack Edge-computing
— Unverified 0QUEEN: Query Unlearning against Model Extraction Jul 1, 2024 model Model extraction
— Unverified 0Privacy Implications of Explainable AI in Data-Driven Systems Jun 22, 2024 counterfactual Decision Making
— Unverified 0Beyond Slow Signs in High-fidelity Model Extraction Jun 14, 2024 Benchmarking model
Code Code Available 0