Model Compression

Model compression has been an active area of research in recent years. Its goal is to deploy state-of-the-art deep networks on low-power, resource-limited devices without a significant drop in accuracy. Parameter pruning, low-rank factorization, and weight quantization are among the methods proposed to reduce the size of deep networks.

Source: KD-MRI: A knowledge distillation framework for image reconstruction and image restoration in MRI workflow
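
As a rough illustration of the three techniques named in the description above, here is a minimal pure-Python sketch. It is not taken from the KD-MRI paper or from any paper listed on this page. The function names (`magnitude_prune`, `quantize_uniform`, `dequantize`, `rank1_factorize`) are hypothetical, and the specific choices (magnitude-based pruning, symmetric uniform quantization, a rank-1 factorization found by power iteration) are assumptions made only to keep the example short.

```python
import math


def magnitude_prune(weights, sparsity):
    """Parameter pruning: zero the smallest-magnitude fraction of the weights."""
    k = int(len(weights) * sparsity)  # number of weights to drop
    if k == 0:
        return list(weights)
    # Indices of the k weights with the smallest absolute value.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    drop = set(order[:k])
    return [0.0 if i in drop else w for i, w in enumerate(weights)]


def quantize_uniform(weights, bits):
    """Weight quantization: map floats to signed integer codes plus one scale."""
    qmax = 2 ** (bits - 1) - 1  # e.g. 7 for 4-bit signed codes
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    codes = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return codes, scale


def dequantize(codes, scale):
    """Recover approximate float weights from integer codes."""
    return [c * scale for c in codes]


def rank1_factorize(matrix, iters=50):
    """Low-rank factorization: approximate an m x n matrix by u v^T.

    Uses power iteration on A^T A; storage drops from m*n to m+n values.
    """
    m, n = len(matrix), len(matrix[0])
    v = [1.0 / math.sqrt(n)] * n
    for _ in range(iters):
        u = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(m)]  # u = A v
        v = [sum(matrix[i][j] * u[i] for i in range(m)) for j in range(n)]  # v = A^T u
        norm = math.sqrt(sum(x * x for x in v)) or 1.0  # guard against a zero vector
        v = [x / norm for x in v]
    u = [sum(matrix[i][j] * v[j] for j in range(n)) for i in range(m)]
    return u, v


if __name__ == "__main__":
    w = [0.1, -2.0, 0.5, 3.0]
    print("pruned:", magnitude_prune(w, 0.5))
    codes, scale = quantize_uniform(w, bits=4)
    print("codes:", codes, "dequantized:", dequantize(codes, scale))
    u, v = rank1_factorize([[2.0, 4.0], [1.0, 2.0]])
    print("u:", u, "v:", v)
```

In practice these steps are usually followed by fine-tuning to recover accuracy, and real networks need a rank higher than 1; the sketch only shows where the size reduction comes from in each case.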

Papers

Showing 476–500 of 1356 papers

Title	Date	Tasks	Status
Enabling All In-Edge Deep Learning: A Literature Review	Apr 7, 2022	All, Deep Learning	Unverified
Automated Inference of Graph Transformation Rules	Apr 3, 2024	Model Compression	Unverified
Data-Free Knowledge Transfer: A Survey	Dec 31, 2021	Data-free Knowledge Distillation, Domain Adaptation	Unverified
Auto Graph Encoder-Decoder for Neural Network Pruning	Nov 25, 2020	Decoder, Model Compression	Unverified
A Low-Power Streaming Speech Enhancement Accelerator For Edge Devices	Mar 27, 2025	Model Compression, Speech Enhancement	Unverified
Acoustic Model Compression with MAP adaptation	May 1, 2017	Automatic Speech Recognition (ASR), model	Unverified
Data-Free Knowledge Distillation Using Adversarially Perturbed OpenGL Shader Images	Oct 20, 2023	Data Augmentation, Data-free Knowledge Distillation	Unverified
Enhanced Sparsification via Stimulative Training	Mar 11, 2024	Knowledge Distillation, Model Compression	Unverified
AutoDistill: an End-to-End Framework to Explore and Distill Hardware-Efficient Language Models	Jan 21, 2022	Bayesian Optimization, Knowledge Distillation	Unverified
Compacting Deep Neural Networks for Internet of Things: Methods and Applications	Mar 20, 2021	Diversity, Knowledge Distillation	Unverified
Enhancing Inference Efficiency of Large Language Models: Investigating Optimization Strategies and Architectural Innovations	Apr 2, 2024	Model Compression	Unverified
FASP: Fast and Accurate Structured Pruning of Large Language Models	Jan 16, 2025	GPU, Model Compression	Unverified
Enhancing Targeted Attack Transferability via Diversified Weight Pruning	Aug 18, 2022	Diversity, Model Compression	Unverified
A Low Effort Approach to Structured CNN Design Using PCA	Dec 15, 2018	Dimensionality Reduction, Model Compression	Unverified
A Lottery Ticket Hypothesis Framework for Low-Complexity Device-Robust Neural Acoustic Scene Classification	Jul 3, 2021	Acoustic Scene Classification, Data Augmentation	Unverified
Data-Free Distillation of Language Model by Text-to-Text Transfer	Nov 3, 2023	Data-free Knowledge Distillation, Diversity	Unverified
EPIM: Efficient Processing-In-Memory Accelerators based on Epitome	Nov 12, 2023	Model Compression, Neural Architecture Search	Unverified
26ms Inference Time for ResNet-50: Towards Real-Time Execution of all DNNs on Smartphone	May 2, 2019	All, Model Compression	Unverified
Error-aware Quantization through Noise Tempering	Dec 11, 2022	Model Compression, Quantization	Unverified
Data-Free Adversarial Knowledge Distillation for Graph Neural Networks	May 8, 2022	Generative Adversarial Network, Graph Classification	Unverified
Every Expert Matters: Towards Effective Knowledge Distillation for Mixture-of-Experts Language Models	Feb 18, 2025	Knowledge Distillation, Mixture-of-Experts	Unverified
AutoBSS: An Efficient Algorithm for Block Stacking Style Search	Oct 20, 2020	AutoML, Bayesian Optimization	Unverified
Failure-Resilient Distributed Inference with Model Compression over Heterogeneous Edge Devices	Jun 20, 2024	Knowledge Distillation, Model Compression	Unverified
Fast DistilBERT on CPUs	Oct 27, 2022	Knowledge Distillation, Model Compression	Unverified
Feature Interaction Fusion Self-Distillation Network For CTR Prediction	Nov 12, 2024	Click-Through Rate Prediction, Knowledge Distillation	Unverified

Datasets: ImageNet, QNLI

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	ADLIK-MO-ResNet50+W4A4	Top-1	77.88	—	Unverified
2	ADLIK-MO-ResNet50+W3A4	Top-1	77.34	—	Unverified
3	ResNet-18 + 4bit-1dim model compression using DKM	Top-1	70.52	—	Unverified
4	MobileNet-v1 + 4bit-1dim model compression using DKM	Top-1	69.63	—	Unverified
5	ResNet-18 + 2bit-1dim model compression using DKM	Top-1	68.63	—	Unverified
6	MobileNet-v1 + 2bit-1dim model compression using DKM	Top-1	67.62	—	Unverified
7	ResNet-18 + 4bit-4dim model compression using DKM	Top-1	66.1	—	Unverified
8	ResNet-18 + 2bit-2dim model compression using DKM	Top-1	64.7	—	Unverified
9	MobileNet-v1 + 4bit-4dim model compression using DKM	Top-1	61.4	—	Unverified
10	ResNet-18 + 1bit-1dim model compression using DKM	Top-1	59.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	MobileBERT + 2bit-1dim model compression using DKM	Accuracy	82.13	—	Unverified
2	MobileBERT + 1bit-1dim model compression using DKM	Accuracy	63.17	—	Unverified