
Model Compression

Model compression has been an actively pursued research area over the last few years, with the goal of deploying state-of-the-art deep networks on low-power, resource-limited devices without a significant drop in accuracy. Parameter pruning, low-rank factorization, and weight quantization are some of the methods proposed to reduce the size of deep networks.

Source: KD-MRI: A knowledge distillation framework for image reconstruction and image restoration in MRI workflow
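To make two of the techniques named above concrete, here is a minimal sketch of magnitude-based parameter pruning followed by uniform 8-bit weight quantization on a single weight matrix. The array shape, sparsity level, and function names are illustrative assumptions, not taken from any paper listed below.

```python
import numpy as np

def magnitude_prune(weights: np.ndarray, sparsity: float) -> np.ndarray:
    """Zero out the smallest-magnitude fraction of weights (parameter pruning)."""
    k = int(weights.size * sparsity)
    if k == 0:
        return weights.copy()
    # k-th smallest absolute value serves as the pruning threshold
    threshold = np.partition(np.abs(weights).ravel(), k - 1)[k - 1]
    pruned = weights.copy()
    pruned[np.abs(pruned) <= threshold] = 0.0
    return pruned

def quantize_uniform(weights: np.ndarray, bits: int = 8):
    """Uniform affine quantization to `bits`-bit codes, plus the dequantized view."""
    qmax = 2 ** bits - 1
    lo, hi = float(weights.min()), float(weights.max())
    scale = (hi - lo) / qmax if hi > lo else 1.0
    codes = np.round((weights - lo) / scale).astype(np.uint8 if bits <= 8 else np.int32)
    dequant = codes.astype(np.float32) * scale + lo
    return codes, dequant

rng = np.random.default_rng(0)
w = rng.normal(size=(256, 256)).astype(np.float32)
w_pruned = magnitude_prune(w, sparsity=0.9)     # keep only the largest 10% of weights
codes, w_hat = quantize_uniform(w_pruned)       # store the survivors as 8-bit codes
print(f"sparsity: {np.mean(w_pruned == 0):.2f}, "
      f"quantization MSE: {np.mean((w_hat - w_pruned) ** 2):.2e}")
```

In practice the pruned and quantized model is fine-tuned afterwards to recover the accuracy lost in compression; this sketch only shows the compression step itself.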

Papers

Showing 621-630 of 1356 papers

Title | Status | Hype
A Survey on Model Compression for Large Language Models | - | 0
FedEdge AI-TC: A Semi-supervised Traffic Classification Method based on Trusted Federated Deep Learning for Mobile Edge Computing | - | 0
Resource Constrained Model Compression via Minimax Optimization for Spiking Neural Networks | Code | 0
Accurate Neural Network Pruning Requires Rethinking Sparse Optimization | - | 0
MIMONet: Multi-Input Multi-Output On-Device Deep Learning | - | 0
Model Compression Methods for YOLOv5: A Review | - | 0
Impact of Disentanglement on Pruning Neural Networks | - | 0
Knowledge Distillation for Object Detection: from generic to remote sensing datasets | - | 0
CA-LoRA: Adapting Existing LoRA for Compressed LLMs to Enable Efficient Multi-Tasking on Personal Devices | Code | 0
Distilling Universal and Joint Knowledge for Cross-Domain Model Compression on Time Series Data | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | MobileBERT + 2bit-1dim model compression using DKM | Accuracy | 82.13 | - | Unverified
2 | MobileBERT + 1bit-1dim model compression using DKM | Accuracy | 63.17 | - | Unverified
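For context on the entries above: DKM compresses weights by clustering them with a differentiable k-means layer, so a label like "2bit-1dim" indicates that each scalar (1-dimensional) weight is replaced by one of 2^2 = 4 shared centroids. The sketch below shows the plain, non-differentiable k-means palettization that DKM relaxes; the matrix shape and iteration count are illustrative assumptions.

```python
import numpy as np

def palettize(weights: np.ndarray, bits: int, iters: int = 20) -> np.ndarray:
    """Cluster scalar weights into 2**bits centroids (k-means palettization).

    After clustering, each weight can be stored as a `bits`-bit index into the
    centroid table, which is the storage model behind "2bit-1dim" style labels.
    """
    k = 2 ** bits
    flat = weights.ravel()
    # Initialize centroids on evenly spaced quantiles of the weight distribution.
    centroids = np.quantile(flat, np.linspace(0.0, 1.0, k))
    for _ in range(iters):
        # Assign each weight to its nearest centroid, then recompute centroids.
        assign = np.argmin(np.abs(flat[:, None] - centroids[None, :]), axis=1)
        for j in range(k):
            members = flat[assign == j]
            if members.size:
                centroids[j] = members.mean()
    assign = np.argmin(np.abs(flat[:, None] - centroids[None, :]), axis=1)
    return centroids[assign].reshape(weights.shape)

rng = np.random.default_rng(0)
w = rng.normal(size=(128, 128)).astype(np.float32)
w_2bit = palettize(w, bits=2)  # 4 centroids, as in the "2bit-1dim" row
print("unique values after palettization:", np.unique(w_2bit).size)
```

DKM's contribution is making the assignment step differentiable so the centroids and the network weights can be trained jointly; the hard argmin assignment here is only the simplest approximation of that idea.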