SOTAVerified

Model Compression

Model Compression is an actively pursued area of research over the last few years with the goal of deploying state-of-the-art deep networks in low-power and resource limited devices without significant drop in accuracy. Parameter pruning, low-rank factorization and weight quantization are some of the proposed methods to compress the size of deep networks.

Source: KD-MRI: A knowledge distillation framework for image reconstruction and image restoration in MRI workflow

Papers

Showing 781790 of 1356 papers

TitleStatusHype
Analysis of memory consumption by neural networks based on hyperparameters0
Augmenting Knowledge Distillation With Peer-To-Peer Mutual Learning For Model Compression0
Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization0
Pro-KD: Progressive Distillation by Following the Footsteps of the Teacher0
A Short Study on Compressing Decoder-Based Language Models0
HRKD: Hierarchical Relational Knowledge Distillation for Cross-domain Language Model CompressionCode0
Robustness Challenges in Model Distillation and Pruning for Natural Language Understanding0
Differentiable Network Pruning for Microcontrollers0
Joint Channel and Weight Pruning for Model Acceleration on Moblie DevicesCode1
Kronecker Decomposition for GPT Compression0
Show:102550
← PrevPage 79 of 136Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MobileBERT + 2bit-1dim model compression using DKMAccuracy82.13Unverified
2MobileBERT + 1bit-1dim model compression using DKMAccuracy63.17Unverified