SOTAVerified

Model Compression

Model Compression is an actively pursued area of research over the last few years with the goal of deploying state-of-the-art deep networks in low-power and resource limited devices without significant drop in accuracy. Parameter pruning, low-rank factorization and weight quantization are some of the proposed methods to compress the size of deep networks.

Source: KD-MRI: A knowledge distillation framework for image reconstruction and image restoration in MRI workflow

Papers

Showing 391400 of 1356 papers

TitleStatusHype
Computer Vision Model Compression Techniques for Embedded Systems: A SurveyCode0
Faithful Label-free Knowledge DistillationCode0
Finding Deviated Behaviors of the Compressed DNN Models for Image ClassificationsCode0
Explicit-NeRF-QA: A Quality Assessment Database for Explicit NeRF Model CompressionCode0
Exploiting Kernel Sparsity and Entropy for Interpretable CNN CompressionCode0
On Model Compression for Neural Networks: Framework, Algorithm, and Convergence GuaranteeCode0
Adversarial Robustness vs. Model Compression, or Both?Code0
Exact Backpropagation in Binary Weighted Networks with Group Weight TransformationsCode0
Exploring Gradient Flow Based Saliency for DNN Model CompressionCode0
Enhancing Knowledge Distillation of Large Language Models through Efficient Multi-Modal Distribution AlignmentCode0
Show:102550
← PrevPage 40 of 136Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MobileBERT + 2bit-1dim model compression using DKMAccuracy82.13Unverified
2MobileBERT + 1bit-1dim model compression using DKMAccuracy63.17Unverified