SOTAVerified

Model Compression

Model Compression is an actively pursued area of research over the last few years with the goal of deploying state-of-the-art deep networks in low-power and resource limited devices without significant drop in accuracy. Parameter pruning, low-rank factorization and weight quantization are some of the proposed methods to compress the size of deep networks.

Source: KD-MRI: A knowledge distillation framework for image reconstruction and image restoration in MRI workflow

Papers

Showing 651700 of 1356 papers

TitleStatusHype
InfantCryNet: A Data-driven Framework for Intelligent Analysis of Infant Cries0
Inference Optimization of Foundation Models on AI Accelerators0
Information-Theoretic GAN Compression with Variational Energy-based Model0
Tetra-AML: Automatic Machine Learning via Tensor Networks0
Infra-YOLO: Efficient Neural Network Structure with Model Compression for Real-Time Infrared Small Object Detection0
InhibiDistilbert: Knowledge Distillation for a ReLU and Addition-based Transformer0
A Survey of Small Language Models0
INSIGHT: A Survey of In-Network Systems for Intelligent, High-Efficiency AI and Topology Optimization0
Instance-Aware Group Quantization for Vision Transformers0
Integral Pruning on Activations and Weights for Efficient Neural Networks0
PublicCheck: Public Integrity Verification for Services of Run-time Deep Models0
Interpreting Deep Classifier by Visual Distillation of Dark Knowledge0
Redundancy and Concept Analysis for Code-trained Language Models0
Intrinsically Sparse Long Short-Term Memory Networks0
Investigation of Practical Aspects of Single Channel Speech Separation for ASR0
TextPruner: A Model Pruning Toolkit for Pre-Trained Language Models0
Is Quantum Optimization Ready? An Effort Towards Neural Network Compression using Adiabatic Quantum Computing0
IteRABRe: Iterative Recovery-Aided Block Reduction0
Iterative Compression of End-to-End ASR Model using AutoML0
2-bit Model Compression of Deep Convolutional Neural Network on ASIC Engine for Image Retrieval0
It's always personal: Using Early Exits for Efficient On-Device CNN Personalisation0
The Effect of Model Compression on Fairness in Facial Expression Recognition0
A Survey of Model Compression and Acceleration for Deep Neural Networks0
Joint Neural Architecture Search and Quantization0
Joint Regularization on Activations and Weights for Efficient Neural Network Pruning0
KDH-MLTC: Knowledge Distillation for Healthcare Multi-Label Text Classification0
A Survey of Mobile Computing for the Visually Impaired0
Enabling Retrain-free Deep Neural Network Pruning using Surrogate Lagrangian Relaxation0
Kernel Modulation: A Parameter-Efficient Method for Training Convolutional Neural Networks0
KIMERA: Injecting Domain Knowledge into Vacant Transformer Heads0
KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models0
A Speed Odyssey for Deployable Quantization of LLMs0
VQ4ALL: Efficient Neural Network Representation via a Universal Codebook0
Knowledge Distillation: A Survey0
Knowledge Distillation Based Semantic Communications For Multiple Users0
Knowledge Distillation Beyond Model Compression0
The Impact of Quantization and Pruning on Deep Reinforcement Learning Models0
Knowledge Distillation for Image Restoration : Simultaneous Learning from Degraded and Clean Images0
Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-guided Feature Imitation0
Knowledge Distillation for Object Detection: from generic to remote sensing datasets0
Knowledge Distillation for Oriented Object Detection on Aerial Images0
The Knowledge Within: Methods for Data-Free Model Compression0
Knowledge Distillation for Swedish NER models: A Search for Performance and Efficiency0
Knowledge Distillation in Federated Learning: a Survey on Long Lasting Challenges and New Solutions0
Knowledge Distillation in Vision Transformers: A Critical Review0
Knowledge Distillation in Wide Neural Networks: Risk Bound, Data Efficiency and Imperfect Teacher0
A SOT-MRAM-based Processing-In-Memory Engine for Highly Compressed DNN Implementation0
Knowledge Distillation on Graphs: A Survey0
Knowledge distillation via adaptive instance normalization0
Knowledge distillation via softmax regression representation learning0
Show:102550
← PrevPage 14 of 28Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MobileBERT + 2bit-1dim model compression using DKMAccuracy82.13Unverified
2MobileBERT + 1bit-1dim model compression using DKMAccuracy63.17Unverified