Learning to Collide: Recommendation System Model Compression with Learned Hash Functions Mar 28, 2022 Model Compression
— Unverified 0Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks Mar 25, 2022 Incremental Learning Knowledge Distillation
Code Code Available 1Mitigating Gender Bias in Distilled Language Models via Counterfactual Role Reversal Mar 23, 2022 counterfactual Fairness
— Unverified 0DQ-BART: Efficient Sequence-to-Sequence Model via Joint Distillation and Quantization Mar 21, 2022 Knowledge Distillation Model Compression
Code Code Available 1Compression of Generative Pre-trained Language Models via Quantization Mar 21, 2022 Model Compression Quantization
— Unverified 0PublicCheck: Public Integrity Verification for Services of Run-time Deep Models Mar 21, 2022 Model Compression
— Unverified 0Learning Compressed Embeddings for On-Device Inference Mar 18, 2022 Model Compression Recommendation Systems
— Unverified 0A Closer Look at Knowledge Distillation with Features, Logits, and Gradients Mar 18, 2022 Incremental Learning Knowledge Distillation
— Unverified 0Approximability and Generalisation Mar 15, 2022 Learning Theory Model Compression
— Unverified 0A Mixed Integer Programming Approach for Verifying Properties of Binarized Neural Networks Mar 11, 2022 Collision Avoidance Model Compression
— Unverified 0An Empirical Study of Low Precision Quantization for TinyML Mar 10, 2022 BIG-bench Machine Learning Model Compression
— Unverified 0Don't Be So Dense: Sparse-to-Sparse GAN Training Without Sacrificing Performance Mar 5, 2022 Model Compression
— Unverified 0Structured Pruning is All You Need for Pruning CNNs at Initialization Mar 4, 2022 All Model Compression
— Unverified 0E-LANG: Energy-Based Joint Inferencing of Super and Swift Language Models Mar 1, 2022 Decision Making Model Compression
— Unverified 0KMIR: A Benchmark for Evaluating Knowledge Memorization, Identification and Reasoning Abilities of Language Models Feb 28, 2022 General Knowledge Memorization
— Unverified 0Multi-task Learning Approach for Modulation and Wireless Signal Classification for 5G and Beyond: Edge Deployment via Model Compression Feb 26, 2022 Management Model Compression
— Unverified 0A Novel Architecture Slimming Method for Network Pruning and Knowledge Distillation Feb 21, 2022 Knowledge Distillation Model Compression
— Unverified 0Time-Correlated Sparsification for Efficient Over-the-Air Model Aggregation in Wireless Federated Learning Feb 17, 2022 Federated Learning Model Compression
— Unverified 0A Survey on Model Compression and Acceleration for Pretrained Language Models Feb 15, 2022 Model Compression
— Unverified 0SPDY: Accurate Pruning with Speedup Guarantees Jan 31, 2022 GPU Model Compression
Code Code Available 1Memory-Efficient Backpropagation through Large Linear Layers Jan 31, 2022 Model Compression
Code Code Available 1Training Thinner and Deeper Neural Networks: Jumpstart Regularization Jan 30, 2022 Model Compression Quantization
Code Code Available 0AutoMC: Automated Model Compression based on Domain Knowledge and Progressive search strategy Jan 24, 2022 Model Compression
Code Code Available 0Enabling Deep Learning on Edge Devices through Filter Pruning and Knowledge Transfer Jan 22, 2022 image-classification Image Classification
— Unverified 0Can Model Compression Improve NLP Fairness Jan 21, 2022 Fairness Knowledge Distillation
— Unverified 0AutoDistill: an End-to-End Framework to Explore and Distill Hardware-Efficient Language Models Jan 21, 2022 Bayesian Optimization Knowledge Distillation
— Unverified 0High-fidelity 3D Model Compression based on Key Spheres Jan 19, 2022 Model Compression Object
Code Code Available 0PCEE-BERT: Accelerating BERT Inference via Patient and Confident Early Exiting Jan 16, 2022 Model Compression
— Unverified 0UDC: Unified DNAS for Compressible TinyML Models Jan 15, 2022 Model Compression Neural Architecture Search
— Unverified 0DeepSpeed-MoE: Advancing Mixture-of-Experts Inference and Training to Power Next-Generation AI Scale Jan 14, 2022 Decoder Mixture-of-Experts
Code Code Available 0ThreshNet: An Efficient DenseNet Using Threshold Mechanism to Reduce Connections Jan 9, 2022 image-classification Image Classification
Code Code Available 0Two-Pass End-to-End ASR Model Compression Jan 8, 2022 Decoder Knowledge Distillation
— Unverified 0The Effect of Model Compression on Fairness in Facial Expression Recognition Jan 5, 2022 Facial Expression Recognition Facial Expression Recognition (FER)
— Unverified 0Dreaming To Prune Image Deraining Networks Jan 1, 2022 Model Compression Rain Removal
— Unverified 0HODEC: Towards Efficient High-Order DEcomposed Convolutional Neural Networks Jan 1, 2022 Model Compression Vocal Bursts Intensity Prediction
— Unverified 0Croesus: Multi-Stage Processing and Transactions for Video-Analytics in Edge-Cloud Systems Dec 31, 2021 Model Compression object-detection
— Unverified 0Multi-Dimensional Model Compression of Vision Transformer Dec 31, 2021 model Model Compression
Code Code Available 0Conditional Generative Data-free Knowledge Distillation Dec 31, 2021 Conditional Image Generation Data-free Knowledge Distillation
— Unverified 0Data-Free Knowledge Transfer: A Survey Dec 31, 2021 Data-free Knowledge Distillation Domain Adaptation
— Unverified 0Finding the Task-Optimal Low-Bit Sub-Distribution in Deep Neural Networks Dec 30, 2021 CPU image-classification
Code Code Available 1Automatic Mixed-Precision Quantization Search of BERT Dec 30, 2021 Knowledge Distillation Model Compression
— Unverified 0SPViT: Enabling Faster Vision Transformers via Soft Token Pruning Dec 27, 2021 Efficient ViTs image-classification
Code Code Available 1LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision Dec 18, 2021 Knowledge Distillation Model Compression
— Unverified 0Pixel Distillation: A New Knowledge Distillation Scheme for Low-Resolution Image Recognition Dec 17, 2021 image-classification Image Classification
Code Code Available 1From Dense to Sparse: Contrastive Pruning for Better Pre-trained Language Model Compression Dec 14, 2021 Contrastive Learning Language Modeling
Code Code Available 0Knowledge Distillation for Object Detection via Rank Mimicking and Prediction-guided Feature Imitation Dec 9, 2021 image-classification Image Classification
— Unverified 0Low-rank Tensor Decomposition for Compression of Convolutional Neural Networks Using Funnel Regularization Dec 7, 2021 global-optimization Model Compression
— Unverified 0Finding Deviated Behaviors of the Compressed DNN Models for Image Classifications Dec 6, 2021 image-classification Image Classification
Code Code Available 0Toward Real-World Voice Disorder Classification Dec 5, 2021 Classification Model Compression
— Unverified 0Shapeshifter: a Parameter-efficient Transformer using Factorized Reshaped Matrices Dec 1, 2021 Knowledge Distillation Model Compression
Code Code Available 0