PruMUX: Augmenting Data Multiplexing with Model Compression | May 24, 2023 | Knowledge Distillation, model
Code Available | 0 | RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models | May 24, 2023 | Machine Translation, Model Compression
Unverified | 0 | Selective Pre-training for Private Fine-tuning | May 23, 2023 | Model Compression, Transfer Learning
Code Available | 0 | Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study | May 22, 2023 | Data Augmentation, Knowledge Distillation
Unverified | 0 | Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt | May 17, 2023 | GPU, Model Compression
Unverified | 0 | Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation | May 14, 2023 | Knowledge Distillation, Machine Translation
Code Available | 0 | GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples | May 13, 2023 | Binarization, Knowledge Distillation
Code Available | 0 | CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation | May 8, 2023 | GPU, Model Compression
Unverified | 0 | Redundancy and Concept Analysis for Code-trained Language Models | May 1, 2023 | Memorization, Model Compression
Unverified | 0 | CORSD: Class-Oriented Relational Self Distillation | Apr 28, 2023 | Knowledge Distillation, Model Compression
Unverified | 0 | Guaranteed Quantization Error Computation for Neural Network Model Compression | Apr 26, 2023 | Model Compression, Neural Network Compression
Unverified | 0 | Bias in Pruned Vision Models: In-Depth Analysis and Countermeasures | Apr 25, 2023 | Model Compression, Network Pruning
Unverified | 0 | Deep Collective Knowledge Distillation | Apr 18, 2023 | Knowledge Distillation, Model Compression
Unverified | 0 | Learning Accurate Performance Predictors for Ultrafast Automated Model Compression | Apr 13, 2023 | image-classification, Image Classification
Code Available | 0 | Structured Pruning for Multi-Task Deep Neural Networks | Apr 13, 2023 | Model Compression
Unverified | 0 | Surrogate Lagrangian Relaxation: A Path To Retrain-free Deep Neural Network Pruning | Apr 8, 2023 | image-classification, Image Classification
Unverified | 0 | oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes | Mar 30, 2023 | Knowledge Distillation, Model Compression
Unverified | 0 | Information-Theoretic GAN Compression with Variational Energy-based Model | Mar 28, 2023 | Image Enhancement, Knowledge Distillation
Unverified | 0 | A Multi-objective Complex Network Pruning Framework Based on Divide-and-conquer and Global Performance Impairment Ranking | Mar 28, 2023 | Model Compression, Network Pruning
Unverified | 0 | Tetra-AML: Automatic Machine Learning via Tensor Networks | Mar 28, 2023 | Bayesian Optimization, Hyperparameter Optimization
Unverified | 0 | Towards Accurate Post-Training Quantization for Vision Transformer | Mar 25, 2023 | Model Compression, Quantization
Unverified | 0 | Exploring Turkish Speech Recognition via Hybrid CTC/Attention Architecture and Multi-feature Fusion Network | Mar 22, 2023 | Model Compression, speech-recognition
Unverified | 0 | Low Rank Optimization for Efficient Deep Learning: Making A Balance between Compact Architecture and Fast Training | Mar 22, 2023 | Model Compression, Quantization
Unverified | 0 | I3D: Transformer architectures with input-dependent dynamic depth for speech recognition | Mar 14, 2023 | Model Compression, speech-recognition
Code Available | 0 | R2 Loss: Range Restriction Loss for Model Compression and Quantization | Mar 14, 2023 | Classification, Model Compression
Unverified | 0 | A Contrastive Knowledge Transfer Framework for Model Compression and Transfer Learning | Mar 14, 2023 | image-classification, Image Classification
Code Available | 0 | OTOV2: Automatic, Generic, User-Friendly | Mar 13, 2023 | Model Compression
Unverified | 0 | On Model Compression for Neural Networks: Framework, Algorithm, and Convergence Guarantee | Mar 13, 2023 | Image Classification, Model Compression
Code Available | 0 | Greener yet Powerful: Taming Large Code Generation Models with Quantization | Mar 9, 2023 | Code Generation, Code Summarization
Unverified | 0 | Gradient-Free Structured Pruning with Unlabeled Data | Mar 7, 2023 | GPU, Model Compression
Unverified | 0 | Rotation Invariant Quantization for Model Compression | Mar 3, 2023 | model, Model Compression
Code Available | 0 | Adversarial Attacks on Machine Learning in Embedded and IoT Platforms | Mar 3, 2023 | Adversarial Robustness, Model Compression
Unverified | 0 | Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation | Mar 1, 2023 | Domain Adaptation, Knowledge Distillation
Unverified | 0 | Debiased Distillation by Transplanting the Last Layer | Feb 22, 2023 | Attribute, Knowledge Distillation
Unverified | 0 | Structured Bayesian Compression for Deep Neural Networks Based on The Turbo-VBI Approach | Feb 21, 2023 | Model Compression
Unverified | 0 | HomoDistil: Homotopic Task-Agnostic Distillation of Pre-trained Transformers | Feb 19, 2023 | Knowledge Distillation, Model Compression
Unverified | 0 | A Comprehensive Review and a Taxonomy of Edge Machine Learning: Requirements, Paradigms, and Techniques | Feb 16, 2023 | Edge-computing, Model Compression
Unverified | 0 | Towards Optimal Compression: Joint Pruning and Quantization | Feb 15, 2023 | Model Compression, Neural Architecture Search
Unverified | 0 | On Achieving Privacy-Preserving State-of-the-Art Edge Intelligence | Feb 10, 2023 | Edge-computing, Model Compression
Unverified | 0 | Knowledge Distillation in Vision Transformers: A Critical Review | Feb 4, 2023 | Decoder, image-classification
Unverified | 0 | Generalized Uncertainty of Deep Neural Networks: Taxonomy and Applications | Feb 2, 2023 | Knowledge Distillation, Model Compression
Unverified | 0 | Knowledge Distillation on Graphs: A Survey | Feb 1, 2023 | Knowledge Distillation, Model Compression
Unverified | 0 | AMD: Adaptive Masked Distillation for Object Detection | Jan 31, 2023 | Knowledge Distillation, Model Compression
Unverified | 0 | Improved knowledge distillation by utilizing backward pass knowledge in neural networks | Jan 27, 2023 | Knowledge Distillation, Model Compression
Unverified | 0 | HALOC: Hardware-Aware Automatic Low-Rank Compression for Compact Neural Networks | Jan 20, 2023 | GPU, Low-rank compression
Unverified | 0 | Accelerating and Compressing Deep Neural Networks for Massive MIMO CSI Feedback | Jan 20, 2023 | Model Compression, Network Pruning
Code Available | 0 | HCE: Improving Performance and Efficiency with Heterogeneously Compressed Neural Network Ensemble | Jan 18, 2023 | Diversity, Ensemble Learning
Unverified | 0 | Distilling Focal Knowledge From Imperfect Expert for 3D Object Detection | Jan 1, 2023 | 3D geometry, 3D Object Detection
Code Available | 0 | One-Shot Model for Mixed-Precision Quantization | Jan 1, 2023 | model, Model Compression
Unverified | 0 | Tiny Updater: Towards Efficient Neural Network-Driven Software Updating | Jan 1, 2023 | Efficient Neural Network, image-classification
Code Available | 0