MobileNMT: Enabling Translation in 15MB and 30ms Jun 7, 2023 Model Compression NMT
Code Code Available 1Modular Transformers: Compressing Transformers into Modularized Layers for Flexible Efficient Inference Jun 4, 2023 Decoder Knowledge Distillation
— Unverified 0Riemannian Low-Rank Model Compression for Federated Learning with Over-the-Air Aggregation Jun 4, 2023 Federated Learning Model Compression
— Unverified 0Low-Complexity Acoustic Scene Classification Using Data Augmentation and Lightweight ResNet Jun 3, 2023 Acoustic Scene Classification Data Augmentation
— Unverified 0Group channel pruning and spatial attention distilling for object detection Jun 2, 2023 Knowledge Distillation Model Compression
— Unverified 0Task-Agnostic Structured Pruning of Speech Representation Models Jun 2, 2023 Model Compression
— Unverified 0LoRAPrune: Structured Pruning Meets Low-Rank Parameter-Efficient Fine-Tuning May 28, 2023 Model Compression Network Pruning
Code Code Available 1ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval May 28, 2023 Image Retrieval Knowledge Distillation
— Unverified 0COMCAT: Towards Efficient Compression and Customization of Attention-Based Vision Models May 26, 2023 Model Compression
Code Code Available 12-bit Conformer quantization for automatic speech recognition May 26, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0RAND: Robustness Aware Norm Decay For Quantized Seq2seq Models May 24, 2023 Machine Translation Model Compression
— Unverified 0An Efficient Multilingual Language Model Compression through Vocabulary Trimming May 24, 2023 Language Modeling Language Modelling
Code Code Available 1PruMUX: Augmenting Data Multiplexing with Model Compression May 24, 2023 Knowledge Distillation model
Code Code Available 0Selective Pre-training for Private Fine-tuning May 23, 2023 Model Compression Transfer Learning
Code Code Available 0Revisiting Data Augmentation in Model Compression: An Empirical and Comprehensive Study May 22, 2023 Data Augmentation Knowledge Distillation
— Unverified 0Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt May 17, 2023 GPU Model Compression
— Unverified 0AD-KD: Attribution-Driven Knowledge Distillation for Language Model Compression May 17, 2023 Knowledge Distillation Language Modeling
Code Code Available 1Towards Understanding and Improving Knowledge Distillation for Neural Machine Translation May 14, 2023 Knowledge Distillation Machine Translation
Code Code Available 0GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples May 13, 2023 Binarization Knowledge Distillation
Code Code Available 0CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation May 8, 2023 GPU Model Compression
— Unverified 0Redundancy and Concept Analysis for Code-trained Language Models May 1, 2023 Memorization Model Compression
— Unverified 0CORSD: Class-Oriented Relational Self Distillation Apr 28, 2023 Knowledge Distillation Model Compression
— Unverified 0Guaranteed Quantization Error Computation for Neural Network Model Compression Apr 26, 2023 Model Compression Neural Network Compression
— Unverified 0Class Attention Transfer Based Knowledge Distillation Apr 25, 2023 Knowledge Distillation Model Compression
Code Code Available 1Bias in Pruned Vision Models: In-Depth Analysis and Countermeasures Apr 25, 2023 Model Compression Network Pruning
— Unverified 0Deep Collective Knowledge Distillation Apr 18, 2023 Knowledge Distillation Model Compression
— Unverified 0Learning Accurate Performance Predictors for Ultrafast Automated Model Compression Apr 13, 2023 image-classification Image Classification
Code Code Available 0Structured Pruning for Multi-Task Deep Neural Networks Apr 13, 2023 Model Compression
— Unverified 0Surrogate Lagrangian Relaxation: A Path To Retrain-free Deep Neural Network Pruning Apr 8, 2023 image-classification Image Classification
— Unverified 0oBERTa: Improving Sparse Transfer Learning via improved initialization, distillation, and pruning regimes Mar 30, 2023 Knowledge Distillation Model Compression
— Unverified 0A Multi-objective Complex Network Pruning Framework Based on Divide-and-conquer and Global Performance Impairment Ranking Mar 28, 2023 Model Compression Network Pruning
— Unverified 0Tetra-AML: Automatic Machine Learning via Tensor Networks Mar 28, 2023 Bayesian Optimization Hyperparameter Optimization
— Unverified 0Information-Theoretic GAN Compression with Variational Energy-based Model Mar 28, 2023 Image Enhancement Knowledge Distillation
— Unverified 0Towards Accurate Post-Training Quantization for Vision Transformer Mar 25, 2023 Model Compression Quantization
— Unverified 0Low Rank Optimization for Efficient Deep Learning: Making A Balance between Compact Architecture and Fast Training Mar 22, 2023 Model Compression Quantization
— Unverified 0Exploring Turkish Speech Recognition via Hybrid CTC/Attention Architecture and Multi-feature Fusion Network Mar 22, 2023 Model Compression speech-recognition
— Unverified 0Performance-aware Approximation of Global Channel Pruning for Multitask CNNs Mar 21, 2023 Model Compression
Code Code Available 1The Tiny Time-series Transformer: Low-latency High-throughput Classification of Astronomical Transients using Deep Model Compression Mar 15, 2023 Astronomy Model Compression
Code Code Available 1R2 Loss: Range Restriction Loss for Model Compression and Quantization Mar 14, 2023 Classification Model Compression
— Unverified 0I3D: Transformer architectures with input-dependent dynamic depth for speech recognition Mar 14, 2023 Model Compression speech-recognition
Code Code Available 0A Contrastive Knowledge Transfer Framework for Model Compression and Transfer Learning Mar 14, 2023 image-classification Image Classification
Code Code Available 0OTOV2: Automatic, Generic, User-Friendly Mar 13, 2023 Model Compression
— Unverified 0On Model Compression for Neural Networks: Framework, Algorithm, and Convergence Guarantee Mar 13, 2023 Image Classification Model Compression
Code Code Available 0Greener yet Powerful: Taming Large Code Generation Models with Quantization Mar 9, 2023 Code Generation Code Summarization
— Unverified 0Gradient-Free Structured Pruning with Unlabeled Data Mar 7, 2023 GPU Model Compression
— Unverified 0Rotation Invariant Quantization for Model Compression Mar 3, 2023 model Model Compression
Code Code Available 0Adversarial Attacks on Machine Learning in Embedded and IoT Platforms Mar 3, 2023 Adversarial Robustness Model Compression
— Unverified 0Towards domain generalisation in ASR with elitist sampling and ensemble knowledge distillation Mar 1, 2023 Domain Adaptation Knowledge Distillation
— Unverified 0Structured Pruning of Self-Supervised Pre-trained Models for Speech Recognition and Understanding Feb 27, 2023 Model Compression Representation Learning
Code Code Available 1Debiased Distillation by Transplanting the Last Layer Feb 22, 2023 Attribute Knowledge Distillation
— Unverified 0