Comprehensive Survey of Model Compression and Speed up for Vision Transformers Apr 16, 2024 Computational Efficiency Edge-computing
— Unverified 0Are We There Yet? A Measurement Study of Efficiency for LLM Applications on Mobile Devices Mar 10, 2025 CPU GPU
— Unverified 0Compressed models are NOT miniature versions of large models Jul 18, 2024 Adversarial Attack Model Compression
— Unverified 0Artemis: HE-Aware Training for Efficient Privacy-Preserving Machine Learning Oct 2, 2023 Model Compression Privacy Preserving
— Unverified 0A Novel Architecture Slimming Method for Network Pruning and Knowledge Distillation Feb 21, 2022 Knowledge Distillation Model Compression
— Unverified 0Adaptive Learning of Tensor Network Structures Aug 12, 2020 BIG-bench Machine Learning Model Compression
— Unverified 0Characterizing the Accuracy -- Efficiency Trade-off of Low-rank Decomposition in Language Models May 10, 2024 AI Agent Model Compression
— Unverified 0Accelerating Framework of Transformer by Hardware Design and Model Compression Co-Optimization Oct 19, 2021 CPU GPU
— Unverified 0DeepRebirth: Accelerating Deep Neural Network Execution on Mobile Devices Aug 16, 2017 CPU Model Compression
— Unverified 0Channel Compression: Rethinking Information Redundancy among Channels in CNN Architecture Jul 2, 2020 Acoustic Scene Classification Event Detection
— Unverified 0Deep Model Compression Via Two-Stage Deep Reinforcement Learning Dec 4, 2019 Autonomous Driving Deep Reinforcement Learning
— Unverified 0An Improving Framework of regularization for Network Compression Dec 11, 2019 Model Compression object-detection
— Unverified 0Order of Compression: A Systematic and Optimal Sequence to Combinationally Compress CNN Mar 26, 2024 Knowledge Distillation Model Compression
— Unverified 0Adaptive Quantization of Neural Networks Jan 1, 2018 Edge-computing Model Compression
— Unverified 0Neural Epitome Search for Architecture-Agnostic Network Compression Jul 12, 2019 channel selection Model Compression
— Unverified 0Extending DeepSDF for automatic 3D shape retrieval and similarity transform estimation Apr 20, 2020 3D Shape Classification 3D Shape Retrieval
— Unverified 0Accelerating deep neural networks for efficient scene understanding in automotive cyber-physical systems Jul 19, 2021 Model Compression object-detection
— Unverified 0Adaptive Neural Connections for Sparsity Learning Mar 5, 2020 Model Compression Network Pruning
— Unverified 0Deep learning model compression using network sensitivity and gradients Oct 11, 2022 Deep Learning Model Compression
— Unverified 0Cascaded channel pruning using hierarchical self-distillation Aug 16, 2020 Knowledge Distillation Model Compression
— Unverified 0Can We Find Strong Lottery Tickets in Generative Models? Dec 16, 2022 Model Compression Network Pruning
— Unverified 0A New Clustering-Based Technique for the Acceleration of Deep Convolutional Networks Jul 19, 2021 Clustering image-classification
— Unverified 0Deep Model Compression based on the Training History Jan 30, 2021 model Model Compression
— Unverified 0Can Students Outperform Teachers in Knowledge Distillation based Model Compression? Jan 1, 2021 Knowledge Distillation Model Compression
— Unverified 0Can Students Beyond The Teacher? Distilling Knowledge from Teacher's Bias Dec 13, 2024 Knowledge Distillation Model Compression
— Unverified 0A "Network Pruning Network" Approach to Deep Model Compression Jan 15, 2020 Knowledge Distillation Model Compression
— Unverified 0An Empirical Study of Low Precision Quantization for TinyML Mar 10, 2022 BIG-bench Machine Learning Model Compression
— Unverified 0Can Model Compression Improve NLP Fairness Jan 21, 2022 Fairness Knowledge Distillation
— Unverified 0Heterogeneous Federated Learning using Dynamic Model Pruning and Adaptive Gradient Jun 13, 2021 Federated Learning Model Compression
— Unverified 02-bit Model Compression of Deep Convolutional Neural Network on ASIC Engine for Image Retrieval May 8, 2019 Image Retrieval Model Compression
— Unverified 0Deep Model Compression: Distilling Knowledge from Noisy Teachers Oct 30, 2016 Model Compression
— Unverified 0DeepTwist: Learning Model Compression via Occasional Weight Distortion Oct 30, 2018 model Model Compression
— Unverified 0Can collaborative learning be private, robust and scalable? May 5, 2022 Adversarial Robustness Federated Learning
— Unverified 0CAIT: Triple-Win Compression towards High Accuracy, Fast Inference, and Favorable Transferability For ViTs Sep 27, 2023 Model Compression Semantic Segmentation
— Unverified 0Multihop: Leveraging Complex Models to Learn Accurate Simple Models Sep 14, 2021 Explainable artificial intelligence Knowledge Distillation
— Unverified 0Bringing AI To Edge: From Deep Learning's Perspective Nov 25, 2020 Deep Learning Edge-computing
— Unverified 0An Empirical Investigation of Matrix Factorization Methods for Pre-trained Transformers Jun 17, 2024 Model Compression text-classification
— Unverified 0Adapting Models to Signal Degradation using Distillation Apr 1, 2016 Domain Adaptation Knowledge Distillation
— Unverified 0BRIEDGE: EEG-Adaptive Edge AI for Multi-Brain to Multi-Robot Interaction Mar 14, 2024 EEG Model Compression
— Unverified 0Bridging the Resource Gap: Deploying Advanced Imitation Learning Models onto Affordable Embedded Platforms Nov 18, 2024 Imitation Learning Model Compression
— Unverified 0A Multi-objective Complex Network Pruning Framework Based on Divide-and-conquer and Global Performance Impairment Ranking Mar 28, 2023 Model Compression Network Pruning
— Unverified 0Bridging the Gap Between Foundation Models and Heterogeneous Federated Learning Sep 30, 2023 Federated Learning Model Compression
— Unverified 0An Embedded Deep Learning Object Detection Model For Traffic In Asian Countries Jun 9, 2020 Deep Learning Model Compression
— Unverified 0AdapMTL: Adaptive Pruning Framework for Multitask Learning Model Aug 7, 2024 model Model Compression
— Unverified 0Accelerating Deep Learning with Dynamic Data Pruning Nov 24, 2021 Attribute Deep Learning
— Unverified 0Deep Compression of Neural Networks for Fault Detection on Tennessee Eastman Chemical Processes Jan 18, 2021 Clustering Fault Detection
— Unverified 0Boosting Graph Neural Networks via Adaptive Knowledge Distillation Oct 12, 2022 Graph Classification Graph Mining
— Unverified 0Block-wise Intermediate Representation Training for Model Compression Oct 20, 2018 Knowledge Distillation model
— Unverified 0DopQ-ViT: Towards Distribution-Friendly and Outlier-Aware Post-Training Quantization for Vision Transformers Aug 6, 2024 Model Compression Quantization
— Unverified 0Block Skim Transformer for Efficient Question Answering Jan 1, 2021 Language Modeling Language Modelling
— Unverified 0