Explainability-Driven Leaf Disease Classification Using Adversarial Training and Knowledge Distillation Dec 30, 2023 Adversarial Attack Classification
— Unverified 0Explaining Sequence-Level Knowledge Distillation as Data-Augmentation for Neural Machine Translation Dec 6, 2019 Data Augmentation Knowledge Distillation
— Unverified 0Comprehensive Survey of Model Compression and Speed up for Vision Transformers Apr 16, 2024 Computational Efficiency Edge-computing
— Unverified 0Exploiting Domain Knowledge via Grouped Weight Sharing with Application to Text Categorization Feb 8, 2017 General Classification Model Compression
— Unverified 0Are We There Yet? A Measurement Study of Efficiency for LLM Applications on Mobile Devices Mar 10, 2025 CPU GPU
— Unverified 0Exploiting Non-Linear Redundancy for Neural Model Compression May 28, 2020 model Model Compression
— Unverified 0GeneCAI: Genetic Evolution for Acquiring Compact AI Apr 8, 2020 GPU Model Compression
— Unverified 0Exploration and Estimation for Model Compression Jan 1, 2021 model Model Compression
— Unverified 0GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference Dec 23, 2024 GPU Language Modeling
— Unverified 0Artemis: HE-Aware Training for Efficient Privacy-Preserving Machine Learning Oct 2, 2023 Model Compression Privacy Preserving
— Unverified 0Data-Driven Compression of Convolutional Neural Networks Nov 28, 2019 Knowledge Distillation Model Compression
— Unverified 0A Unified Knowledge Distillation Framework for Deep Directed Graphical Models Sep 29, 2021 Continual Learning Federated Learning
— Unverified 0DarkRank: Accelerating Deep Metric Learning via Cross Sample Similarities Transfer Jul 5, 2017 Clustering Image Clustering
— Unverified 0DARC: Differentiable ARchitecture Compression May 20, 2019 image-classification Image Classification
— Unverified 0A Unified Framework of DNN Weight Pruning and Weight Clustering/Quantization Using ADMM Nov 5, 2018 Clustering Model Compression
— Unverified 0Aligned Weight Regularizers for Pruning Pretrained Neural Networks Apr 4, 2022 Language Modelling Model Compression
— Unverified 0DARB: A Density-Aware Regular-Block Pruning for Deep Neural Networks Nov 19, 2019 Model Compression Network Pruning
— Unverified 0D^2MoE: Dual Routing and Dynamic Scheduling for Efficient On-Device MoE-based LLM Serving Apr 17, 2025 Mixture-of-Experts Model Compression
— Unverified 0A Unified Approximation Framework for Compressing and Accelerating Deep Neural Networks Jul 26, 2018 General Classification image-classification
— Unverified 0CURing Large Models: Compression via CUR Decomposition Jan 8, 2025 Model Compression
— Unverified 0CSTAR: Towards Compact and STructured Deep Neural Networks with Adversarial Robustness Dec 4, 2022 Adversarial Robustness Model Compression
— Unverified 0Augmenting Knowledge Distillation With Peer-To-Peer Mutual Learning For Model Compression Oct 21, 2021 Knowledge Distillation Model Compression
— Unverified 0Artificial Neural Networks for Photonic Applications: From Algorithms to Implementation Aug 2, 2024 Model Compression
— Unverified 0CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression Oct 10, 2024 Language Modeling Language Modelling
— Unverified 0Deep Face Recognition Model Compression via Knowledge Transfer and Distillation Jun 3, 2019 Face Recognition Knowledge Distillation
— Unverified 0From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs Apr 18, 2025 Knowledge Distillation Model Compression
— Unverified 0Cross Domain Model Compression by Structurally Weight Sharing Jun 1, 2019 Action Recognition Graph Embedding
— Unverified 0Inferring ECG from PPG for Continuous Cardiac Monitoring Using Lightweight Neural Network Dec 9, 2020 Model Compression
— Unverified 0From Word Vectors to Multimodal Embeddings: Techniques, Applications, and Future Directions For Large Language Models Nov 6, 2024 Model Compression Sentence
— Unverified 0Cross-Channel Intragroup Sparsity Neural Network Oct 26, 2019 Model Compression Network Pruning
— Unverified 0Croesus: Multi-Stage Processing and Transactions for Video-Analytics in Edge-Cloud Systems Dec 31, 2021 Model Compression object-detection
— Unverified 0Attention Sinks and Outlier Features: A 'Catch, Tag, and Release' Mechanism for Embeddings Feb 2, 2025 Model Compression TAG
— Unverified 0Creating Lightweight Object Detectors with Model Compression for Deployment on Edge Devices May 6, 2019 Knowledge Distillation Model Compression
— Unverified 0CPTQuant -- A Novel Mixed Precision Post-Training Quantization Techniques for Large Language Models Dec 3, 2024 Language Modeling Language Modelling
— Unverified 0ALF: Autoencoder-based Low-rank Filter-sharing for Efficient Convolutional Neural Networks Jul 27, 2020 Model Compression
— Unverified 0AACP: Model Compression by Accurate and Automatic Channel Pruning Jan 31, 2021 Model Compression Neural Architecture Search
— Unverified 0Frustratingly Easy Model Ensemble for Abstractive Summarization Oct 1, 2018 Abstractive Text Summarization Density Estimation
— Unverified 0FSCNN: A Fast Sparse Convolution Neural Network Inference System Dec 17, 2022 Model Compression
— Unverified 0Atrial Fibrillation Detection Using Weight-Pruned, Log-Quantised Convolutional Neural Networks Jun 14, 2022 Atrial Fibrillation Detection Model Compression
— Unverified 0âLearning-Compressionâ Algorithms for Neural Net Pruning Jun 1, 2018 Model Compression Network Pruning
— Unverified 0Integrating Fairness and Model Pruning Through Bi-level Optimization Dec 15, 2023 Fairness Model Compression
— Unverified 0CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction Dec 23, 2024 3DGS GPU
— Unverified 0Atomic Compression Networks Sep 25, 2019 Model Compression
— Unverified 0Fragile Mastery: Are Domain-Specific Trade-Offs Undermining On-Device Language Models? Mar 16, 2025 Model Compression Raspberry Pi 4
— Unverified 0Atleus: Accelerating Transformers on the Edge Enabled by 3D Heterogeneous Manycore Architectures Jan 16, 2025 Model Compression Quantization
— Unverified 0Cosine Similarity Knowledge Distillation for Individual Class Information Transfer Nov 24, 2023 Knowledge Distillation Model Compression
— Unverified 0Spike-and-slab shrinkage priors for structurally sparse Bayesian neural networks Aug 17, 2023 Computational Efficiency Model Compression
— Unverified 0CORSD: Class-Oriented Relational Self Distillation Apr 28, 2023 Knowledge Distillation Model Compression
— Unverified 0A Theoretical Understanding of Neural Network Compression from Sparse Linear Approximation Jun 11, 2022 Model Compression Neural Network Compression
— Unverified 0A Half-Space Stochastic Projected Gradient Method for Group Sparsity Regularization Jan 1, 2021 compressed sensing feature selection
— Unverified 0