Efficiency optimization of large-scale language models based on deep learning in natural language processing tasks May 20, 2024 Inference Optimization Knowledge Distillation
— Unverified 0Densely Distilling Cumulative Knowledge for Continual Learning May 16, 2024 All Continual Learning
— Unverified 0AdaKD: Dynamic Knowledge Distillation of ASR models using Adaptive Loss Weighting May 11, 2024 Knowledge Distillation Model Compression
— Unverified 0Characterizing the Accuracy -- Efficiency Trade-off of Low-rank Decomposition in Language Models May 10, 2024 AI Agent Model Compression
— Unverified 0From Algorithm to Hardware: A Survey on Efficient and Safe Deployment of Deep Neural Networks May 9, 2024 Knowledge Distillation Model Compression
— Unverified 0NurtureNet: A Multi-task Video-based Approach for Newborn Anthropometry May 9, 2024 Model Compression
— Unverified 0Light Field Compression Based on Implicit Neural Representation May 7, 2024 Model Compression
— Unverified 0Trio-ViT: Post-Training Quantization and Acceleration for Softmax-Free Efficient Vision Transformer May 6, 2024 Efficient ViTs Model Compression
Code Code Available 0Communication-Efficient Federated Learning with Adaptive Compression under Dynamic Bandwidth May 6, 2024 Federated Learning Model Compression
— Unverified 0Iterative Filter Pruning for Concatenation-based CNN Architectures May 4, 2024 Model Compression
Code Code Available 0Dependency-Aware Semi-Structured Sparsity of GLU Variants in Large Language Models May 3, 2024 Computational Efficiency Model Compression
— Unverified 0Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design May 2, 2024 Model Compression Neural Network Compression
Code Code Available 2FedGreen: Carbon-aware Federated Learning with Model Size Adaptation Apr 23, 2024 Federated Learning Model Compression
— Unverified 0Rapid Deployment of DNNs for Edge Computing via Structured Pruning at Initialization Apr 22, 2024 Edge-computing Model Compression
— Unverified 0Data-free Knowledge Distillation for Fine-grained Visual Categorization Apr 18, 2024 Data-free Knowledge Distillation Fine-Grained Visual Categorization
Code Code Available 0Understanding the Performance Horizon of the Latest ML Workloads with NonGEMM Workloads Apr 17, 2024 Model Compression
— Unverified 0Comprehensive Survey of Model Compression and Speed up for Vision Transformers Apr 16, 2024 Computational Efficiency Edge-computing
— Unverified 0Structured Model Pruning for Efficient Inference in Computational Pathology Apr 12, 2024 Instance Segmentation Model Compression
— Unverified 0Transferable and Principled Efficiency for Open-Vocabulary Segmentation Apr 11, 2024 Model Compression
Code Code Available 1Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing Apr 11, 2024 Model Compression Object
— Unverified 0Bayesian Federated Model Compression for Communication and Computation Efficiency Apr 11, 2024 Bayesian Inference Federated Learning
— Unverified 0Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind Apr 6, 2024 Model Compression
Code Code Available 0Improve Knowledge Distillation via Label Revision and Data Selection Apr 3, 2024 Knowledge Distillation Model Compression
— Unverified 0Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution Apr 3, 2024 Image Super-Resolution Knowledge Distillation
— Unverified 0On Linearizing Structured Data in Encoder-Decoder Language Models: Insights from Text-to-SQL Apr 3, 2024 Decoder Knowledge Graphs
— Unverified 0Automated Inference of Graph Transformation Rules Apr 3, 2024 Model Compression
— Unverified 0Enhancing Inference Efficiency of Large Language Models: Investigating Optimization Strategies and Architectural Innovations Apr 2, 2024 Model Compression
— Unverified 0Instance-Aware Group Quantization for Vision Transformers Apr 1, 2024 image-classification Image Classification
— Unverified 0Streamlining Redundant Layers to Compress Large Language Models Mar 28, 2024 Model Compression
Code Code Available 1Is Modularity Transferable? A Case Study through the Lens of Knowledge Distillation Mar 27, 2024 Domain Adaptation Knowledge Distillation
Code Code Available 0Dense Vision Transformer Compression with Few Samples Mar 27, 2024 Model Compression
— Unverified 0Are Compressed Language Models Less Subgroup Robust? Mar 26, 2024 Model Compression
Code Code Available 0Tiny Models are the Computational Saver for Large Models Mar 26, 2024 Computational Efficiency Image Classification
Code Code Available 0Order of Compression: A Systematic and Optimal Sequence to Combinationally Compress CNN Mar 26, 2024 Knowledge Distillation Model Compression
— Unverified 0Magic for the Age of Quantized DNNs Mar 22, 2024 Model Compression Quantization
— Unverified 0Advancing IIoT with Over-the-Air Federated Learning: The Role of Iterative Magnitude Pruning Mar 21, 2024 Federated Learning Model Compression
— Unverified 0DiPaCo: Distributed Path Composition Mar 15, 2024 Language Modelling Model Compression
— Unverified 0BRIEDGE: EEG-Adaptive Edge AI for Multi-Brain to Multi-Robot Interaction Mar 14, 2024 EEG Model Compression
— Unverified 0PYRA: Parallel Yielding Re-Activation for Training-Inference Efficient Task Adaptation Mar 14, 2024 Model Compression parameter-efficient fine-tuning
Code Code Available 1Adversarial Fine-tuning of Compressed Neural Networks for Joint Improvement of Robustness and Efficiency Mar 14, 2024 Adversarial Robustness Model Compression
Code Code Available 0SVD-LLM: Truncation-aware Singular Value Decomposition for Large Language Model Compression Mar 12, 2024 Language Modeling Language Modelling
Code Code Available 3Maxwell's Demon at Work: Efficient Pruning by Leveraging Saturation of Neurons Mar 12, 2024 Continual Learning Model Compression
— Unverified 0Enhanced Sparsification via Stimulative Training Mar 11, 2024 Knowledge Distillation Model Compression
— Unverified 0Bit-mask Robust Contrastive Knowledge Distillation for Unsupervised Semantic Hashing Mar 10, 2024 Image Retrieval Knowledge Distillation
Code Code Available 1Optimal Policy Sparsification and Low Rank Decomposition for Deep Reinforcement Learning Mar 10, 2024 Deep Reinforcement Learning Edge-computing
— Unverified 0Towards efficient deep autoencoders for multivariate time series anomaly detection Mar 4, 2024 Anomaly Detection Model Compression
— Unverified 0DyCE: Dynamically Configurable Exiting for Deep Learning Compression and Real-time Scaling Mar 4, 2024 image-classification Image Classification
Code Code Available 0"Lossless" Compression of Deep Neural Networks: A High-dimensional Neural Tangent Kernel Approach Mar 1, 2024 Model Compression Quantization
Code Code Available 1Differentially Private Knowledge Distillation via Synthetic Text Generation Mar 1, 2024 Knowledge Distillation Model Compression
Code Code Available 0PromptMM: Multi-Modal Knowledge Distillation for Recommendation with Prompt-Tuning Feb 27, 2024 Knowledge Distillation Model Compression
Code Code Available 2