Knowledge Distillation with Multi-granularity Mixture of Priors for Image Super-Resolution Apr 3, 2024 Image Super-Resolution Knowledge Distillation
— Unverified 00 A Short Study on Compressing Decoder-Based Language Models Oct 16, 2021 Decoder Knowledge Distillation
— Unverified 00 Representative Teacher Keys for Knowledge Distillation Model Compression Based on Attention Mechanism for Image Classification Jun 26, 2022 GPU image-classification
— Unverified 00 The Lottery LLM Hypothesis, Rethinking What Abilities Should LLM Compression Preserve? Feb 24, 2025 Arithmetic Reasoning Common Sense Reasoning
— Unverified 00 Accelerating Machine Learning Primitives on Commodity Hardware Oct 8, 2023 CPU Model Compression
— Unverified 00 ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization Nov 12, 2024 Language Modeling Language Modelling
— Unverified 00 Theoretical Guarantees for Low-Rank Compression of Deep Neural Networks Feb 4, 2025 Low-rank compression Model Compression
— Unverified 00 Know What You Don't Need: Single-Shot Meta-Pruning for Attention Heads Nov 7, 2020 Informativeness Meta-Learning
— Unverified 00 KroneckerBERT: Learning Kronecker Decomposition for Pre-trained Language Models via Knowledge Distillation Sep 13, 2021 Knowledge Distillation Language Modeling
— Unverified 00 KroneckerBERT: Significant Compression of Pre-trained Language Models Through Kronecker Decomposition and Knowledge Distillation Jul 1, 2022 Knowledge Distillation Language Modeling
— Unverified 00 Kronecker Decomposition for GPT Compression Oct 15, 2021 Knowledge Distillation Language Modeling
— Unverified 00 L4Q: Parameter Efficient Quantization-Aware Fine-Tuning on Large Language Models Feb 7, 2024 Few-Shot Learning In-Context Learning
— Unverified 00 LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression Apr 8, 2020 Blocking Knowledge Distillation
— Unverified 00 Language model compression with weighted low-rank factorization Jun 30, 2022 Language Modeling Language Modelling
— Unverified 00 The Potential of AutoML for Recommender Systems Feb 6, 2024 AutoML Machine Translation
— Unverified 00 Large Language Model Compression via the Nested Activation-Aware Decomposition Mar 21, 2025 Language Modeling Language Modelling
— Unverified 00 Wasserstein Contrastive Representation Distillation Dec 15, 2020 Contrastive Learning Knowledge Distillation
— Unverified 00 Large receptive field strategy and important feature extraction strategy in 3D object detection Jan 22, 2024 3D Object Detection Autonomous Driving
— Unverified 00 Large-Scale Generative Data-Free Distillation Dec 10, 2020 Knowledge Distillation Model Compression
— Unverified 00 LatentLLM: Attention-Aware Joint Tensor Compression May 23, 2025 Model Compression Tensor Decomposition
— Unverified 00 LayerCollapse: Adaptive compression of neural networks Nov 29, 2023 Computational Efficiency image-classification
— Unverified 00 Layer-specific Optimization for Mixed Data Flow with Mixed Precision in FPGA Design for CNN-based Object Detectors Sep 3, 2020 Bayesian Optimization Model Compression
— Unverified 00 LCQ: Low-Rank Codebook based Quantization for Large Language Models May 31, 2024 Model Compression Quantization
— Unverified 00 A Selective Survey on Versatile Knowledge Distillation Paradigm for Neural Network Models Nov 30, 2020 Knowledge Distillation Model Compression
— Unverified 00 A Scale Mixture Perspective of Multiplicative Noise in Neural Networks Jun 10, 2015 Model Compression
— Unverified 00 Watermarking Graph Neural Networks by Random Graphs Nov 1, 2020 Graph Neural Network Model Compression
— Unverified 00 Learning a Neural Diff for Speech Models Aug 3, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 00 Learning-Based Symbol Level Precoding: A Memory-Efficient Unsupervised Learning Approach Nov 15, 2021 Model Compression
— Unverified 00 Learning Compressed Embeddings for On-Device Inference Mar 18, 2022 Model Compression Recommendation Systems
— Unverified 00 Accelerating Linear Recurrent Neural Networks for the Edge with Unstructured Sparsity Feb 3, 2025 Audio Denoising Denoising
— Unverified 00 Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization Jun 16, 2022 Language Modeling Language Modelling
— Unverified 00 Learning Disentangled Representation with Mutual Information Maximization for Real-Time UAV Tracking Aug 20, 2023 CPU Model Compression
— Unverified 00 Three Dimensional Convolutional Neural Network Pruning with Regularization-Based Method Nov 19, 2018 Model Compression Network Pruning
— Unverified 00 Learning Efficient Image Super-Resolution Networks via Structure-Regularized Pruning Sep 29, 2021 Image Super-Resolution Knowledge Distillation
— Unverified 00 Learning Efficient Object Detection Models with Knowledge Distillation Dec 1, 2017 Knowledge Distillation Model Compression
— Unverified 00 ASCAI: Adaptive Sampling for acquiring Compact AI Nov 15, 2019 Model Compression Reinforcement Learning
— Unverified 00 Learning by Sampling and Compressing: Efficient Graph Representation Learning with Extremely Limited Annotations Mar 13, 2020 Graph Embedding Graph Representation Learning
— Unverified 00 MixMix: All You Need for Data-Free Compression Are Feature and Data Mixing Nov 19, 2020 All Knowledge Distillation
— Unverified 00 Learning Interpretation with Explainable Knowledge Distillation Nov 12, 2021 Knowledge Distillation Model Compression
— Unverified 00 WeClick: Weakly-Supervised Video Semantic Segmentation with Click Annotations Jul 7, 2021 Knowledge Distillation Model Compression
— Unverified 00 Learning Low-Rank Approximation for CNNs May 24, 2019 Model Compression
— Unverified 00 Learning Low-Rank Representations for Model Compression Nov 21, 2022 Clustering model
— Unverified 00 Model Compression Method for S4 with Diagonal State Space Layers using Balanced Truncation Feb 25, 2024 Model Compression
— Unverified 00 Artemis: HE-Aware Training for Efficient Privacy-Preserving Machine Learning Oct 2, 2023 Model Compression Privacy Preserving
— Unverified 00 Learning to Collide: Recommendation System Model Compression with Learned Hash Functions Mar 28, 2022 Model Compression
— Unverified 00 Learning to Prune Deep Neural Networks via Reinforcement Learning Jul 9, 2020 Deep Reinforcement Learning Model Compression
— Unverified 00 Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models Oct 24, 2022 Knowledge Distillation Model Compression
— Unverified 00 LegoDNN: Block-grained Scaling of Deep Neural Networks for Mobile Vision Dec 18, 2021 Knowledge Distillation Model Compression
— Unverified 00 Tight Compression: Compressing CNN Through Fine-Grained Pruning and Weight Permutation for Efficient Implementation Apr 3, 2021 Model Compression
— Unverified 00 Are We There Yet? A Measurement Study of Efficiency for LLM Applications on Mobile Devices Mar 10, 2025 CPU GPU
— Unverified 00