Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge Mar 12, 2025 CPU GPU
— Unverified 0SpaLLM: Unified Compressive Adaptation of Large Language Models with Sketching Oct 8, 2024 Model Compression Natural Language Understanding
— Unverified 0Sparse Deep Learning for Time Series Data: Theory and Applications Oct 5, 2023 Deep Learning Model Compression
— Unverified 0Sparse Unbalanced GAN Training with In-Time Over-Parameterization Sep 29, 2021 Model Compression
— Unverified 0Spatio-Temporal Pruning and Quantization for Low-latency Spiking Neural Networks Apr 26, 2021 Model Compression Quantization
— Unverified 0Compressible Spectral Mixture Kernels with Sparse Dependency Structures for Gaussian Processes Aug 1, 2018 Gaussian Processes Model Compression
— Unverified 0Spectral Pruning: Compressing Deep Neural Networks via Spectral Analysis and its Generalization Error Aug 26, 2018 Edge-computing Learning Theory
— Unverified 0Speeding up Convolutional Neural Networks with Low Rank Expansions May 15, 2014 CPU GPU
— Unverified 0Speeding Up Image Classifiers with Little Companions Jun 24, 2024 image-classification Image Classification
— Unverified 0Spirit Distillation: A Model Compression Method with Multi-domain Knowledge Transfer Apr 29, 2021 General Knowledge Knowledge Distillation
— Unverified 0Sponge Attacks on Sensing AI: Energy-Latency Vulnerabilities and Defense via Model Pruning May 9, 2025 Model Compression
— Unverified 0SS-Auto: A Single-Shot, Automatic Structured Weight Pruning Framework of DNNs with Ultra-High Efficiency Jan 23, 2020 Model Compression
— Unverified 0Stability Based Filter Pruning for Accelerating Deep CNNs Nov 20, 2018 GPU Model Compression
— Unverified 0Effective Model Compression via Stage-wise Pruning Nov 10, 2020 model Model Compression
— Unverified 0Statistical Model Compression for Small-Footprint Natural Language Understanding Jul 19, 2018 Model Compression Natural Language Understanding
— Unverified 0Strategic Fusion Optimizes Transformer Compression Jan 5, 2025 Knowledge Distillation Model Compression
— Unverified 0Streamlining Tensor and Network Pruning in PyTorch Apr 28, 2020 Model Compression Network Pruning
— Unverified 0Structured Bayesian Compression for Deep Neural Networks Based on The Turbo-VBI Approach Feb 21, 2023 Model Compression
— Unverified 0Structured Compression by Weight Encryption for Unstructured Pruning and Quantization May 24, 2019 Model Compression Quantization
— Unverified 0Structured Convolutions for Efficient Neural Network Design Aug 6, 2020 Efficient Neural Network Image Classification
— Unverified 0Structured Model Pruning for Efficient Inference in Computational Pathology Apr 12, 2024 Instance Segmentation Model Compression
— Unverified 0Structured Multi-Hashing for Model Compression Nov 25, 2019 model Model Compression
— Unverified 0Structured Pruning for Multi-Task Deep Neural Networks Apr 13, 2023 Model Compression
— Unverified 0Structured Pruning is All You Need for Pruning CNNs at Initialization Mar 4, 2022 All Model Compression
— Unverified 0Structured Pruning Learns Compact and Accurate Models Nov 16, 2021 Model Compression
— Unverified 0SubCharacter Chinese-English Neural Machine Translation with Wubi encoding Nov 7, 2019 Machine Translation Model Compression
— Unverified 0Sub-network Multi-objective Evolutionary Algorithm for Filter Pruning Oct 22, 2022 Combinatorial Optimization Evolutionary Algorithms
— Unverified 0Surrogate Lagrangian Relaxation: A Path To Retrain-free Deep Neural Network Pruning Apr 8, 2023 image-classification Image Classification
— Unverified 0Survey of Dropout Methods for Deep Neural Networks Apr 25, 2019 Model Compression Survey
— Unverified 0Swallowing the Poison Pills: Insights from Vulnerability Disparity Among LLMs Feb 23, 2025 Data Poisoning Diagnostic
— Unverified 0SwapNet: Efficient Swapping for DNN Inference on Edge AI Devices Beyond the Memory Budget Jan 30, 2024 GPU Model Compression
— Unverified 0Sweeping Heterogeneity with Smart MoPs: Mixture of Prompts for LLM Task Adaptation Oct 4, 2023 Model Compression Text Summarization
— Unverified 0Swing Distillation: A Privacy-Preserving Knowledge Distillation Framework Dec 16, 2022 Knowledge Distillation Model Compression
— Unverified 0SWITCH: Studying with Teacher for Knowledge Distillation of Large Language Models Oct 25, 2024 Instruction Following Knowledge Distillation
— Unverified 0SWSC: Shared Weight for Similar Channel in LLM Jan 15, 2025 Model Compression
— Unverified 0Synergistic Effects of Knowledge Distillation and Structured Pruning for Self-Supervised Speech Models Feb 9, 2025 Knowledge Distillation Model Compression
— Unverified 0Introducing Pose Consistency and Warp-Alignment for Self-Supervised 6D Object Pose Estimation in Color Images Mar 27, 2020 6D Pose Estimation using RGB Domain Adaptation
— Unverified 0TAID: Temporally Adaptive Interpolated Distillation for Efficient Knowledge Transfer in Language Models Jan 28, 2025 Knowledge Distillation Model Compression
— Unverified 0TaQ-DiT: Time-aware Quantization for Diffusion Transformers Nov 21, 2024 Denoising Model Compression
— Unverified 0Task-Agnostic and Adaptive-Size BERT Compression Jan 1, 2021 Language Modelling Model Compression
— Unverified 0Task-Agnostic Structured Pruning of Speech Representation Models Jun 2, 2023 Model Compression
— Unverified 0Diffusion Model Compression for Image-to-Image Translation Jan 31, 2024 Conditional Image Generation Denoising
— Unverified 0Temporal Action Detection Model Compression by Progressive Block Drop Mar 21, 2025 Action Detection Autonomous Driving
— Unverified 0Tensor Contraction Layers for Parsimonious Deep Nets Jun 1, 2017 Model Compression
— Unverified 0TensorGPT: Efficient Compression of Large Language Models based on Tensor-Train Decomposition Jul 2, 2023 Model Compression
— Unverified 0Tensorial Neural Networks: Generalization of Neural Networks and Application to Model Compression May 25, 2018 Model Compression tensor algebra
— Unverified 0Tensorization is a powerful but underexplored tool for compression and interpretability of neural networks May 26, 2025 Deep Learning Model Compression
— Unverified 0Test-Time Adaptation Toward Personalized Speech Enhancement: Zero-Shot Learning with Knowledge Distillation May 8, 2021 Denoising Knowledge Distillation
— Unverified 0Tetra-AML: Automatic Machine Learning via Tensor Networks Mar 28, 2023 Bayesian Optimization Hyperparameter Optimization
— Unverified 0TextPruner: A Model Pruning Toolkit for Pre-Trained Language Models Mar 30, 2022 Model Compression
— Unverified 0