Small Language Models: Architectures, Techniques, Evaluation, Problems and Future Adaptation May 26, 2025 Model Compression Quantization
Small Object Detection Based on Modified FSSD and Model Compression Aug 24, 2021 Model Compression object-detection
Smart Environmental Monitoring of Marine Pollution using Edge AI Apr 30, 2025 Edge-computing Model Compression
SmartExchange: Trading Higher-cost Memory Storage/Access for Lower-cost Computation May 7, 2020 Model Compression Quantization
Smooth Model Compression without Fine-Tuning May 30, 2025 model Model Compression
CrAFT: Compression-Aware Fine-Tuning for Efficient Visual Task Adaptation May 8, 2023 GPU Model Compression
Soft Labeling Affects Out-of-Distribution Detection of Deep Neural Networks Jul 7, 2020 Model Compression Out-of-Distribution Detection
Sometimes Painful but Certainly Promising: Feasibility and Trade-offs of Language Model Inference at the Edge Mar 12, 2025 CPU GPU
SpaLLM: Unified Compressive Adaptation of Large Language Models with Sketching Oct 8, 2024 Model Compression Natural Language Understanding
Sparse Deep Learning for Time Series Data: Theory and Applications Oct 5, 2023 Deep Learning Model Compression
AdaDeep: A Usage-Driven, Automated Deep Model Compression Framework for Enabling Ubiquitous Intelligent Mobiles Jun 8, 2020 Model Compression
Sparse Unbalanced GAN Training with In-Time Over-Parameterization Sep 29, 2021 Model Compression
Spatio-Temporal Pruning and Quantization for Low-latency Spiking Neural Networks Apr 26, 2021 Model Compression Quantization
Activation Sparsity Opportunities for Compressing General Large Language Models Dec 13, 2024 Model Compression
Compressible Spectral Mixture Kernels with Sparse Dependency Structures for Gaussian Processes Aug 1, 2018 Gaussian Processes Model Compression
Spectral Pruning: Compressing Deep Neural Networks via Spectral Analysis and its Generalization Error Aug 26, 2018 Edge-computing Learning Theory
Comprehensive Study on Performance Evaluation and Optimization of Model Compression: Bridging Traditional Deep Learning and Large Language Models Jul 22, 2024 Deep Learning image-classification
Comprehensive Survey of Model Compression and Speed up for Vision Transformers Apr 16, 2024 Computational Efficiency Edge-computing
Speeding up Convolutional Neural Networks with Low Rank Expansions May 15, 2014 CPU GPU
Compressed models are NOT miniature versions of large models Jul 18, 2024 Adversarial Attack Model Compression
Speeding Up Image Classifiers with Little Companions Jun 24, 2024 image-classification Image Classification
USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models Dec 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Compressing Cross-Lingual Multi-Task Models at Qualtrics Nov 29, 2022 Management Model Compression
Compressing Deep Convolutional Neural Networks by Stacking Low-dimensional Binary Convolution Filters Oct 6, 2020 Model Compression
Compressing Deep Neural Networks via Layer Fusion Jul 29, 2020 Exponential degradation Language Modelling
Compositionality Unlocks Deep Interpretable Models Apr 3, 2025 Model Compression Tensor Networks
Compressing Large-Scale Transformer-Based Models: A Case Study on BERT Feb 27, 2020 Model Compression
Compressing Low Precision Deep Neural Networks Using Sparsity-Induced Regularization in Ternary Networks Sep 19, 2017 L2 Regularization Model Compression
Compressing Pre-trained Language Models by Matrix Decomposition Dec 1, 2020 Model Compression
Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging Oct 1, 2024 Computational Efficiency Knowledge Distillation
Compressing Recurrent Neural Networks Using Hierarchical Tucker Tensor Decomposition May 9, 2020 Model Compression Tensor Decomposition
Spirit Distillation: A Model Compression Method with Multi-domain Knowledge Transfer Apr 29, 2021 General Knowledge Knowledge Distillation
Sponge Attacks on Sensing AI: Energy-Latency Vulnerabilities and Defense via Model Pruning May 9, 2025 Model Compression
CompMarkGS: Robust Watermarking for Compressed 3D Gaussian Splatting Mar 17, 2025 3DGS 3D Reconstruction
Compression and Localization in Reinforcement Learning for ATARI Games Apr 20, 2019 Atari Games Model Compression
Activation Map Adaptation for Effective Knowledge Distillation Oct 26, 2020 Knowledge Distillation Model Compression
Complexity-Driven CNN Compression for Resource-constrained Edge AI Aug 26, 2022 Computational Efficiency Model Compression
Compression for Better: A General and Stable Lossless Compression Framework Dec 9, 2024 Computational Efficiency Model Compression
Compression Laws for Large Language Models Apr 6, 2025 Model Compression
Compression of Deep Neural Networks by combining pruning and low rank decomposition Oct 20, 2018 Model Compression
Compression of Deep Neural Networks for Image Instance Retrieval Jan 18, 2017 Image Instance Retrieval Model Compression
Compression of Generative Pre-trained Language Models via Quantization Mar 21, 2022 Model Compression Quantization
Compacting Deep Neural Networks for Internet of Things: Methods and Applications Mar 20, 2021 Diversity Knowledge Distillation
Compress, Then Prompt: Improving Accuracy-Efficiency Trade-off of LLM Inference with Transferable Prompt May 17, 2023 GPU Model Compression
Compress then Serve: Serving Thousands of LoRA Adapters with Little Overhead Jun 17, 2024 GPU Model Compression
Computation-efficient Deep Learning for Computer Vision: A Survey Aug 27, 2023 Autonomous Vehicles Deep Learning
CompactifAI: Extreme Compression of Large Language Models using Quantum-Inspired Tensor Networks Jan 25, 2024 Model Compression Quantization
Activation Density based Mixed-Precision Quantization for Energy Efficient Neural Networks Jan 12, 2021 Model Compression Quantization
ConaCLIP: Exploring Distillation of Fully-Connected Knowledge Interaction Graph for Lightweight Text-Image Retrieval May 28, 2023 Image Retrieval Knowledge Distillation
Conditional Automated Channel Pruning for Deep Neural Networks Sep 21, 2020 Model Compression