Achieving binary weight and activation for LLMs using Post-Training Quantization Apr 7, 2025 Quantization
— Unverified 0Decomposing Normal and Abnormal Features of Medical Images into Discrete Latent Codes for Content-Based Image Retrieval Mar 23, 2021 Anatomy Content-Based Image Retrieval
— Unverified 0A System-Level Solution for Low-Power Object Detection Sep 24, 2019 CPU Object
— Unverified 0Asynchronous Federated Learning with Bidirectional Quantized Communications and Buffered Aggregation Aug 1, 2023 Federated Learning Quantization
— Unverified 0A Channelized Binning Method for Extraction of Dominant Color Pixel Value May 28, 2016 Quantization
— Unverified 0Asymptotic Unbiased Sample Sampling to Speed Up Sharpness-Aware Minimization Jun 12, 2024 Computational Efficiency Pose Estimation
— Unverified 0Asymptotic tracking control of dynamic reference over homomorphically encrypted data with finite modulus Sep 27, 2024 Quantization
— Unverified 0AgileIR: Memory-Efficient Group Shifted Windows Attention for Agile Image Restoration Sep 10, 2024 Image Restoration Quantization
— Unverified 04-bit Quantization of LSTM-based Speech Recognition Models Aug 27, 2021 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0Asymptotic stabilization under homomorphic encryption: A re-encryption free method Apr 12, 2025 Quantization
— Unverified 0Asymptotic Performance Analysis of Large-Scale Active IRS-Aided Wireless Network May 31, 2023 Quantization
— Unverified 0Aggressive Post-Training Compression on Extremely Large Language Models Sep 30, 2024 Model Compression Network Pruning
— Unverified 0Asymptotic Analysis of One-bit Quantized Box-Constrained Precoding in Large-Scale Multi-User Systems Feb 5, 2025 Quantization
— Unverified 0Asymptotically Optimal Closed-Form Phase Configuration of 1-bit RISs via Sign Alignment Jul 18, 2024 Form Quantization
— Unverified 0Aggregating empirical evidence from data strategy studies: a case on model quantization May 1, 2025 GPU Quantization
— Unverified 0Accurate Sine-Wave Amplitude Measurements Using Nonlinearly Quantized Data Apr 28, 2018 Quantization
— Unverified 0Inference Optimizations for Large Language Models: Effects, Challenges, and Practical Considerations Aug 6, 2024 Knowledge Distillation Navigate
— Unverified 0DeCoR: Defy Knowledge Forgetting by Predicting Earlier Audio Codes May 29, 2023 Acoustic Scene Classification Continual Learning
— Unverified 0Asymmetric Learning Vector Quantization for Efficient Nearest Neighbor Classification in Dynamic Time Warping Spaces Mar 24, 2017 Classification Dynamic Time Warping
— Unverified 0Asymmetric Learned Image Compression with Multi-Scale Residual Block, Importance Map, and Post-Quantization Filtering Jun 21, 2022 Decoder Image Compression
— Unverified 0Aggregated Learning: A Deep Learning Framework Based on Information-Bottleneck Vector Quantization Jul 26, 2018 Image Classification Quantization
— Unverified 0Asymmetric Deep Semantic Quantization for Image Retrieval Mar 29, 2019 Image Retrieval Quantization
— Unverified 0Asymmetric Correlation Quantization Hashing for Cross-modal Retrieval Jan 14, 2020 Cross-Modal Retrieval Quantization
— Unverified 0L3iTC at the FinLLM Challenge Task: Quantization for Financial Text Classification & Summarization Aug 6, 2024 GPU Quantization
— Unverified 0AsymKV: Enabling 1-Bit Quantization of KV Cache with Layer-Wise Asymmetric Quantization Configurations Oct 17, 2024 Decoder Quantization
— Unverified 0A Generalized Zero-Shot Quantization of Deep Convolutional Neural Networks via Learned Weights Statistics Dec 6, 2021 Quantization
— Unverified 0A Survey on Transformer Compression Feb 5, 2024 Knowledge Distillation Mamba
— Unverified 0A Survey on Model Compression for Large Language Models Aug 15, 2023 Benchmarking Knowledge Distillation
— Unverified 0A General Family of Stochastic Proximal Gradient Methods for Deep Learning Jul 15, 2020 Quantization
— Unverified 0Accurate INT8 Training Through Dynamic Block-Level Fallback Mar 11, 2025 Quantization
— Unverified 0A Survey on Methods and Theories of Quantized Neural Networks Aug 13, 2018 Quantization speech-recognition
— Unverified 0A Survey on Learning to Hash Jun 1, 2016 Quantization Survey
— Unverified 0A General Error-Theoretical Analysis Framework for Constructing Compression Strategies Feb 19, 2025 Quantization
— Unverified 0Accurate Deep Representation Quantization with Gradient Snapping Layer for Similarity Search Oct 30, 2016 Quantization
— Unverified 0A survey on efficient vision transformers: algorithms, techniques, and performance benchmarking Sep 5, 2023 Benchmarking Knowledge Distillation
— Unverified 0A Survey on Deep Hashing Methods Mar 4, 2020 Deep Hashing Domain Adaptation
— Unverified 0A Formalization of Image Vectorization by Region Merging Sep 24, 2024 Image Segmentation Quantization
— Unverified 0A Survey of Techniques for Optimizing Transformer Inference Jul 16, 2023 Knowledge Distillation Neural Architecture Search
— Unverified 0A Survey of Small Language Models Oct 25, 2024 Benchmarking Model Compression
— Unverified 0A Flexible, Extensible Software Framework for Neural Net Compression Oct 20, 2018 Quantization
— Unverified 01-bit Quantized On-chip Hybrid Diffraction Neural Network Enabled by Authentic All-optical Fully-connected Architecture Apr 11, 2024 All Lesion Detection
— Unverified 0Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression Mar 18, 2024 Ethics Fairness
— Unverified 0Decoupled Greedy Learning of CNNs for Synchronous and Asynchronous Distributed Learning Jun 11, 2021 image-classification Image Classification
— Unverified 0A Survey of Quantization Methods for Efficient Neural Network Inference Mar 25, 2021 Efficient Neural Network Quantization
— Unverified 0A Survey of Model Compression and Acceleration for Deep Neural Networks Oct 23, 2017 Benchmarking Knowledge Distillation
— Unverified 0A flexible, extensible software framework for model compression based on the LC algorithm May 15, 2020 BIG-bench Machine Learning Low-rank compression
— Unverified 0A Survey of Methods for Low-Power Deep Learning and Computer Vision Mar 24, 2020 Knowledge Distillation Quantization
— Unverified 0A Survey of Low-bit Large Language Models: Basics, Systems, and Algorithms Sep 25, 2024 Quantization
— Unverified 0Accurate Compression of Text-to-Image Diffusion Models via Vector Quantization Aug 31, 2024 Image Generation Quantization
— Unverified 0A Study on Unsupervised Dictionary Learning and Feature Encoding for Action Classification Sep 2, 2013 Action Classification Dictionary Learning
— Unverified 0