Harnessing Large Language Models Locally: Empirical Results and Implications for AI PC May 21, 2025 CPU Quantization
Code Code Available 05 A Mean Field Theory of Quantized Deep Networks: The Quantization-Depth Trade-Off Jun 3, 2019 Quantization
Code Code Available 05 Hardware Acceleration for Real-Time Wildfire Detection Onboard Drone Networks Jan 16, 2024 Classification image-classification
Code Code Available 05 DQRM: Deep Quantized Recommendation Models Oct 26, 2024 Quantization
Code Code Available 05 HERO: Hessian-Enhanced Robust Optimization for Unifying and Improving Generalization and Quantization Performance Nov 23, 2021 Quantization
Code Code Available 05 Deep Triplet Quantization Feb 1, 2019 Deep Hashing Image Retrieval
Code Code Available 05 Mirror Descent View for Neural Network Quantization Oct 18, 2019 Quantization valid
Code Code Available 05 GT-SVQ: A Linear-Time Graph Transformer for Node Classification Using Spiking Vector Quantization Apr 16, 2025 Graph Learning Graph Representation Learning
Code Code Available 05 Deep Task-Based Analog-to-Digital Conversion Jan 29, 2022 Meta-Learning Quantization
Code Code Available 05 GSB: Group Superposition Binarization for Vision Transformer with Limited Training Samples May 13, 2023 Binarization Knowledge Distillation
Code Code Available 05 Mixed-Precision Quantization and Parallel Implementation of Multispectral Riemannian Classification for Brain--Machine Interfaces Feb 22, 2021 General Classification Motor Imagery
Code Code Available 05 Mixed-Precision Quantization for Deep Vision Models with Integer Quadratic Programming Jul 11, 2023 Quantization Sensitivity
Code Code Available 05 Guetzli: Perceptually Guided JPEG Encoder Mar 13, 2017 Perceptual Distance Quantization
Code Code Available 05 Bag of Tricks for Optimizing Transformer Efficiency Sep 9, 2021 CPU Decoder
Code Code Available 05 DeepShift: Towards Multiplication-Less Neural Networks May 30, 2019 Edge-computing GPU
Code Code Available 05 GraNNite: Enabling High-Performance Execution of Graph Neural Networks on Resource-Constrained Neural Processing Units Feb 10, 2025 Event-based vision Quantization
Code Code Available 05 GQFedWAvg: Optimization-Based Quantized Federated Learning in General Edge Computing Systems Jun 13, 2023 Edge-computing Federated Learning
Code Code Available 05 Deep reverse tone mapping Nov 20, 2017 inverse tone mapping Quantization
Code Code Available 05 Deep residual network for steganalysis of digital images Sep 23, 2018 Image Steganography Quantization
Code Code Available 05 Deep Recurrent Quantization for Generating Sequential Binary Codes Jun 16, 2019 Image Retrieval Quantization
Code Code Available 05 Goten: GPU-Outsourcing Trusted Execution of Neural Network Training and Prediction Sep 25, 2019 GPU Privacy Preserving
Code Code Available 05 Hardening DNNs against Transfer Attacks during Network Compression using Greedy Adversarial Pruning Jun 15, 2022 Adversarial Robustness Quantization
Code Code Available 05 Deep Priority Hashing Sep 4, 2018 Deep Hashing Image Retrieval
Code Code Available 05 Deep Optimized Multiple Description Image Coding via Scalar Quantization Learning Jan 12, 2020 Decoder Quantization
Code Code Available 05 Genie: Show Me the Data for Quantization Dec 9, 2022 Data Free Quantization Quantization
Code Code Available 05 General Point Model Pretraining with Autoencoding and Autoregressive Jan 1, 2024 Decoder Language Modeling
Code Code Available 05 Generalized Relevance Learning Grassmann Quantization Mar 14, 2024 Activity Recognition Face Recognition
Code Code Available 05 Generalized Learning Vector Quantization for Classification in Randomized Neural Networks and Hyperdimensional Computing Jun 17, 2021 BIG-bench Machine Learning Quantization
Code Code Available 05 A2Q+: Improving Accumulator-Aware Weight Quantization Jan 19, 2024 Quantization
Code Code Available 05 Deep Neural Network for Respiratory Sound Classification in Wearable Devices Enabled by Patient Specific Model Tuning Apr 16, 2020 Anomaly Detection General Classification
Code Code Available 05 Deep Neural Network Compression with Single and Multiple Level Quantization Mar 6, 2018 Neural Network Compression Quantization
Code Code Available 05 Exploring the Trade-Offs: Quantization Methods, Task Difficulty, and Model Size in Large Language Models From Edge to Giant Sep 17, 2024 Hallucination Instruction Following
Code Code Available 05 FTT-NAS: Discovering Fault-Tolerant Convolutional Neural Architecture Mar 20, 2020 Neural Architecture Search Quantization
Code Code Available 05 A LoRA-Based Approach to Fine-Tuning LLMs for Educational Guidance in Resource-Constrained Settings Apr 22, 2025 Computational Efficiency GPU
Code Code Available 05 Deep Metric Learning to Rank Jun 1, 2019 Image Retrieval Learning-To-Rank
Code Code Available 05 EAQuant: Enhancing Post-Training Quantization for MoE Models via Expert-Aware Optimization Jun 16, 2025 Mixture-of-Experts Model Compression
Code Code Available 05 GANQ: GPU-Adaptive Non-Uniform Quantization for Large Language Models Jan 22, 2025 GPU Quantization
Code Code Available 05 Deep Log-Likelihood Ratio Quantization Mar 11, 2019 Decoder Quantization
Code Code Available 05 All You Need is a Few Shifts: Designing Efficient Convolutional Neural Networks for Image Classification Mar 13, 2019 All General Classification
Code Code Available 05 Deep Learning with Low Precision by Half-wave Gaussian Quantization Feb 3, 2017 Deep Learning Quantization
Code Code Available 05 Deep Learning Models in Speech Recognition: Measuring GPU Energy Consumption, Impact of Noise and Model Quantization for Edge Deployment May 2, 2024 GPU NVIDIA Jetson Orin Nano
Code Code Available 05 FPQVAR: Floating Point Quantization for Visual Autoregressive Model with FPGA Hardware Co-design May 22, 2025 GPU Image Generation
Code Code Available 05 A Comprehensive Evaluation of Quantization Strategies for Large Language Models Feb 26, 2024 Language Modeling Language Modelling
Code Code Available 05 ECQ^x: Explainability-Driven Quantization for Low-Bit and Sparse DNNs Sep 9, 2021 Explainable Artificial Intelligence (XAI) Quantization
Code Code Available 05 Foundations of Large Language Model Compression -- Part 1: Weight Quantization Sep 3, 2024 Language Modeling Language Modelling
Code Code Available 05 FP4DiT: Towards Effective Floating Point Quantization for Diffusion Transformers Mar 19, 2025 Image Generation Quantization
Code Code Available 05 Neural Network Assisted Lifting Steps For Improved Fully Scalable Lossy Image Compression in JPEG 2000 Mar 4, 2024 Image Compression Quantization
Code Code Available 05 FLoCoRA: Federated learning compression with low-rank adaptation Jun 20, 2024 Federated Learning Model Compression
Code Code Available 05 Deep Learning-Based Quantization of L-Values for Gray-Coded Modulation Jun 18, 2019 Quantization
Code Code Available 05 Floating-Point Quantization Analysis of Multi-Layer Perceptron Artificial Neural Networks Mar 18, 2024 Quantization
Code Code Available 05