A Survey on Model Compression for Large Language Models Aug 15, 2023 Benchmarking Knowledge Distillation
— Unverified 0EQ-Net: Elastic Quantization Neural Networks Aug 15, 2023 Quantization
Code Code Available 1AKVSR: Audio Knowledge Empowered Visual Speech Recognition by Compressing Audio Knowledge of a Pretrained Model Aug 15, 2023 Quantization speech-recognition
— Unverified 0Federated Classification in Hyperbolic Spaces via Secure Aggregation of Convex Hulls Aug 14, 2023 Federated Learning graph partitioning
Code Code Available 0Efficient Neural PDE-Solvers using Quantization Aware Training Aug 14, 2023 Quantization
— Unverified 0Unified Data-Free Compression: Pruning and Quantization without Fine-Tuning Aug 14, 2023 image-classification Image Classification
— Unverified 0Token-Scaled Logit Distillation for Ternary Weight Generative Language Models Aug 13, 2023 Arithmetic Reasoning Common Sense Reasoning
Code Code Available 1RMP-Loss: Regularizing Membrane Potential Distribution for Spiking Neural Networks Aug 13, 2023 Quantization
Code Code Available 1Sensitivity-Aware Mixed-Precision Quantization and Width Optimization of Deep Neural Networks Through Cluster-Based Tree-Structured Parzen Estimation Aug 12, 2023 Quantization Sensitivity
— Unverified 0Enhancing Generalization of Universal Adversarial Perturbation through Gradient Aggregation Aug 11, 2023 Quantization
Code Code Available 1NUPES : Non-Uniform Post-Training Quantization via Power Exponent Search Aug 10, 2023 Quantization
— Unverified 0ReLU and Addition-based Gated RNN Aug 10, 2023 CPU Handwritten Text Recognition
— Unverified 0FPGA Resource-aware Structured Pruning for Real-Time Neural Networks Aug 9, 2023 Classification image-classification
— Unverified 0Vector quantization loss analysis in VQGANs: a single-GPU ablation study for image-to-image synthesis Aug 9, 2023 GPU Image Generation
Code Code Available 0SAfER: Layer-Level Sensitivity Assessment for Efficient and Robust Neural Network Inference Aug 9, 2023 Autonomous Driving Quantization
— Unverified 0Exploring Frequency-Inspired Optimization in Transformer for Efficient Single Image Super-Resolution Aug 9, 2023 Image Super-Resolution Quantization
Code Code Available 1Quantization Aware Factorization for Deep Neural Network Compression Aug 8, 2023 Neural Network Compression Quantization
— Unverified 0EFaR 2023: Efficient Face Recognition Competition Aug 8, 2023 Face Recognition Lightweight Face Recognition
Code Code Available 1FLIQS: One-Shot Mixed-Precision Floating-Point and Integer Quantization Search Aug 7, 2023 Quantization
— Unverified 0Anonymizing Speech: Evaluating and Designing Speaker Anonymization Techniques Aug 5, 2023 Quantization Speaker anonymization
Code Code Available 1Frequency Disentangled Features in Neural Image Compression Aug 4, 2023 Disentanglement Image Compression
— Unverified 0Reducing Channel Estimation and Feedback Overhead in IRS-Aided Downlink System: A Quantize-then-Estimate Approach Aug 4, 2023 Quantization
— Unverified 0Communication-Efficient Decentralized Multi-Agent Reinforcement Learning for Cooperative Adaptive Cruise Control Aug 4, 2023 Autonomous Vehicles Multi-agent Reinforcement Learning
— Unverified 0RobustMQ: Benchmarking Robustness of Quantized Models Aug 4, 2023 Adversarial Robustness Benchmarking
— Unverified 0VQGraph: Rethinking Graph Representation Space for Bridging GNNs and MLPs Aug 4, 2023 Knowledge Distillation Quantization
Code Code Available 1Textless Unit-to-Unit training for Many-to-Many Multilingual Speech-to-Speech Translation Aug 3, 2023 Decoder Quantization
Code Code Available 1Bees Local Phase Quantization Feature Selection for RGB-D Facial Expressions Recognition Aug 3, 2023 feature selection Quantization
Code Code Available 0Improved Knowledge Distillation for Crowd Counting on IoT Device Aug 2, 2023 Crowd Counting Knowledge Distillation
Code Code Available 0Error Analysis of CORDIC Processor with FPGA Implementation Aug 2, 2023 Quantization
— Unverified 0Tango: rethinking quantization for graph neural network training on GPUs Aug 2, 2023 Graph Neural Network Quantization
— Unverified 0MRQ:Support Multiple Quantization Schemes through Model Re-Quantization Aug 1, 2023 model Quantization
— Unverified 0Asynchronous Federated Learning with Bidirectional Quantized Communications and Buffered Aggregation Aug 1, 2023 Federated Learning Quantization
— Unverified 0AQUILA: Communication Efficient Federated Learning with Adaptive Quantization in Device Selection Strategy Aug 1, 2023 Federated Learning Privacy Preserving
— Unverified 0Alternate Learning based Sparse Semantic Communications for Visual Transmission Jul 31, 2023 Quantization Semantic Communication
— Unverified 0Lightweight Super-Resolution Head for Human Pose Estimation Jul 31, 2023 Pose Estimation Quantization
Code Code Available 1Revisiting the Parameter Efficiency of Adapters from the Perspective of Precision Redundancy Jul 31, 2023 Quantization
Code Code Available 1BearingPGA-Net: A Lightweight and Deployable Bearing Fault Diagnosis Network via Decoupled Knowledge Distillation and FPGA Acceleration Jul 31, 2023 CPU Fault Diagnosis
Code Code Available 1METTS: Multilingual Emotional Text-to-Speech by Cross-speaker and Cross-lingual Emotion Transfer Jul 29, 2023 Disentanglement Diversity
— Unverified 0An Automata-Theoretic Approach to Synthesizing Binarized Neural Networks Jul 29, 2023 Fairness Quantization
— Unverified 0Incrementally-Computable Neural Networks: Efficient Inference for Dynamic Inputs Jul 27, 2023 Document Classification Knowledge Distillation
— Unverified 0QuIP: 2-Bit Quantization of Large Language Models With Guarantees Jul 25, 2023 Quantization
Code Code Available 2Overcoming Distribution Mismatch in Quantizing Image Super-Resolution Networks Jul 25, 2023 Image Classification Image Super-Resolution
Code Code Available 0High-Resolution Volumetric Reconstruction for Clothed Humans Jul 25, 2023 Quantization
— Unverified 0A Model for Every User and Budget: Label-Free and Personalized Mixed-Precision Quantization Jul 24, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 0Model Compression Methods for YOLOv5: A Review Jul 21, 2023 Knowledge Distillation model
— Unverified 0Communication-Efficient Federated Learning over Capacity-Limited Wireless Networks Jul 20, 2023 Federated Learning Quantization
— Unverified 0Quantized Feature Distillation for Network Quantization Jul 20, 2023 image-classification Image Classification
— Unverified 0Communication-Efficient Split Learning via Adaptive Feature-Wise Compression Jul 20, 2023 Quantization
— Unverified 0EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization Jul 20, 2023 Quantization
Code Code Available 1ZeroQuant-FP: A Leap Forward in LLMs Post-Training W4A8 Quantization Using Floating-Point Formats Jul 19, 2023 Computational Efficiency Quantization
— Unverified 0