Cross-Layer Optimization for Fault-Tolerant Deep Learning Dec 21, 2023 Bayesian Optimization Deep Learning
— Unverified 0TinySAM: Pushing the Envelope for Efficient Segment Anything Model Dec 21, 2023 Knowledge Distillation Quantization
Code Code Available 2Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity Dec 20, 2023 Federated Learning Personalized Federated Learning
Code Code Available 0Towards Efficient Verification of Quantized Neural Networks Dec 20, 2023 Heuristic Search Quantization
Code Code Available 0Mini-GPTs: Efficient Large Language Models through Contextual Pruning Dec 20, 2023 Articles Quantization
Code Code Available 1Find the Lady: Permutation and Re-Synchronization of Deep Neural Networks Dec 19, 2023 Quantization
Code Code Available 0Compact 3D Scene Representation via Self-Organizing Gaussian Grids Dec 19, 2023 3DGS
Code Code Available 3SimQ-NAS: Simultaneous Quantization Policy and Neural Architecture Search Dec 19, 2023 Neural Architecture Search Quantization
— Unverified 0Quantized Decoder in Learned Image Compression for Deterministic Reconstruction Dec 18, 2023 Decoder Image Compression
— Unverified 0Power-Efficient Sampling Dec 18, 2023 Quantization
— Unverified 0Post-Training Quantization for Re-parameterization via Coarse & Fine Weight Splitting Dec 17, 2023 Quantization
Code Code Available 0StyleSinger: Style Transfer for Out-of-Domain Singing Voice Synthesis Dec 17, 2023 Quantization Singing Voice Synthesis
Code Code Available 2SPT: Fine-Tuning Transformer-based Language Models Efficiently with Sparsification Dec 16, 2023 Quantization
Code Code Available 0Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference Dec 15, 2023 Quantization speech-recognition
Code Code Available 0IQNet: Image Quality Assessment Guided Just Noticeable Difference Prefiltering For Versatile Video Coding Dec 15, 2023 Image Quality Assessment Quantization
— Unverified 0ZeroQuant(4+2): Redefining LLMs Quantization with a New FP6-Centric Strategy for Diverse Generative Tasks Dec 14, 2023 Abstractive Text Summarization Code Generation
Code Code Available 2Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition Dec 14, 2023 Quantization Visual Place Recognition
— Unverified 0USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models Dec 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CBQ: Cross-Block Quantization for Large Language Models Dec 13, 2023 GPU Quantization
— Unverified 0When Bio-Inspired Computing meets Deep Learning: Low-Latency, Accurate, & Energy-Efficient Spiking Neural Networks from Artificial Neural Networks Dec 12, 2023 Quantization
— Unverified 0IDKM: Memory Efficient Neural Network Quantization via Implicit, Differentiable k-Means Dec 12, 2023 Efficient Neural Network Quantization
— Unverified 0Expand-and-Quantize: Unsupervised Semantic Segmentation Using High-Dimensional Space and Product Quantization Dec 12, 2023 Clustering Dimensionality Reduction
— Unverified 0Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills Dec 11, 2023 continuous-control Continuous Control
Code Code Available 0Neural Architecture Codesign for Fast Bragg Peak Analysis Dec 10, 2023 AutoML Model Compression
— Unverified 0FP8-BERT: Post-Training Quantization for Transformer Dec 10, 2023 Quantization
— Unverified 0QMGeo: Differentially Private Federated Learning via Stochastic Quantization with Mixed Truncated Geometric Distribution Dec 10, 2023 Federated Learning Quantization
— Unverified 0Automotive Radar Sensing with Sparse Linear Arrays Using One-Bit Hankel Matrix Completion Dec 9, 2023 Matrix Completion Quantization
— Unverified 0Understanding the Effect of Model Compression on Social Bias in Large Language Models Dec 9, 2023 Knowledge Distillation Model Compression
Code Code Available 0Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge Dec 9, 2023 Language Modeling Language Modelling
Code Code Available 0Efficient Quantization Strategies for Latent Diffusion Models Dec 9, 2023 Image Generation Quantization
— Unverified 0An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis Dec 8, 2023 Benchmarking Quantization
— Unverified 0GenQ: Quantization in Low Data Regimes with Generative Synthetic Data Dec 7, 2023 Computational Efficiency Quantization
Code Code Available 0Rate-splitting Multiple Access for Hierarchical HAP-LAP Networks under Limited Fronthaul Dec 7, 2023 Quantization
— Unverified 0SmoothQuant+: Accurate and Efficient 4-bit Post-Training WeightQuantization for LLM Dec 6, 2023 GPU Quantization
Code Code Available 1Does Vector Quantization Fail in Spatio-Temporal Forecasting? Exploring a Differentiable Sparse Soft-Vector Quantization Approach Dec 6, 2023 Attribute Computational Efficiency
Code Code Available 1Enhancing Kinship Verification through Multiscale Retinex and Combined Deep-Shallow features Dec 6, 2023 Kinship Verification Quantization
— Unverified 0All Rivers Run to the Sea: Private Learning with Asymmetric Flows Dec 5, 2023 All Quantization
— Unverified 0Unified learning-based lossy and lossless JPEG recompression Dec 5, 2023 Image Compression Quantization
— Unverified 0PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off Dec 4, 2023 Binarization Computational Efficiency
Code Code Available 0QuantAttack: Exploiting Dynamic Quantization to Attack Vision Transformers Dec 3, 2023 Quantization
Code Code Available 0Low-Precision Mixed-Computation Models for Inference on Edge Dec 3, 2023 Quantization
— Unverified 0Adaptive Resource Allocation for Semantic Communication Networks Dec 2, 2023 Deep Reinforcement Learning Quantization
— Unverified 0Physics Inspired Criterion for Pruning-Quantization Joint Learning Dec 1, 2023 image-classification Image Classification
Code Code Available 0The Cost of Compression: Investigating the Impact of Compression on Parametric Knowledge in Language Models Dec 1, 2023 Decoder Quantization
Code Code Available 0Improving the Robustness of Quantized Deep Neural Networks to White-Box Attacks using Stochastic Quantization and Information-Theoretic Ensemble Training Nov 30, 2023 Diversity Information Plane
— Unverified 0Routing-Guided Learned Product Quantization for Graph-Based Approximate Nearest Neighbor Search Nov 30, 2023 Quantization
Code Code Available 0A New Old Idea: Beam-Steering Reflectarrays for Efficient Sub-THz Multiuser MIMO Nov 30, 2023 3D geometry Quantization
— Unverified 0CompGS: Smaller and Faster Gaussian Splatting with Vector Quantization Nov 30, 2023 3DGS NeRF
Code Code Available 2Language Embedded 3D Gaussians for Open-Vocabulary Scene Understanding Nov 30, 2023 GPU Inductive Bias
Code Code Available 1Mixed-Precision Quantization for Federated Learning on Resource-Constrained Heterogeneous Devices Nov 29, 2023 Benchmarking Federated Learning
— Unverified 0