A Cost-Efficient FPGA Implementation of Tiny Transformer Model using Neural ODE Jan 5, 2024 CPU Edge-computing
— Unverified 0Enhancing Generalization of Invisible Facial Privacy Cloak via Gradient Accumulation Jan 3, 2024 Face Recognition Quantization
— Unverified 0Model-Free Learning for the Linear Quadratic Regulator over Rate-Limited Channels Jan 2, 2024 Quantization
— Unverified 0General Point Model Pretraining with Autoencoding and Autoregressive Jan 1, 2024 Decoder Language Modeling
Code Code Available 0Are Conventional SNNs Really Efficient? A Perspective from Network Quantization Jan 1, 2024 Fairness Quantization
— Unverified 0Reg-PTQ: Regression-specialized Post-training Quantization for Fully Quantized Object Detector Jan 1, 2024 Object object-detection
— Unverified 0Enhancing Post-training Quantization Calibration through Contrastive Learning Jan 1, 2024 Contrastive Learning Quantization
— Unverified 0PredToken: Predicting Unknown Tokens and Beyond with Coarse-to-Fine Iterative Decoding Jan 1, 2024 Quantization
— Unverified 0PikeLPN: Mitigating Overlooked Inefficiencies of Low-Precision Neural Networks Jan 1, 2024 Quantization
— Unverified 0Data-Free Quantization via Pseudo-label Filtering Jan 1, 2024 Data Free Quantization Model Compression
— Unverified 0HQ-VAE: Hierarchical Discrete Representation Learning with Variational Bayes Dec 31, 2023 Quantization Representation Learning
— Unverified 0Compact Neural Graphics Primitives with Learned Hash Probing Dec 28, 2023 Quantization
— Unverified 0FALCON: Feature-Label Constrained Graph Net Collapse for Memory Efficient GNNs Dec 27, 2023 Benchmarking GPU
Code Code Available 0A-SDM: Accelerating Stable Diffusion through Redundancy Removal and Performance Optimization Dec 24, 2023 Quantization
— Unverified 0Efficient Asynchronous Federated Learning with Sparsification and Quantization Dec 23, 2023 Federated Learning Quantization
— Unverified 0Hardware-Aware DNN Compression via Diverse Pruning and Mixed-Precision Quantization Dec 23, 2023 Quantization Reinforcement Learning (RL)
— Unverified 0Cross-Layer Optimization for Fault-Tolerant Deep Learning Dec 21, 2023 Bayesian Optimization Deep Learning
— Unverified 0Fed-QSSL: A Framework for Personalized Federated Learning under Bitwidth and Data Heterogeneity Dec 20, 2023 Federated Learning Personalized Federated Learning
Code Code Available 0Towards Efficient Verification of Quantized Neural Networks Dec 20, 2023 Heuristic Search Quantization
Code Code Available 0Find the Lady: Permutation and Re-Synchronization of Deep Neural Networks Dec 19, 2023 Quantization
Code Code Available 0SimQ-NAS: Simultaneous Quantization Policy and Neural Architecture Search Dec 19, 2023 Neural Architecture Search Quantization
— Unverified 0Power-Efficient Sampling Dec 18, 2023 Quantization
— Unverified 0Quantized Decoder in Learned Image Compression for Deterministic Reconstruction Dec 18, 2023 Decoder Image Compression
— Unverified 0Post-Training Quantization for Re-parameterization via Coarse & Fine Weight Splitting Dec 17, 2023 Quantization
Code Code Available 0SPT: Fine-Tuning Transformer-based Language Models Efficiently with Sparsification Dec 16, 2023 Quantization
Code Code Available 0IQNet: Image Quality Assessment Guided Just Noticeable Difference Prefiltering For Versatile Video Coding Dec 15, 2023 Image Quality Assessment Quantization
— Unverified 0Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference Dec 15, 2023 Quantization speech-recognition
Code Code Available 0Design Space Exploration of Low-Bit Quantized Neural Networks for Visual Place Recognition Dec 14, 2023 Quantization Visual Place Recognition
— Unverified 0USM-Lite: Quantization and Sparsity Aware Fine-tuning for Speech Recognition with Universal Speech Models Dec 13, 2023 Automatic Speech Recognition Automatic Speech Recognition (ASR)
— Unverified 0CBQ: Cross-Block Quantization for Large Language Models Dec 13, 2023 GPU Quantization
— Unverified 0When Bio-Inspired Computing meets Deep Learning: Low-Latency, Accurate, & Energy-Efficient Spiking Neural Networks from Artificial Neural Networks Dec 12, 2023 Quantization
— Unverified 0Expand-and-Quantize: Unsupervised Semantic Segmentation Using High-Dimensional Space and Product Quantization Dec 12, 2023 Clustering Dimensionality Reduction
— Unverified 0IDKM: Memory Efficient Neural Network Quantization via Implicit, Differentiable k-Means Dec 12, 2023 Efficient Neural Network Quantization
— Unverified 0Decoupling Meta-Reinforcement Learning with Gaussian Task Contexts and Skills Dec 11, 2023 continuous-control Continuous Control
Code Code Available 0FP8-BERT: Post-Training Quantization for Transformer Dec 10, 2023 Quantization
— Unverified 0QMGeo: Differentially Private Federated Learning via Stochastic Quantization with Mixed Truncated Geometric Distribution Dec 10, 2023 Federated Learning Quantization
— Unverified 0Neural Architecture Codesign for Fast Bragg Peak Analysis Dec 10, 2023 AutoML Model Compression
— Unverified 0Agile-Quant: Activation-Guided Quantization for Faster Inference of LLMs on the Edge Dec 9, 2023 Language Modeling Language Modelling
Code Code Available 0Efficient Quantization Strategies for Latent Diffusion Models Dec 9, 2023 Image Generation Quantization
— Unverified 0Automotive Radar Sensing with Sparse Linear Arrays Using One-Bit Hankel Matrix Completion Dec 9, 2023 Matrix Completion Quantization
— Unverified 0Understanding the Effect of Model Compression on Social Bias in Large Language Models Dec 9, 2023 Knowledge Distillation Model Compression
Code Code Available 0An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis Dec 8, 2023 Benchmarking Quantization
— Unverified 0GenQ: Quantization in Low Data Regimes with Generative Synthetic Data Dec 7, 2023 Computational Efficiency Quantization
Code Code Available 0Rate-splitting Multiple Access for Hierarchical HAP-LAP Networks under Limited Fronthaul Dec 7, 2023 Quantization
— Unverified 0Enhancing Kinship Verification through Multiscale Retinex and Combined Deep-Shallow features Dec 6, 2023 Kinship Verification Quantization
— Unverified 0All Rivers Run to the Sea: Private Learning with Asymmetric Flows Dec 5, 2023 All Quantization
— Unverified 0Unified learning-based lossy and lossless JPEG recompression Dec 5, 2023 Image Compression Quantization
— Unverified 0PLUM: Improving Inference Efficiency By Leveraging Repetition-Sparsity Trade-Off Dec 4, 2023 Binarization Computational Efficiency
Code Code Available 0Low-Precision Mixed-Computation Models for Inference on Edge Dec 3, 2023 Quantization
— Unverified 0QuantAttack: Exploiting Dynamic Quantization to Attack Vision Transformers Dec 3, 2023 Quantization
Code Code Available 0