Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets Feb 29, 2024 Image Compression Quantization
— Unverified 0T3DNet: Compressing Point Cloud Models for Lightweight 3D Recognition Feb 29, 2024 Autonomous Driving Quantization
— Unverified 0FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization Feb 28, 2024 GPU Quantization
— Unverified 0No Token Left Behind: Reliable KV Cache Compression via Importance-Aware Mixed Precision Quantization Feb 28, 2024 Quantization
— Unverified 0Ef-QuantFace: Streamlined Face Recognition with Small Data and Low-Bit Precision Feb 28, 2024 Face Recognition Quantization
— Unverified 0Inpainting Computational Fluid Dynamics with Deep Learning Feb 27, 2024 Deep Learning Quantization
— Unverified 0Neural Video Compression with Feature Modulation Feb 27, 2024 Blocking Quantization
— Unverified 0Rethinking Mutual Information for Language Conditioned Skill Discovery on Imitation Learning Feb 27, 2024 Imitation Learning Quantization
— Unverified 0Adaptive quantization with mixed-precision based on low-cost proxy Feb 27, 2024 Neural Architecture Search Quantization
— Unverified 0SPC-NeRF: Spatial Predictive Compression for Voxel Based Radiance Field Feb 26, 2024 Image Compression NeRF
— Unverified 0Distortion-Controlled Dithering with Reduced Recompression Rate Feb 26, 2024 Data Compression Image Compression
— Unverified 0A Comprehensive Evaluation of Quantization Strategies for Large Language Models Feb 26, 2024 Language Modeling Language Modelling
Code Code Available 0Data-freeWeight Compress and Denoise for Large Language Models Feb 26, 2024 GPU Quantization
— Unverified 0EncodingNet: A Novel Encoding-based MAC Design for Efficient Neural Network Acceleration Feb 25, 2024 Efficient Neural Network image-classification
Code Code Available 0Towards Accurate Post-training Quantization for Reparameterized Models Feb 25, 2024 Quantization
Code Code Available 0GPTVQ: The Blessing of Dimensionality for LLM Quantization Feb 23, 2024 CPU Quantization
— Unverified 0Text me the data: Generating Ground Pressure Sequence from Textual Descriptions for HAR Feb 22, 2024 Activity Recognition Human Activity Recognition
— Unverified 0On the Arrow of Inference Feb 22, 2024 counterfactual Counterfactual Reasoning
— Unverified 0FinGPT-HPC: Efficient Pretraining and Finetuning Large Language Models for Financial Applications with High-Performance Computing Feb 21, 2024 GPU Model Compression
— Unverified 0APTQ: Attention-aware Post-Training Mixed-Precision Quantization for Large Language Models Feb 21, 2024 Quantization
— Unverified 0In-Distribution Consistency Regularization Improves the Generalization of Quantization-Aware Training Feb 21, 2024 Knowledge Distillation Quantization
— Unverified 0Tiny Reinforcement Learning for Quadruped Locomotion using Decision Transformers Feb 20, 2024 Imitation Learning Quantization
Code Code Available 0Towards a tailored mixed-precision sub-8-bit quantization scheme for Gated Recurrent Units using Genetic Algorithms Feb 19, 2024 Model Compression Quantization
— Unverified 0Is It a Free Lunch for Removing Outliers during Pretraining? Feb 19, 2024 Quantization
— Unverified 0WKVQuant: Quantizing Weight and Key/Value Cache for Large Language Models Gains More Feb 19, 2024 Quantization Text Generation
— Unverified 0DB-LLM: Accurate Dual-Binarization for Efficient LLMs Feb 19, 2024 Binarization Computational Efficiency
— Unverified 0QDyLoRA: Quantized Dynamic Low-Rank Adaptation for Efficient Large Language Model Tuning Feb 16, 2024 GPU Language Modeling
— Unverified 0One-Bit Quantization and Sparsification for Multiclass Linear Classification with Strong Regularization Feb 16, 2024 Classification Quantization
— Unverified 0Quantized Embedding Vectors for Controllable Diffusion Language Models Feb 15, 2024 Language Modeling Language Modelling
— Unverified 0Model Compression and Efficient Inference for Large Language Models: A Survey Feb 15, 2024 Knowledge Distillation Model Compression
— Unverified 0Multi-Excitation Projective Simulation with a Many-Body Physics Inspired Inductive Bias Feb 15, 2024 Explainable artificial intelligence Explainable Artificial Intelligence (XAI)
Code Code Available 0Lightweight Deep Learning Based Channel Estimation for Extremely Large-Scale Massive MIMO Systems Feb 14, 2024 Quantization
Code Code Available 0Towards Next-Level Post-Training Quantization of Hyper-Scale Transformers Feb 14, 2024 Quantization
— Unverified 0Rate-Splitting Multiple Access for Quantized ISAC LEO Satellite Systems: A Max-Min Fair Energy-Efficient Beam Design Feb 14, 2024 Fairness ISAC
— Unverified 0BdSLW60: A Word-Level Bangla Sign Language Dataset Feb 13, 2024 Benchmarking Gesture Recognition
Code Code Available 0TeMPO: Efficient Time-Multiplexed Dynamic Photonic Tensor Core for Edge AI with Compact Slow-Light Electro-Optic Modulator Feb 12, 2024 Quantization
— Unverified 0Outlier-Aware Training for Low-Bit Quantization of Structural Re-Parameterized Networks Feb 11, 2024 Quantization
— Unverified 0LiRank: Industrial Large Scale Ranking Models at LinkedIn Feb 10, 2024 Click-Through Rate Prediction Quantization
— Unverified 0On Leaky-Integrate-and Fire as Spike-Train-Quantization Operator on Dirac-Superimposed Continuous-Time Signals Feb 10, 2024 Quantization
— Unverified 0RQP-SGD: Differential Private Machine Learning through Noisy SGD and Randomized Quantization Feb 9, 2024 Privacy Preserving Quantization
— Unverified 0Sparse-VQ Transformer: An FFN-Free Framework with Vector Quantization for Enhanced Time Series Forecasting Feb 8, 2024 Computational Efficiency Multivariate Time Series Forecasting
— Unverified 0RepQuant: Towards Accurate Post-Training Quantization of Large Transformer Models via Scale Reparameterization Feb 8, 2024 Quantization
— Unverified 0L4Q: Parameter Efficient Quantization-Aware Fine-Tuning on Large Language Models Feb 7, 2024 Few-Shot Learning In-Context Learning
— Unverified 0Majority Kernels: An Approach to Leverage Big Model Dynamics for Efficient Small Model Training Feb 7, 2024 Combinatorial Optimization Computational Efficiency
— Unverified 0Fed-CVLC: Compressing Federated Learning Communications with Variable-Length Codes Feb 6, 2024 Federated Learning Model Compression
— Unverified 0Multimodal Unsupervised Domain Generalization by Retrieving Across the Modality Gap Feb 6, 2024 Domain Generalization Quantization
Code Code Available 0A Survey on Transformer Compression Feb 5, 2024 Knowledge Distillation Mamba
— Unverified 0Optimal and Near-Optimal Adaptive Vector Quantization Feb 5, 2024 Quantization
— Unverified 0Quantized Approximately Orthogonal Recurrent Neural Networks Feb 5, 2024 Quantization Time Series
— Unverified 0FoldToken: Learning Protein Language via Vector Quantization and Beyond Feb 4, 2024 Quantization
— Unverified 0