SLiM: One-shot Quantization and Sparsity with Low-rank Approximation for LLM Weight Compression Oct 12, 2024 Model Compression Natural Language Understanding
Code Code Available 1PrivQuant: Communication-Efficient Private Inference with Quantized Network/Protocol Co-Optimization Oct 12, 2024 Quantization
— Unverified 0QEFT: Quantization for Efficient Fine-Tuning of LLMs Oct 11, 2024 parameter-efficient fine-tuning Quantization
Code Code Available 0ZipVL: Efficient Large Vision-Language Models with Dynamic Token Sparsification Oct 11, 2024 MME Quantization
— Unverified 0DeltaDQ: Ultra-High Delta Compression for Fine-Tuned LLMs via Group-wise Dropout and Separate Quantization Oct 11, 2024 Diversity Quantization
— Unverified 0ACCEPT: Adaptive Codebook for Composite and Efficient Prompt Tuning Oct 10, 2024 Natural Language Understanding parameter-efficient fine-tuning
Code Code Available 0M^2-ViT: Accelerating Hybrid Vision Transformers with Two-Level Mixed Quantization Oct 10, 2024 Efficient ViTs Quantization
— Unverified 0DART: Denoising Autoregressive Transformer for Scalable Text-to-Image Generation Oct 10, 2024 Denoising Image Generation
— Unverified 0Scalable Representation Learning for Multimodal Tabular Transactions Oct 10, 2024 Decoder Quantization
— Unverified 0Q-VLM: Post-training Quantization for Large Vision-Language Models Oct 10, 2024 Language Modeling Language Modelling
Code Code Available 2MotionAura: Generating High-Quality and Motion Consistent Videos using Discrete Diffusion Oct 10, 2024 Denoising parameter-efficient fine-tuning
Code Code Available 0Hallo2: Long-Duration and High-Resolution Audio-Driven Portrait Image Animation Oct 10, 2024 4k Image Animation
Code Code Available 7CrossQuant: A Post-Training Quantization Method with Smaller Quantization Kernel for Precise Large Language Model Compression Oct 10, 2024 Language Modeling Language Modelling
— Unverified 0QuAILoRA: Quantization-Aware Initialization for LoRA Oct 9, 2024 Causal Language Modeling GPU
— Unverified 0Perceptual Quality Assessment of Trisoup-Lifting Encoded 3D Point Clouds Oct 9, 2024 Point Cloud Quality Assessment Quantization
Code Code Available 0Scaling Laws for Mixed quantization in Large Language Models Oct 9, 2024 Quantization
— Unverified 0JPEG Inspired Deep Learning Oct 9, 2024 Deep Learning Fine-Grained Image Classification
Code Code Available 0Gesture2Text: A Generalizable Decoder for Word-Gesture Keyboards in XR Through Trajectory Coarse Discretization and Pre-training Oct 8, 2024 Decoder Quantization
— Unverified 0QERA: an Analytical Framework for Quantization Error Reconstruction Oct 8, 2024 parameter-efficient fine-tuning Quantization
— Unverified 0Accelerating Error Correction Code Transformers Oct 8, 2024 Quantization
Code Code Available 0QT-DoG: Quantization-aware Training for Domain Generalization Oct 8, 2024 Domain Generalization Model Compression
Code Code Available 1MC-MoE: Mixture Compressor for Mixture-of-Experts LLMs Gains More Oct 8, 2024 Mixture-of-Experts Quantization
Code Code Available 2Covering Numbers for Deep ReLU Networks with Applications to Function Approximation and Nonparametric Regression Oct 8, 2024 Quantization regression
— Unverified 0Restructuring Vector Quantization with the Rotation Trick Oct 8, 2024 Quantization
Code Code Available 4Variable Bitrate Residual Vector Quantization for Audio Coding Oct 8, 2024 Audio Compression Quantization
— Unverified 0Integrated Encoding and Quantization to Enhance Quanvolutional Neural Networks Oct 8, 2024 Quantization Quantum Machine Learning
Code Code Available 0Designing a Classifier for Active Fire Detection from Multispectral Satellite Imagery Using Neural Architecture Search Oct 7, 2024 Fire Detection Neural Architecture Search
— Unverified 0Variable Resolution Pixel Quantization for Low Power Machine Vision Application on Edge Oct 7, 2024 Edge-computing image-classification
— Unverified 0PrefixQuant: Eliminating Outliers by Prefixed Tokens for Large Language Models Quantization Oct 7, 2024 Common Sense Reasoning Quantization
Code Code Available 2Continuous Approximations for Improving Quantization Aware Training of LLMs Oct 6, 2024 MMLU Model Compression
— Unverified 0HALL-E: Hierarchical Neural Codec Language Model for Minute-Long Zero-Shot Text-to-Speech Synthesis Oct 6, 2024 Language Modeling Language Modelling
— Unverified 0PalmBench: A Comprehensive Benchmark of Compressed Large Language Models on Mobile Platforms Oct 5, 2024 Benchmarking GPU
— Unverified 0MIMO Detection with Spatial Sigma-Delta ADCs: A Variational Bayesian Approach Oct 4, 2024 Quantization
— Unverified 0Resource-aware Mixed-precision Quantization for Enhancing Deployability of Transformers for Time-series Forecasting on Embedded FPGAs Oct 4, 2024 Neural Architecture Search Quantization
— Unverified 0Generative Semantic Communication for Text-to-Speech Synthesis Oct 4, 2024 Quantization Semantic Communication
— Unverified 0ARB-LLM: Alternating Refined Binarizations for Large Language Models Oct 4, 2024 Binarization Quantization
Code Code Available 1Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization Oct 4, 2024 Deep Reinforcement Learning Quantization
Code Code Available 1EXAQ: Exponent Aware Quantization For LLMs Acceleration Oct 4, 2024 Quantization Question Answering
Code Code Available 0Lightweight Diffusion Models for Resource-Constrained Semantic Communication Oct 3, 2024 Quantization Semantic Communication
Code Code Available 1Overcoming Representation Bias in Fairness-Aware data Repair using Optimal Transport Oct 3, 2024 Attribute Fairness
— Unverified 0SageAttention: Accurate 8-Bit Attention for Plug-and-play Inference Acceleration Oct 3, 2024 Image Generation Quantization
Code Code Available 7SEAL: SEmantic-Augmented Imitation Learning via Language Model Oct 3, 2024 Decision Making Imitation Learning
— Unverified 0Remember and Recall: Associative-Memory-based Trajectory Prediction Oct 3, 2024 Autonomous Driving Computational Efficiency
— Unverified 0A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation Oct 2, 2024 Image Generation Quantization
Code Code Available 2Restorative Speech Enhancement: A Progressive Approach Using SE and Codec Modules Oct 2, 2024 Quantization Speech Enhancement
— Unverified 0Locret: Enhancing Eviction in Long-Context LLM Inference with Trained Retaining Heads on Consumer-Grade Devices Oct 2, 2024 GPU Language Modeling
Code Code Available 1ImageFolder: Autoregressive Image Generation with Folded Tokens Oct 2, 2024 Image Generation Image Reconstruction
Code Code Available 3Getting Free Bits Back from Rotational Symmetries in LLMs Oct 2, 2024 Quantization
— Unverified 0Compressing Recurrent Neural Networks for FPGA-accelerated Implementation in Fluorescence Lifetime Imaging Oct 1, 2024 Computational Efficiency Knowledge Distillation
— Unverified 0Deep activity propagation via weight initialization in spiking neural networks Oct 1, 2024 Quantization
— Unverified 0